concat.split: Split Concatenated Cells in a Dataset
In mrdwab/splitstackshape: Stack and Reshape Datasets After Splitting Concatenated Values

Description Usage Arguments Details Note Author(s) See Also Examples

The concat.split function takes a column with multiple values, splits the values into a list or into separate columns, and returns a new data.frame or data.table.

1
2
3

concat.split(data, split.col, sep = ",", structure = "compact",
  mode = NULL, type = NULL, drop = FALSE, fixed = FALSE,
  fill = NA, ...)

`data`	The source `data.frame` or `data.table`.
`split.col`	The variable that needs to be split; can be specified either by the column number or the variable name.
`sep`	The character separating each value (defaults to `","`).
`structure`	Can be either `"compact"`, `"expanded"`, or `list`. Defaults to `"compact"`. See Details.
`mode`	Can be either `"binary"` or `"value"` (where `"binary"` is default and it recodes values to 1 or `NA`, like Boolean data, but without assuming 0 when data is not available). This setting only applies when `structure = "expanded"`; a warning message will be issued if used with other structures.
`type`	Can be either `"numeric"` or `"character"` (where `"numeric"` is default). This setting only applies when `structure = "expanded"`; a warning message will be issued if used with other structures.
`drop`	Logical (whether to remove the original variable from the output or not). Defaults to `FALSE`.
`fixed`	Is the input for the `sep` value fixed, or a regular expression? See Details.
`fill`	The "fill" value for missing values when `structure = "expanded"`. Defaults to `NA`.
`...`	Additional arguments to `cSplit()`.

structure

"compact" creates as many columns as the maximum length of the resulting split. This is the most useful general-case application of this function.
When the input is numeric, "expanded" creates as many columns as the maximum value of the input data. This is most useful when converting to mode = "binary".
"list" creates a single new column that is structurally a list within a data.frame or data.table.

fixed

When structure = "expanded" or structure = "list", it is possible to supply a a regular expression containing the characters to split on. For example, to split on ",", ";", or "|", you can set sep = ",|;|\|" or sep = "[,;|]", and fixed = FALSE to split on any of those characters.

This is more of a "legacy" or "convenience" wrapper function encompassing the features available in the separated functions of cSplit(), cSplit_l(), and cSplit_e().

Ananda Mahto

cSplit(), cSplit_l(), cSplit_e()

## Load some data
temp <- head(concat.test)

# Split up the second column, selecting by column number
concat.split(temp, 2)

# ... or by name, and drop the offensive first column
concat.split(temp, "Likes", drop = TRUE)

# The "Hates" column uses a different separator
concat.split(temp, "Hates", sep = ";", drop = TRUE)

## Not run: 
# You'll get a warning here, when trying to retain the original values
concat.split(temp, 2, mode = "value", drop = TRUE)

## End(Not run)

# Try again. Notice the differing number of resulting columns
concat.split(temp, 2, structure = "expanded",
mode = "value", type = "numeric", drop = TRUE)

# Let's try splitting some strings... Same syntax
concat.split(temp, 3, drop = TRUE)

# Strings can also be split to binary representations
concat.split(temp, 3, structure = "expanded",
type = "character", fill = 0, drop = TRUE)

# Split up the "Likes column" into a list variable; retain original column
head(concat.split(concat.test, 2, structure = "list", drop = FALSE))

# View the structure of the output to verify
# that the new column is a list; note the
# difference between "Likes" and "Likes_list".
str(concat.split(temp, 2, structure = "list", drop = FALSE))

mrdwab/splitstackshape documentation built on May 23, 2019, 7:16 a.m.

mrdwab/splitstackshape index

README.md

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

mrdwab/splitstackshape
Stack and Reshape Datasets After Splitting Concatenated Values

concat.split: Split Concatenated Cells in a Dataset
In mrdwab/splitstackshape: Stack and Reshape Datasets After Splitting Concatenated Values

Description

Usage

Arguments

Details

Note

Author(s)

See Also

Examples

Related to concat.split in mrdwab/splitstackshape...

R Package Documentation

Browse R Packages

We want your feedback!

mrdwab/splitstackshape Stack and Reshape Datasets After Splitting Concatenated Values

concat.split: Split Concatenated Cells in a Dataset In mrdwab/splitstackshape: Stack and Reshape Datasets After Splitting Concatenated Values

Description

Usage

Arguments

Details

Note

Author(s)

See Also

Examples

Related to concat.split in mrdwab/splitstackshape...

R Package Documentation

Browse R Packages

We want your feedback!

mrdwab/splitstackshape
Stack and Reshape Datasets After Splitting Concatenated Values

concat.split: Split Concatenated Cells in a Dataset
In mrdwab/splitstackshape: Stack and Reshape Datasets After Splitting Concatenated Values