splt | R Documentation |
Divides data into groups by a wide range of methods. Splits data by these groups.
Wraps group() with split().
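To make the wrapping concrete, here is a rough sketch (not the package's internal code) that builds a grouping factor with group_factor(), passes it to base split(), and compares the resulting group sizes with those from splt(). The data frame `df` and the helper `group_sizes()` are made up for illustration.

```r
library(groupdata2)

# Illustrative data (hypothetical, not from the package)
df <- data.frame(
  x = 1:12,
  y = rep(c("a", "b", "c"), 4)
)

# Hypothetical helper: number of rows in each list element
group_sizes <- function(lst) unname(vapply(lst, nrow, integer(1)))

# Manual route: create the grouping factor, then split by it
grouping <- group_factor(df, n = 3, method = "n_dist")
manual <- split(df, grouping)

# Wrapped route: let splt() do both steps
wrapped <- splt(df, n = 3, method = "n_dist")

# Compare the group sizes from the two routes
group_sizes(manual)
group_sizes(wrapped)
```

The list elements may differ in detail (for example, in whether a grouping column is attached), so the comparison is limited to group sizes.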
splt(
  data,
  n,
  method = "n_dist",
  starts_col = NULL,
  force_equal = FALSE,
  allow_zero = FALSE,
  descending = FALSE,
  randomize = FALSE,
  remove_missing_starts = FALSE
)
data |
data.frame or vector. When a grouped data.frame, the function is applied group-wise. |
n |
Depends on `method`. Number of groups (default), group size, list of group sizes,
list of group starts, number of data points between group members,
step size or prime number to start at. See `method`. Passed as whole number(s) and/or percentage(s). |
method |
Note: examples are sizes of the generated groups
based on a vector with
greedy: Divides up the data greedily given a specified group size.
n_dist (default): Divides the data into a specified number of groups and distributes excess data points across groups.
n_fill: Divides the data into a specified number of groups and fills up groups with excess data points from the beginning.
n_last: Divides the data into a specified number of groups. It finds the most equal group sizes possible, using all data points. Only the last group is able to differ in size.
n_rand: Divides the data into a specified number of groups. Excess data points are placed randomly in groups (max. 1 per group).
l_sizes: Divides up the data by a list of group sizes.
l_starts: Starts new groups at specified values in the `starts_col` vector.
To skip: If passing
every: Combines every `n`th data point into a group.
staircase: Uses step size to divide up the data. Group size increases with 1 step for every group, until there is no more data.
primes: Uses prime numbers as group sizes. Group size increases to the next prime number until there is no more data. |
starts_col |
Name of column with values to match in method "l_starts". |
force_equal |
Create equal groups by discarding excess data points. Implementation varies between methods. (Logical) |
allow_zero |
Whether `n` can be passed as 0. (Logical) |
descending |
Change the direction of the method. (Not fully implemented) (Logical) |
randomize |
Randomize the grouping factor. (Logical) |
remove_missing_starts |
Recursively remove elements from the list of starts that are not found. For method "l_starts" only. (Logical) |
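As a rough illustration of how `method`, `n` and `force_equal` interact, the sketch below splits the same data frame with several methods and collects the group sizes. The 25-row data frame and the helper `sizes_by()` are assumptions made for this example, and the exact sizes may vary between package versions, so none are stated here.

```r
library(groupdata2)

# Illustrative data: 25 rows (hypothetical)
df <- data.frame(id = 1:25, value = (1:25) * 2)

# Hypothetical helper: split `df` and return the number of rows per group
sizes_by <- function(n, method, ...) {
  unname(vapply(splt(df, n = n, method = method, ...), nrow, integer(1)))
}

# `n` is a number of groups for the n_* methods
sizes_n_dist <- sizes_by(4, "n_dist")
sizes_n_fill <- sizes_by(4, "n_fill")
sizes_n_last <- sizes_by(4, "n_last")

# `n` is a group size for "greedy" and a step size for "staircase"
sizes_greedy <- sizes_by(10, "greedy")
sizes_stair <- sizes_by(2, "staircase")

# force_equal discards excess data points to equalize the groups
sizes_equal <- sizes_by(4, "n_dist", force_equal = TRUE)

# Print the sizes side by side
list(
  n_dist = sizes_n_dist,
  n_fill = sizes_n_fill,
  n_last = sizes_n_last,
  greedy = sizes_greedy,
  staircase = sizes_stair,
  n_dist_equal = sizes_equal
)
```

Comparing the printed vectors shows where each method places excess data points and how many rows `force_equal = TRUE` drops.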
list of the split `data`.
N.B. If `data` is a grouped data.frame, there's an outer list for each group. The names are based on the group indices (see dplyr::group_indices()).
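The nested return value for grouped input can be inspected with a short sketch like the one below. The data frame is made up for illustration, and the example assumes an installed version of groupdata2 that accepts grouped data frames as described above.

```r
library(groupdata2)
library(dplyr)

# Illustrative data (hypothetical): three species, four rows each
df <- data.frame(
  x = 1:12,
  species = factor(rep(c("cat", "pig", "human"), 4))
)

# Group by species first, then split each group into 2 parts
nested <- df %>%
  group_by(species) %>%
  splt(n = 2, method = "n_dist")

# Outer list: one element per dplyr group
length(nested)
names(nested)

# Inner lists: the splits within each group
str(nested, max.level = 2, give.attr = FALSE)
```

Because the outer names come from group indices rather than group labels, mapping them back to `species` is left to the caller.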
Ludvig Renbo Olsen, r-pkgs@ludvigolsen.dk
Other grouping functions: all_groups_identical(), collapse_groups_by, collapse_groups(), fold(), group_factor(), group(), partition()
# Attach packages
library(groupdata2)
library(dplyr)
# Create data frame
df <- data.frame(
  "x" = c(1:12),
  "species" = factor(rep(c("cat", "pig", "human"), 4)),
  "age" = sample(c(1:100), 12)
)
# Using splt()
df_list <- splt(df, 5, method = "n_dist")