group_by: Group data
In jeblundell/multiplyr: Data Manipulation with Parallelism and Shared Memory Matrices

Description Usage Arguments Details Value See Also Examples

Groups data by specified columns: further operations then work within those groups

1
2
3

group_by(.self, ..., auto_partition = NULL)

group_by_(.self, ..., .dots, .cols = NULL, auto_partition = NULL)

`.self`	Data frame
`...`	Additional parameters
`auto_partition`	Re-partition across cluster after operation
`.dots`	Workaround for non-standard evaluation
`.cols`	Columns to group by (used internally)

Many data analysis problems require working with particular combinations of data. For example, finding the average sales for a given day of the week could be achieved with group_by(day) and summarise(sales = mean(sales). This would result in a data frame with 7 rows (1 for each group) with the average sales stored in the sales column.

Multiple grouping variables may be specified, separated by columns. The above example could be extended to group by month as well as weekday, e.g. group_by(month, day). The resulting data frame would then have 12 blocks of 7 (84 rows) with an average for each week day in that month provided the same way as above.

Data frame

Other row manipulations: arrange, distinct, filter, slice

1
2
3

dat <- Multiplyr (x=1:100, G=rep(c("A", "B", "C", "D"), each=25))
dat %>% group_by (G) %>% summarise (N=length(x))
dat %>% shutdown()

jeblundell/multiplyr documentation built on May 19, 2019, 12:39 a.m.

jeblundell/multiplyr index

README.md

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

jeblundell/multiplyr
Data Manipulation with Parallelism and Shared Memory Matrices

group_by: Group data
In jeblundell/multiplyr: Data Manipulation with Parallelism and Shared Memory Matrices

Description

Usage

Arguments

Details

Value

See Also

Examples

Related to group_by in jeblundell/multiplyr...

R Package Documentation

Browse R Packages

We want your feedback!

jeblundell/multiplyr Data Manipulation with Parallelism and Shared Memory Matrices

group_by: Group data In jeblundell/multiplyr: Data Manipulation with Parallelism and Shared Memory Matrices

Description

Usage

Arguments

Details

Value

See Also

Examples

Related to group_by in jeblundell/multiplyr...

R Package Documentation

Browse R Packages

We want your feedback!

jeblundell/multiplyr
Data Manipulation with Parallelism and Shared Memory Matrices

group_by: Group data
In jeblundell/multiplyr: Data Manipulation with Parallelism and Shared Memory Matrices