chunk_group_by: #' @export #' @importFrom dplyr add_count #' @rdname...

chunk_summarizeR Documentation

#' @export #' @importFrom dplyr add_count #' @rdname dplyr_verbs add_count.disk.frame <- create_chunk_mapper(dplyr::add_count) #' @export #' @importFrom dplyr add_tally #' @rdname dplyr_verbs add_tally.disk.frame <- create_chunk_mapper(dplyr::add_tally)

Description

The disk.frame group by operation perform group WITHIN each chunk. This is often used for performance reasons. If the user wishes to perform group-by, they may choose to use the 'hard_group_by' function which is expensive as it reorganizes the chunks by the shard key.

Usage

chunk_summarize(.data, ...)

chunk_summarise(.data, ...)

chunk_group_by(.data, ...)

chunk_ungroup(.data, ...)

Arguments

.data

a disk.frame

...

passed to dplyr::group_by

See Also

hard_group_by group_by


disk.frame documentation built on Aug. 24, 2023, 5:09 p.m.