group_by: A function to parse the summarize function

summarise.grouped_disk.frameR Documentation

A function to parse the summarize function

Description

The disk.frame group by operation perform group WITHIN each chunk. This is often used for performance reasons. If the user wishes to perform group-by, they may choose to use the 'hard_group_by' function which is expensive as it reorganizes the chunks by the shard key.

Usage

## S3 method for class 'grouped_disk.frame'
summarise(.data, ...)

## S3 method for class 'grouped_disk.frame'
summarize(.data, ...)

## S3 method for class 'disk.frame'
group_by(
  .data,
  ...,
  .add = FALSE,
  .drop = stop("disk.frame does not support `.drop` in `group_by` at this stage")
)

## S3 method for class 'disk.frame'
summarize(.data, ...)

## S3 method for class 'disk.frame'
summarise(.data, ...)

Arguments

.data

a disk.frame

...

same as the dplyr::group_by

.add

from dplyr

.drop

from dplyr

See Also

hard_group_by


disk.frame documentation built on Aug. 24, 2023, 5:09 p.m.