| summarise.grouped_disk.frame {disk.frame} | R Documentation |
The disk.frame group by operation perform group WITHIN each chunk. This is often used for performance reasons. If the user wishes to perform group-by, they may choose to use the 'hard_group_by' function which is expensive as it reorganizes the chunks by the shard key.
## S3 method for class 'grouped_disk.frame' summarise(.data, ...) ## S3 method for class 'grouped_disk.frame' summarize(.data, ...) ## S3 method for class 'disk.frame' group_by(.data, ..., add = FALSE, .drop = dplyr::group_by_drop_default(.data)) ## S3 method for class 'disk.frame' summarize(.data, ...) ## S3 method for class 'disk.frame' summarise(.data, ...)
.data |
a disk.frame |
... |
same as the dplyr::group_by |
add |
from dplyr |
.drop |
from dplyr |
hard_group_by