collect.disk.frame | R Documentation |
Bring the disk.frame into RAM by loading the data and running all lazy operations as data.table/data.frame or as a list
Bring the disk.frame into RAM by loading the data and running all lazy operations as data.table/data.frame or as a list
## S3 method for class 'disk.frame' collect(x, ..., parallel = !is.null(attr(x, "recordings"))) collect_list( x, simplify = FALSE, parallel = !is.null(attr(x, "recordings")), ... ) ## S3 method for class 'summarized_disk.frame' collect(x, ..., parallel = !is.null(attr(x, "recordings")))
x |
a disk.frame |
... |
not used |
parallel |
if TRUE the collection is performed in parallel. By default if there are delayed/lazy steps then it will be parallel, otherwise it will not be in parallel. This is because parallel requires transferring data from background R session to the current R session and if there is no computation then it's better to avoid transferring data between session, hence parallel = FALSE is a better choice |
simplify |
Should the result be simplified to array |
collect return a data.frame/data.table
collect_list returns a list
collect return a data.frame/data.table
cars.df = as.disk.frame(cars) # use collect to bring the data into RAM as a data.table/data.frame collect(cars.df) # clean up delete(cars.df) cars.df = as.disk.frame(cars) # returns the result as a list collect_list(cmap(cars.df, ~1)) # clean up delete(cars.df) cars.df = as.disk.frame(cars) # use collect to bring the data into RAM as a data.table/data.frame collect(cars.df) # clean up delete(cars.df)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.