function to create a data_frame object

Share:

Description

function to create a data_frame object. The data_frame object is an object that is held on disk. It is written to a folder path on disk where the data is written to in blocks or chunks. The data is written in binary format using a C++ function in purely numerical data and a mapping to the table is held in a ".meta_data" file in the folder. The table object accomodates numeric, factor, and character (converted to factor).

Usage

1
2
3
4
data_frame(data = stop("data must be supplied"),
  chunkSize = stop("chunkSize must be specified, a good number is 50000"),
  path = stop("path must be specified"), nCores = parallel::detectCores(),
  ...)

Arguments

data

data.frame object to be converted into a data_frame object

chunkSize

number of rows to be used in each chunk

path

character to folder where the object will be created

nCores

the number of cores to use defaults to parallel::detectCores()

...

not currently used.

Details

Creates a data_frame object

Examples

1
2
3
4
5
irisA <- data_frame(iris[1:75,], 10, "irisA", nCores = 1)
irisA$append(iris[76:150,])
irisA$head()
irisA$tail(10)
irisA$delete(); rm(irisA)

Want to suggest features or report bugs for rdrr.io? Use the GitHub issue tracker.