parquet_options | R Documentation |
Create a list of nanoparquet options.
parquet_options(
class = getOption("nanoparquet.class", "tbl"),
compression_level = getOption("nanoparquet.compression_level", NA_integer_),
keep_row_groups = FALSE,
num_rows_per_row_group = getOption("nanoparquet.num_rows_per_row_group", 10000000L),
use_arrow_metadata = getOption("nanoparquet.use_arrow_metadata", TRUE),
write_arrow_metadata = getOption("nanoparquet.write_arrow_metadata", TRUE),
write_data_page_version = getOption("nanoparquet.write_data_page_version", 1L),
write_minmax_values = getOption("nanoparquet.write_minmax_values", TRUE)
)
class |
The extra class or classes to add to data frames created
in |
compression_level |
The compression level in
|
keep_row_groups |
This option is used when appending to a Parquet
file with |
num_rows_per_row_group |
The number of rows to put into a row group, if row groups are not specified explicitly. It should be an integer scalar. Defaults to 10 million. |
use_arrow_metadata |
If this option is
|
write_arrow_metadata |
Whether to add the Apache Arrow types as
metadata to the file |
write_data_page_version |
Data version to write by default. Possible values are 1 and 2. Default is 1. |
write_minmax_values |
Whether to write minimum and maximum values
per row group, for data types that support this in |
List of nanoparquet options.
# the effect of using Arrow metadata
tmp <- tempfile(fileext = ".parquet")
d <- data.frame(
fct = as.factor("a"),
dft = as.difftime(10, units = "secs")
)
write_parquet(d, tmp)
read_parquet(tmp, options = parquet_options(use_arrow_metadata = TRUE))
read_parquet(tmp, options = parquet_options(use_arrow_metadata = FALSE))
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.