check_parquet_file_compression: Check parquet file compression

View source: R/export.R

check_parquet_file_compression    R Documentation

Check parquet file compression

Description

This function reports how much compression is achieved when a dataset is exported in the Apache Parquet file format using the arrow package. It exports the dataset as an uncompressed parquet file and as parquet files compressed with each of the requested compression types, then compares each file size with the size of the same dataset exported as a csv file.

Usage

check_parquet_file_compression(
  .data,
  compression_type = c("snappy", "gzip", "brotli", "zstd", "lz4", "lzo")
)

Arguments

.data

dataset whose compression in the parquet format is to be checked.

compression_type

parquet compression types to check. Defaults to all compression types ("snappy", "gzip", "brotli", "zstd", "lz4", "lzo"); supply a character vector of one or more of these to check only specific types.

Value

message giving each file size and compression percentage

Examples

data(penguins, package = "palmerpenguins")
check_parquet_file_compression(penguins)
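
The comparison described above can be approximated without the store package. The following is a minimal sketch, not the package's implementation: `compare_parquet_sizes` is a hypothetical helper, it returns a data frame instead of printing a message, and expressing each size as a percentage of the csv size is an assumed definition of compression percentage. It relies on `arrow::write_parquet()` and `arrow::codec_is_available()`, and skips any codec that the installed arrow build does not support.

```r
# Hypothetical helper (not part of the store package): export a dataset as csv
# and as parquet with several codecs, then compare the resulting file sizes.
compare_parquet_sizes <- function(.data, compression_type = c("snappy", "gzip")) {
  stopifnot(requireNamespace("arrow", quietly = TRUE))

  # Baseline: size of the dataset written as a csv file
  csv_path <- tempfile(fileext = ".csv")
  on.exit(unlink(csv_path), add = TRUE)
  utils::write.csv(.data, csv_path, row.names = FALSE)
  csv_size <- file.size(csv_path)

  # Keep only the codecs that this arrow build can actually use
  available <- Filter(arrow::codec_is_available, compression_type)

  # Write one parquet file per codec (plus uncompressed) and record its size
  sizes <- vapply(c("uncompressed", available), function(codec) {
    path <- tempfile(fileext = ".parquet")
    on.exit(unlink(path), add = TRUE)
    arrow::write_parquet(.data, path, compression = codec)
    file.size(path)
  }, numeric(1))

  # Assumed definition: each file size as a percentage of the csv file size
  all_sizes <- c(csv_size, unname(sizes))
  data.frame(
    format = c("csv", paste0("parquet_", names(sizes))),
    bytes = all_sizes,
    percent_of_csv = round(100 * all_sizes / csv_size, 1)
  )
}
```

For example, `compare_parquet_sizes(mtcars, compression_type = c("snappy", "zstd"))` returns one row for the csv export, one for the uncompressed parquet file, and one for each requested codec that is available.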

gcfrench/store documentation built on May 17, 2024, 5:52 p.m.