generate_tpch: Generate TPC-H data

View source: R/ensure-tpch-source.R

generate_tpch    R Documentation

Generate TPC-H data

Description

Generate TPC-H data at a given scale factor. By default, data is written relative to the current working directory. To write it elsewhere, set the environment variable ARROWBENCH_DATA_DIR to point to another directory; doing so gives you one central location for the data regardless of the directory you run from. Running this function installs a custom version of duckdb in an r_libs directory, relative to the directory specified by the environment variable ARROWBENCH_LOCAL_DIR. The first time you run this function, you will see significant output from that installation process. This is normal.

Usage

generate_tpch(scale_factor = 1)

Arguments

scale_factor

The TPC-H scale factor: a relative measure of the size of the generated data, in gigabytes (a scale factor of 1 corresponds to roughly 1 GB of data).
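
Examples

The following is a minimal sketch of how the two environment variables described above might be set before generating data. The directory locations under tempdir() are hypothetical placeholders, not paths the package requires. The sketch also assumes that arrowbench is installed and that generate_tpch() is exported from it. The call is guarded so the setup part can be run on its own.

```r
# Hypothetical locations, chosen only for illustration; use any directories you like.
data_dir  <- file.path(tempdir(), "arrowbench-data")
local_dir <- file.path(tempdir(), "arrowbench-local")
dir.create(data_dir,  recursive = TRUE, showWarnings = FALSE)
dir.create(local_dir, recursive = TRUE, showWarnings = FALSE)

# Point arrowbench at a central data directory and at the directory
# under which the r_libs directory (custom duckdb install) is created.
Sys.setenv(
  ARROWBENCH_DATA_DIR  = data_dir,
  ARROWBENCH_LOCAL_DIR = local_dir
)

# Assumption: generate_tpch() is exported by arrowbench.
# The first run installs duckdb and prints a lot of output.
if (requireNamespace("arrowbench", quietly = TRUE)) {
  arrowbench::generate_tpch(scale_factor = 1)
}
```

Sys.setenv() only affects the current R session. To make the setting persist, the same variables could instead be defined in a .Renviron file.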


ursa-labs/arrowbench documentation built on July 8, 2023, 11:36 a.m.