tools/cpp/src/arrow/dataset/README.md

Arrow C++ Dataset

The arrow::dataset subcomponent provides an API to read and write semantic datasets stored in different locations and formats. It facilitates parallel processing of datasets spread across different physical files and serialization formats. Other concerns such as partitioning, filtering (partition- and column-level), and schema normalization are also addressed.

Development Status

Alpha/beta stage as of April 2020. API subject to change, possibly without deprecation notices.



Try the arrow package in your browser

Any scripts or data that you put into this service are public.

arrow documentation built on Nov. 25, 2023, 1:09 a.m.