Man pages for javierluraschi/sparkwarc
Load WARC Files into Apache Spark

cc_warcProvides WARC paths for commoncrawl.org
rcpp_read_warc_sampleLoads the sample warc file in Rcpp
spark_rcpp_read_warcReads a WARC File into using Rcpp
spark_read_warcReads a WARC File into Apache Spark
spark_read_warc_sampleLoads the sample warc file in Spark
sparkwarcsparkwarc
spark_warc_sample_pathRetrieves sample warc path
javierluraschi/sparkwarc documentation built on Jan. 17, 2022, 5:51 a.m.