spark_read_warc_sample: Loads the sample WARC file into Spark


View source: R/sample.R

Description

Loads the sample WARC file into Spark.

Usage

spark_read_warc_sample(sc, filter = "", include = "")

Arguments

sc

An active spark_connection.

filter

A regular expression used to filter each WARC entry. The filtering runs in native code through Rcpp for efficiency.

include

A regular expression used to keep only the lines that match it. The matching runs in native code through Rcpp for efficiency.
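
Examples

The call below is a minimal sketch of how the arguments above fit together, not an example taken from the package itself. It assumes a local Spark installation reachable through sparklyr, and the pattern passed to include is purely illustrative. What the function returns is not stated on this page, so the result is only stored and printed.

```r
library(sparklyr)
library(sparkwarc)

# Assumption: a local Spark installation is available to sparklyr.
sc <- spark_connect(master = "local")

# Load the sample WARC file with the default (empty) filter and include.
sample_all <- spark_read_warc_sample(sc)

# Illustrative pattern only: keep the lines that match "<html".
sample_html <- spark_read_warc_sample(sc, include = "<html")

# Inspect whatever object the call returned.
print(sample_html)

spark_disconnect(sc)
```

Both filter and include default to the empty string, so the first call passes no regular expression for either argument.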


javierluraschi/sparkwarc documentation built on Jan. 17, 2022, 5:51 a.m.