sparkwarc: Load WARC Files into Apache Spark
Version 0.1.1

Load WARC (Web ARChive) files into Apache Spark using 'sparklyr'. This allows to read files from the Common Crawl project .

Getting started

Package details

AuthorJavier Luraschi [aut, cre]
Date of publication2017-01-13 06:42:24
MaintainerJavier Luraschi <[email protected]>
LicenseApache License 2.0
Package repositoryView on CRAN
Installation Install the latest version of this package by entering the following in R:

Try the sparkwarc package in your browser

Any scripts or data that you put into this service are public.

sparkwarc documentation built on May 30, 2017, 6:16 a.m.