ExtractorWarc: Class to handle warc files


Description

This class inherits from the Instance class and implements the functions that extract the text and the date of a warc-type file.

Usage


Arguments

path

(character) Path of the warc-type file.

Format

An object of class R6ClassGenerator of length 24.

Details

The read_warc function of the jwatr package was overridden because it returned incorrect hours.

The jwatr package makes calls to Java, so rJava must be installed.
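Since the class depends on rJava, it can help to confirm that rJava is available before working with warc files. The following is a minimal sketch using only base R; the variable name has_rjava is chosen for illustration and is not part of the package.

```r
# Minimal sketch: check whether rJava can be loaded in this R session.
# requireNamespace() returns TRUE if the package is installed and loadable,
# and FALSE otherwise (it does not attach the package).
has_rjava <- requireNamespace("rJava", quietly = TRUE)

if (!has_rjava) {
  # rJava itself needs a working Java installation on the machine.
  message("rJava is not available; install Java, then run install.packages(\"rJava\")")
}
```

The check only reports availability; it does not install anything.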

Inherit

This class inherits from Instance and implements the obtainSource and obtainDate abstract functions.
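The inheritance pattern described here can be sketched with the R6 package. The classes below, DemoInstance and DemoExtractor, are hypothetical stand-ins written for illustration; they are not the real Instance or ExtractorWarc implementations, and the values they store are placeholders rather than real warc parsing.

```r
library(R6)

# Hypothetical stand-in for an abstract parent class (not the real Instance).
DemoInstance <- R6Class(
  "DemoInstance",
  public = list(
    path = NULL,
    source = NULL,
    date = NULL,
    initialize = function(path) {
      stopifnot(is.character(path), length(path) == 1)
      self$path <- path
    },
    # Abstract functions: the parent only signals that they must be overridden.
    obtainSource = function() stop("obtainSource() is abstract"),
    obtainDate = function() stop("obtainDate() is abstract")
  )
)

# Hypothetical subclass that implements both abstract functions.
DemoExtractor <- R6Class(
  "DemoExtractor",
  inherit = DemoInstance,
  public = list(
    obtainSource = function() {
      # Placeholder: a real extractor would read the file at self$path.
      self$source <- paste("contents of", self$path)
      invisible(self)
    },
    obtainDate = function() {
      # Placeholder: a real extractor would read the date from the file.
      self$date <- "unknown"
      invisible(self)
    }
  )
)

extractor <- DemoExtractor$new("example.warc")
extractor$obtainSource()
extractor$obtainDate()
print(extractor$source)
```

Calling obtainSource or obtainDate on the parent directly raises an error, which is one common way to emulate abstract functions in R6.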

Methods

See Also

Instance


miferreiro/bdp4R documentation built on May 20, 2019, 12:59 a.m.