readNexisOnline: Read preprocessed files from Nexis Online

View source: R/readNexisOnline.R

readNexisOnlineR Documentation

Read preprocessed files from Nexis Online

Description

Reads the HTML-files from preprocessed Lexis Nexis files and seperates the text and meta data.

Usage

readNexisOnline(path = getwd(), file = list.files(path = path, pattern
  = "*.HTML$", full.names = FALSE, recursive = TRUE), do.meta = TRUE,
  do.text = TRUE)

Arguments

path

Path where the data files are.

file

Character string with names of the HTML files.

do.meta

Logical: Should the algorithm collect meta data?

do.text

Logical: Should the algorithm collect text data?

Value

List of two

meta

id topic nr from title source date releaseNote downloadDate loadDate language length dateline byline section type pubType series graphic copyright

text

text


Docma-TU/tmT documentation built on May 5, 2022, 12:45 a.m.