readNexisOnline: Read preprocessed files from Nexis Online
In Docma-TU/tmT: Textmining Tools

Description

Reads the HTML files from preprocessed Lexis Nexis files and separates the text from the metadata.

Usage

readNexisOnline(path = getwd(),
  file = list.files(path = path, pattern = "*.HTML$",
    full.names = FALSE, recursive = TRUE),
  do.meta = TRUE, do.text = TRUE)

Arguments

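`path`	Path to the directory that contains the data files.
`file`	Character vector with the names of the HTML files. By default, all files below `path` that match the pattern "*.HTML$", searched recursively.
`do.meta`	Logical: Should the algorithm collect metadata?
`do.text`	Logical: Should the algorithm collect text data?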
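
The default for `file` is built with `list.files()`, so it can help to check which files would be picked up before reading anything. The following is a minimal sketch using base R only. The directory `nexis_demo`, the file names, and the stricter pattern `"\\.HTML$"` are assumptions made for illustration; the documented default pattern is "*.HTML$".

```r
# Throwaway directory with placeholder files (hypothetical names, empty files).
corpus_dir <- file.path(tempdir(), "nexis_demo")
dir.create(file.path(corpus_dir, "batch1"), recursive = TRUE, showWarnings = FALSE)
invisible(file.create(file.path(corpus_dir, "batch1", "export1.HTML")))
invisible(file.create(file.path(corpus_dir, "notes.txt")))

# Same arguments as the documented default for `file`, except for the pattern:
# "\\.HTML$" is an illustrative, stricter regular expression (assumption).
# full.names = FALSE keeps the names relative to `path`;
# recursive = TRUE also searches subdirectories.
html_files <- list.files(path = corpus_dir, pattern = "\\.HTML$",
                         full.names = FALSE, recursive = TRUE)

print(html_files)
```

The pattern is case-sensitive, so files ending in `.html` would not be selected this way; such files would have to be passed explicitly through `file`.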

Value

A list with two elements:

`meta`	id, topic, nr, from, title, source, date, releaseNote, downloadDate, loadDate, language, length, dateline, byline, section, type, pubType, series, graphic, copyright
`text`	text
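
A call and a first look at the returned list might look like the sketch below. It assumes that the tmT package is installed and that `export_dir` points to a directory with preprocessed Nexis HTML files; the path shown is a hypothetical placeholder, and the block does nothing if either condition is not met.

```r
# Hypothetical location of the preprocessed HTML files (assumption).
export_dir <- "path/to/nexis/exports"

result <- NULL
if (requireNamespace("tmT", quietly = TRUE) && dir.exists(export_dir)) {
  # Collect both metadata and text, as in the documented defaults.
  result <- tmT::readNexisOnline(path = export_dir,
                                 do.meta = TRUE, do.text = TRUE)

  # The returned list has the two elements documented above.
  str(result$meta, max.level = 1)
  str(result$text, max.level = 1, list.len = 3)
}
```

Setting `do.meta = FALSE` or `do.text = FALSE` skips the corresponding part, which may be useful when only one of the two is needed.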