readNexis: Read preprocessed files from Lexis Nexis

View source: R/readNexis.R

readNexisR Documentation

Read preprocessed files from Lexis Nexis

Description

Reads the XML-files from preprocessed Lexis Nexis files and seperates the text and meta data.

Usage

readNexis(path = getwd(), file = list.files(path = path, pattern =
  "*.xml$", full.names = FALSE, recursive = TRUE), do.meta = TRUE,
  do.text = TRUE, encoding = "utf-8")

Arguments

path

Character string with Path where the data files are.

file

Character string with names of the XML files.

do.meta

Logical: Should the algorithm collect meta data?

do.text

Logical: Should the algorithm collect text data?

encoding

encoding of the input files.

Value

List of two

meta

id url date title page resource author leadtext downloadDate

text

text

Author(s)

Sakander Zirai (<s.zirai@live.de>), Lars Koppers (<koppers@statistik.tu-dortmund.de>)


Docma-TU/tmT documentation built on May 5, 2022, 12:45 a.m.