readTagesschau: Read Tagesschau files
In Docma-TU/tmT: Textmining Tools

View source: R/readTagesschau.R


Description

Reads XML files from Tagesschau and separates the text from the metadata.

Usage

readTagesschau(path = getwd(), file = list.files(path = path, pattern =
  "*.xml$", full.names = FALSE, recursive = TRUE), encoding = "utf-8",
  type = c("condensed", "detail"))
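The default for `file` is an ordinary `list.files()` call, so it can be tried on its own to see which files would be picked up. The following is a minimal sketch using only base R; the directory and file names are made up for illustration and are not part of the package.

```r
# Build a throwaway directory tree (hypothetical names) with two XML files
# and one unrelated file.
root <- file.path(tempdir(), "tagesschau_demo")
unlink(root, recursive = TRUE)
dir.create(file.path(root, "2015"), recursive = TRUE)
invisible(file.create(file.path(root, "2015", "show1.xml")))
invisible(file.create(file.path(root, "show2.xml")))
invisible(file.create(file.path(root, "notes.txt")))

# Same call as the documented default for `file`, with `path` set to `root`.
# `pattern` is interpreted as a regular expression, not as a shell glob.
found <- list.files(path = root, pattern = "*.xml$",
                    full.names = FALSE, recursive = TRUE)
print(found)
```

Because `full.names = FALSE` and `recursive = TRUE`, the names are returned relative to `path` and include files in subdirectories.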

Arguments

`path`	Character string: path to the directory that contains the data files.
`file`	Character vector: names of the XML files to read, relative to `path`.
`encoding`	Character string: encoding of the input files.
`type`	Character string: whether to return one text per show, with paragraphs (`"condensed"`), or one text per comment/message/report (`"detail"`).
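A default of the form `type = c("condensed", "detail")` is the usual R idiom for a choice argument that is resolved with `match.arg()`. Whether `readTagesschau` does exactly this internally is an assumption; the helper `resolve_type` below is hypothetical and only illustrates how such an argument typically behaves.

```r
# Hypothetical helper, not part of tmT: mimics a choice argument that is
# resolved with match.arg().
resolve_type <- function(type = c("condensed", "detail")) {
  match.arg(type)
}

resolve_type()          # "condensed": the first choice is the default
resolve_type("detail")  # "detail"

# Values outside the listed choices raise an error.
bad <- tryCatch(resolve_type("summary"), error = function(e) "error")
print(bad)
```

Under that assumption, a call such as `readTagesschau(path = "path/to/xml", type = "detail")` (with a placeholder path) would request one text per comment/message/report, while omitting `type` would give one text per show.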