readTagesschau: Read Tagesschau files

View source: R/readTagesschau.R

readTagesschauR Documentation

Read Tagesschau files

Description

Reads XML-files from Tagesschau and separates the text and meta data.

Usage

readTagesschau(path = getwd(), file = list.files(path = path, pattern =
  "*.xml$", full.names = FALSE, recursive = TRUE), encoding = "utf-8",
  type = c("condensed", "detail"))

Arguments

path

Character: string with path where the data files are.

file

Character: string with names of the XML files.

encoding

encoding of the input files.

type

Character: string whether to get one text with paragraphs per show (condensed) or one text per comment/message/report (detail).

Value

textmeta object

Author(s)

Jonas Rieger (<jonas.rieger@tu-dortmund.de>)


Docma-TU/tmT documentation built on May 5, 2022, 12:45 a.m.