Home

/

GitHub

/

Docma-TU/tmT

/

readSPIEGEL: Read the SPIEGEL Corpus

readSPIEGEL: Read the SPIEGEL Corpus
In Docma-TU/tmT: Textmining Tools

View source: R/readSPIEGEL.R

readSPIEGEL

R Documentation

Read the SPIEGEL Corpus

Description

Reads the XML-files from the SPIEGEL corpus and seperates the text and meta data.

Usage

readSPIEGEL(path = getwd(), file = list.files(path = path, pattern =
  "*.xml$", full.names = FALSE, recursive = TRUE), do.meta = TRUE,
  do.text = TRUE)

Arguments

`path`	Character string with Path where the data files are.
`file`	Character string with names of the XML files.
`do.meta`	Logical: Should the algorithm collect meta data?
`do.text`	Logical: Should the algorithm collect text data?

Value

`meta`	id date title year number page_start page_stop pagetitle shorttitle rubrik ressort dokumentmerkmal dachzeile abstract
`text`	Text (Paragraphenweise)
`metamult`	signature person koerperschaft company inkl. Kategorie(n)

Examples

##---- Should be DIRECTLY executable !! ----

Docma-TU/tmT documentation built on May 5, 2022, 12:45 a.m.

Docma-TU/tmT index

README.md

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

Docma-TU/tmT
Textmining Tools

readSPIEGEL: Read the SPIEGEL Corpus
In Docma-TU/tmT: Textmining Tools

Read the SPIEGEL Corpus

Description

Usage

Arguments

Value

Examples

Related to readSPIEGEL in Docma-TU/tmT...

R Package Documentation

Browse R Packages

We want your feedback!

Docma-TU/tmT Textmining Tools

readSPIEGEL: Read the SPIEGEL Corpus In Docma-TU/tmT: Textmining Tools

Read the SPIEGEL Corpus

Description

Usage

Arguments

Value

Examples

Related to readSPIEGEL in Docma-TU/tmT...

R Package Documentation

Browse R Packages

We want your feedback!

Docma-TU/tmT
Textmining Tools

readSPIEGEL: Read the SPIEGEL Corpus
In Docma-TU/tmT: Textmining Tools