readWORDPRESS: Read the WORDPRESS Corpus as CSV


readWORDPRESS R Documentation

Read the WORDPRESS Corpus as CSV

Description

Reads the CSV files of a WORDPRESS blog corpus and separates the text from the metadata.

Usage

readWORDPRESS(path = getwd(), file = list.files(path = path, pattern =
  "*.csv$", full.names = FALSE, recursive = TRUE), do.meta = TRUE,
  do.text = TRUE)

Arguments

path

Character string with the path to the directory that contains the data files.

file

Character string with the names of the CSV files.

do.meta

Logical: Should the algorithm collect meta data?

do.text

Logical: Should the algorithm collect text data?

Value

meta

id, date, title, year, number, page_start, page_stop, pagetitle, shorttitle, rubrik, ressort, dokumentmerkmal, dachzeile, abstract

text

Text (paragraph by paragraph)

metamult

signature, person, koerperschaft, company, incl. category/categories

Examples

##---- Should be DIRECTLY executable !! ----
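The default for the file argument in the Usage section is an ordinary list.files() call, so it can be tried out without the package. The following is a minimal sketch in base R only: the directory and file names (wordpress_demo, posts.csv, sub/more.csv, notes.txt) are made up for illustration, and the files do not have the layout of a real WORDPRESS export, so the readWORDPRESS call itself is only indicated in a comment.

```r
# Throwaway directory with two CSV files (one in a subfolder) and one other file.
# All names here are hypothetical and only serve the illustration.
corpus_dir <- tempfile("wordpress_demo")
dir.create(file.path(corpus_dir, "sub"), recursive = TRUE)
writeLines("id,title", file.path(corpus_dir, "posts.csv"))
writeLines("id,title", file.path(corpus_dir, "sub", "more.csv"))
writeLines("not a corpus file", file.path(corpus_dir, "notes.txt"))

# Same expression as the default value of `file` in the Usage section:
# file names are returned relative to `path`, and subfolders are searched.
csv_files <- list.files(path = corpus_dir, pattern = "*.csv$",
                        full.names = FALSE, recursive = TRUE)
print(csv_files)

# With real export files, the call would presumably look like this:
# corpus <- readWORDPRESS(path = corpus_dir, file = csv_files)
```

Because full.names = FALSE, the entries of file are interpreted relative to path, so a custom file vector should be given in the same relative form.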

Docma-TU/tmT documentation built on May 5, 2022, 12:45 a.m.