processXMLstring: Function to process XML data from a specific corpus into a...

Description

This function takes the .xml documents from a corpus of forum posts and returns a vector of strings. It may also work for other forum corpora that have a similar structure.

Usage

processXMLstring(pathToFolder, minMaxxWordCount = 300)

Arguments

pathToFolder

the path to the folder containing the corpus

minMaxWordCount

Documents with fewer tokens than this count are not accepted, and documents longer than the specified count are cropped to it. Defaults to 300.
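
The filter-and-crop rule described for minMaxWordCount can be sketched in base R as follows. This is only an illustration under stated assumptions, not the package's implementation: the helper name cropToWordCount is hypothetical, and it assumes tokens are whitespace-delimited words, which the documentation does not confirm.

```r
# Hypothetical helper, not part of the package: illustrates the
# filter-and-crop behaviour described for minMaxWordCount.
# Assumption: a "token" is a whitespace-delimited word.
cropToWordCount <- function(docs, wordCount = 300) {
  # Split each document into whitespace-delimited tokens.
  tokens <- strsplit(trimws(docs), "\\s+")
  # Keep only documents with at least `wordCount` tokens.
  keep <- lengths(tokens) >= wordCount
  # Crop the remaining documents to `wordCount` tokens and re-join them.
  vapply(tokens[keep],
         function(tok) paste(tok[seq_len(wordCount)], collapse = " "),
         character(1))
}

# Small demonstration with a word count of 3 instead of 300.
docs <- c("one two three four five", "one two", "a b c")
result <- cropToWordCount(docs, wordCount = 3)
print(result)
```

With a single count serving as both minimum and maximum, every string in the result has the same number of tokens, which may be convenient when the documents are later compared or fed to a model that expects fixed-length input.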


mouse0/suicideProject documentation built on May 3, 2019, 5:19 p.m.