getFromFolderWF: Function which takes a folder as an argument and returns a...

Description Usage Arguments

Description

This function takes a path to a folder as an argument. It then excludes all documents in the folder which have less than 300 tokens and process the rest. The output is a data structure which can be iterated through length-wise. Odd indexes in the data structure return lists which hold the unique tokens of an individual document, even indexes hold the frequencies

Usage

1
getFromFolderWF(pathToFolder, minMaxWordCount = 300)

Arguments

pathToFolder,

the folder containing your corpus

minMaxWordCount,

no documents with less tokens than indicated will be accepted and all documents longer than the spefified count will be cropped Defaults to 300


mouse0/suicideProject documentation built on May 3, 2019, 5:19 p.m.