Description Usage Arguments Details Value
Drive the word counter and return a sparse matrix containing word counts. See details for possible extra arguments.
1 |
files |
a list of file and folder names |
... |
extra arguments to control the process. See Details. |
Optional arguments are
locale
: Locale in ISO format, e.g. "en_US".
Defaults to whatever get_locale
returns
encoding
: the file encoding of the files
. Defaults to the
whatever get_encoding
returns
output
: the folder that the results will land in before being served
back as a data.frame. Defaults to a temporary folder
progress
: Whether to show the progress of the java code as it goes
through the files
no.currency
: Whether to remove currency. Defaults to FALSE
no.numbers
: Whether to remove numbers. Defaults to FALSE
stopwords
: File of stop words not to count. By default
every word is counted
stemmer
: A language specific stemmer. By default nothing is stemmed.
Available stemmer languages are danish
, dutch
, english
,
finnish
, french
, german
, hungarian
, italian
,
norwegian
, portuguese
, romanian
, russian
,
spanish
, swedish
, turkish
. These are snowball project stemmers
so I can't vouch for them.
This function also dumps the location of the temporary folder where the results landed to standard error in case you want the original csv data file and yoshikoder dictionary translation.
a sparse Matrix containing the word counts
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.