Description Usage Arguments Value
There is a real risk of exhausting your computer's memory if you search for a frequent pattern in the files. If you specify an output directory, the results are written to files and kept out of memory. Linux/Mac users may want to use the command-line tool 'grep' instead of this function, as it will be much faster, e.g. grep [PATTERN] corpus_directory/*csv > outputfile.csv
Information on POS and dependency labels: http://universaldependencies.org/format.html.
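The grep alternative mentioned above can be tried end to end with a small sketch like the one below. It is only an illustration: the temporary corpus, the file names a.csv and b.csv, the column layout and the pattern 'cat' are all made up for the example and are not part of the package.

```shell
# Build a tiny throwaway corpus so the example is self-contained
# (hypothetical files and contents, not the real corpus format).
corpus_directory=$(mktemp -d)
printf 'id,text\n1,the cat sat\n2,a dog barked\n' > "$corpus_directory/a.csv"
printf 'id,text\n3,another cat\n' > "$corpus_directory/b.csv"

outputfile=$(mktemp)

# -E enables extended regular expressions. Because several files are
# searched, grep prefixes each matching line with "<file name>:".
grep -E 'cat' "$corpus_directory"/*.csv > "$outputfile"

# Show the matches that were written to disk instead of held in memory.
cat "$outputfile"
```

Because of the file-name prefix, the output is not itself a clean CSV file; adding the -h option to grep suppresses the prefix if only the matching rows are wanted.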
searchCorpus(pattern, corpus_directory, output_directory = NULL,
  field = c("tagged", "text"))
pattern | word or regular expression to search for
corpus_directory | directory where you keep the corpus files
output_directory | directory where you want the output files to go. Leave as NULL (default) if you want to collect the results in an R object rather than writing them out to files
field | one of 'tagged' or 'text'. Use 'tagged' to search the parsed data; otherwise use 'text'
Either a data.frame (when output_directory is NULL) or nothing (when the results are written to files in output_directory)