View source: R/extract_corpus.R
extract_corpus | R Documentation |
Extract text from one or more PDF documents into a tm Corpus.
extract_corpus(paths, which, ...)
paths | Path(s) to one or more PDF files. |
which | One of "gs" or "xpdf". |
... | Further arguments passed on. |
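The arguments above can be checked before any extraction is attempted. The following is a minimal sketch, not part of the package: the helper name `check_extract_args` is hypothetical, and it assumes only what the argument descriptions state, namely that `paths` points to existing files and `which` is either "gs" or "xpdf".

```r
# Hypothetical helper (not part of the package): validates the inputs
# described in the arguments table before calling extract_corpus().
check_extract_args <- function(paths, which = c("gs", "xpdf")) {
  # match.arg() raises an error unless `which` is one of the listed choices
  which <- match.arg(which)
  # expand a leading "~" to the user's home directory
  paths <- path.expand(paths)
  # collect any paths that do not point to an existing file
  not_found <- paths[!file.exists(paths)]
  if (length(not_found) > 0) {
    stop("File(s) not found: ", paste(not_found, collapse = ", "))
  }
  list(paths = paths, which = which)
}

# Usage with an empty temporary file standing in for a PDF
tmp <- tempfile(fileext = ".pdf")
file.create(tmp)
checked <- check_extract_args(tmp, "gs")
```

Failing early on a missing file or an unsupported `which` value gives a clearer error than one raised later by the external extraction tool.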
A tm Corpus or VCorpus
# Paths to the PDF files to extract
paths <- c(
  "~/github/sac/scott/pdfs/BarraquandEtal2014peerj.pdf",
  "~/github/sac/scott/pdfs/Chamberlain&Holland2009Ecology.pdf",
  "~/github/sac/scott/pdfs/Revell&Chamberlain2014MEE.pdf"
)

# Extract with gs
res <- extract_corpus(paths, "gs")
res
tm::TermDocumentMatrix(res$data)

# Extract with xpdf
res <- extract_corpus(paths, "xpdf")
res
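To try the term-document-matrix step without any PDFs at hand, a corpus can be built from in-memory strings. This is only a sketch under stated assumptions: the two sample strings and the names `texts`, `corp`, and `tdm` are made up for illustration, the tm package is assumed to be installed, and the corpus is created directly with tm rather than by `extract_corpus`.

```r
# Sketch only: builds a VCorpus from in-memory strings instead of PDFs,
# to show the kind of object that tm::TermDocumentMatrix() accepts.
if (requireNamespace("tm", quietly = TRUE)) {
  # Made-up sample documents (illustrative, not extracted from any PDF)
  texts <- c(
    doc1 = "species interactions in ecology",
    doc2 = "phylogenetic methods in ecology"
  )
  # VectorSource() treats each element of the vector as one document
  corp <- tm::VCorpus(tm::VectorSource(texts))
  # Rows are terms, columns are documents
  tdm <- tm::TermDocumentMatrix(corp)
  print(dim(tdm))
}
```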