View source: R/extract_corpus.R
| extract_corpus | R Documentation |
Extract text from one to many pdf documents into a tm Corpus.
extract_corpus(paths, which, ...)
paths |
Path to a file |
which |
One of gs, or xpdf. |
... |
further args passed on |
A tm Corpus or VCorpus
paths <- c("~/github/sac/scott/pdfs/BarraquandEtal2014peerj.pdf",
"~/github/sac/scott/pdfs/Chamberlain&Holland2009Ecology.pdf",
"~/github/sac/scott/pdfs/Revell&Chamberlain2014MEE.pdf")
res <- extract_corpus(paths, "gs")
res
tm::TermDocumentMatrix(res$data)
res <- extract_corpus(path, "xpdf")
res
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.