Get the content of Hillary Rodham Clinton's emails by release.
get_emails(release, save.dir = getwd(), extractor, ...)
release | Name of the batch of released emails; see details.
save.dir | Directory in which to save the extracted text; defaults to getwd().
extractor | Full path to the pdf extractor.
... | additional parameters to pass to
Below are the valid values for release; they follow the WSJ naming convention.
Benghazi
June
July
August
September
October
November
January 7
January 29
February 19
february 29
December
Non-disclosure
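A minimal sketch of checking a release name before any download is attempted. The values are copied verbatim from the list above; `valid_releases` and `check_release` are hypothetical helpers, not part of the package, and the exact, case-sensitive comparison is an assumption, since the documentation does not say how get_emails matches the name.

```r
# Valid values for `release`, copied verbatim from the list above.
valid_releases <- c("Benghazi", "June", "July", "August", "September",
                    "October", "November", "January 7", "January 29",
                    "February 19", "february 29", "December",
                    "Non-disclosure")

# Hypothetical helper (not part of the package): fail early with a readable
# message when the name is not in the list. Assumes exact, case-sensitive
# matching.
check_release <- function(release) {
  if (!is.character(release) || length(release) != 1L ||
      !release %in% valid_releases) {
    stop("`release` must be one of: ",
         paste(valid_releases, collapse = ", "), call. = FALSE)
  }
  release
}

check_release("August")
```

Checking the name up front avoids a download that fails only because of a misspelled release.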
The extractor argument is the full path to your pdftotext.exe
extractor; visit xpdf to download it, or try get_xpdf, which
attempts to download and unzip the pdf to text extractor. See examples.
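One way to locate an extractor before calling get_emails, sketched with base R only. `find_extractor` is a hypothetical helper, not part of the package, and it is an assumption that a pdftotext binary found on the PATH is acceptable as the extractor argument; the documentation only mentions a full path to pdftotext.exe.

```r
# Hypothetical helper (not part of the package): look for a pdftotext binary
# on the PATH first, then fall back to a user-supplied path if that file
# exists. Returns "" when nothing usable is found.
find_extractor <- function(fallback = "") {
  on_path <- unname(Sys.which("pdftotext"))
  if (nzchar(on_path)) return(on_path)
  if (nzchar(fallback) && file.exists(fallback)) return(fallback)
  ""
}

ext <- find_extractor()
if (!nzchar(ext)) {
  message("pdftotext not found; download xpdf or try get_xpdf()")
}
```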
Fetches the email zip file from the WSJ and extracts the text files into
save.dir; returns the full path to the directory that contains the parsed
txt files.
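Since the return value is a directory path, the parsed files can be read back with base R. The sketch below assumes the files carry a .txt extension and may sit in subdirectories; `read_email_texts` is a hypothetical helper, not part of the package, and the temporary directory with its placeholder file only stands in for a real get_emails result so that the code runs offline.

```r
# Hypothetical helper (not part of the package): read every .txt file under
# `dir` into a named list, one character vector of lines per file.
read_email_texts <- function(dir) {
  files <- list.files(dir, pattern = "\\.txt$", full.names = TRUE,
                      recursive = TRUE)
  texts <- lapply(files, readLines, warn = FALSE)
  names(texts) <- basename(files)
  texts
}

# Stand-in directory with one placeholder file; with real data, pass the
# path returned by get_emails() instead.
demo_dir <- tempfile("emails")
dir.create(demo_dir)
writeLines(c("placeholder line 1", "placeholder line 2"),
           file.path(demo_dir, "example.txt"))

texts <- read_email_texts(demo_dir)
lengths(texts)  # number of lines read from each file
```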
John Coene jcoenep@gmail.com
get_xpdf, download_emails, extract_emails
## Not run:
# get xpdf extractor
ext <- get_xpdf()

# create a directory to hold the emails
dir.create("emails")

# get emails released in August
emails_aug <- get_emails(release = "August", save.dir = "./emails",
                         extractor = ext)

# use manually downloaded extractor
# ext <- "C:/xpdfbin-win-3.04/bin64/pdftotext.exe"

# get emails related to Benghazi released in December
emails_bengh <- get_emails(release = "Benghazi", extractor = ext,
                           save.dir = "./emails")
## End(Not run)