View source: R/text_analysis.R
get_candidates_fromarchivesearchresults | R Documentation |
This is somewhat challenging because sometimes the url has 'download' in sometimes it doesn't This function includes a hack to find either case. By default it excludes articles from publications in Ireland and Scotland, and documents already classified as 1 (already downloaded), 3 (verbatim repeat), 4 (Ireland), 5 (Scotland), 6 (Abroad)
get_candidates_fromarchivesearchresults( archivesearchresults, include_ocr = FALSE, restrict_EW = TRUE, restrict_classified = TRUE )
archivesearchresults |
The archive search results (including a url column) |
restrict_EW |
remove results published in Republic of Ireland and Scotland |
restrict_classified |
remove results already classified as 1, 3, 4, 5 or 6 |
include |
the ocr in the download (will slow down query) |
Candidate documents with urls matching the urls in archivesearchresults
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.