load_raw_hathifile | R Documentation |
This function loads a previously downloaded hathifile into memory (or
downloads the latest one if it can't find it). It also turns the column
us_gov_doc_flag into a logical value (TRUE or FALSE) and eliminates 9999
values for rights_date_used (sets them to NA).
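The two clean-up steps described above can be sketched on a toy data frame. This is a base R illustration, not the package's own code: the rows are made up, and it assumes the raw flag is stored as 0/1 integers.

```r
# Toy stand-in for a few hathifile rows (all values are invented for illustration).
toy <- data.frame(
  htid = c("mdp.001", "uc1.002", "hvd.003"),
  us_gov_doc_flag = c(0L, 1L, 0L),          # assumed raw encoding: 0 or 1
  rights_date_used = c(1923L, 9999L, 1850L),
  stringsAsFactors = FALSE
)

# Step 1: turn the 0/1 flag into a logical column.
toy$us_gov_doc_flag <- as.logical(toy$us_gov_doc_flag)

# Step 2: treat the placeholder year 9999 as missing.
toy$rights_date_used[toy$rights_date_used == 9999L] <- NA_integer_

print(toy)
```

After these steps the flag column holds TRUE/FALSE values and the placeholder year is NA, so summaries such as range(toy$rights_date_used, na.rm = TRUE) are no longer skewed by 9999.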
load_raw_hathifile(
  filename = NULL,
  dir = getOption("hathiTools.hathifile.dir"),
  hathifile_date = getOption("hathiTools.hathifile"),
  cols,
  fix_date = TRUE
)
filename |
The name of the downloaded hathifile. If |
dir |
The directory where the raw hathifile is to be found. |
hathifile_date |
The date of the hathifile. (A new one is released every
month). This defaults to |
cols |
If wanted, a set of columns to load. Since the file is so large,
one can reduce memory use by selecting only certain columns. These can be
any of the following: |
fix_date |
Fixes |
A very large tibble, with over 17 million records, loaded into memory. The tibble package does some lazy loading to minimize resource use, but when fully loaded this data frame takes over 5 GB of memory.
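To illustrate why selecting columns at read time reduces memory use, here is a minimal base R sketch on a tiny tab-separated file. It does not call load_raw_hathifile or reproduce how the package reads the file; the temporary file, its header row, and its three columns are assumptions made for the example.

```r
# Write a tiny tab-separated file standing in for a hathifile (invented rows).
tmp <- tempfile(fileext = ".tsv")
toy <- data.frame(
  htid = c("mdp.001", "uc1.002"),
  title = c("First title", "Second title"),
  rights_date_used = c(1923L, 1850L),
  stringsAsFactors = FALSE
)
write.table(toy, tmp, sep = "\t", quote = FALSE, row.names = FALSE)

# Columns we actually want in memory.
wanted <- c("htid", "rights_date_used")

# Read only the header to learn the column names, then mark unwanted
# columns as "NULL" so read.delim() skips them entirely.
header <- names(read.delim(tmp, nrows = 1))
classes <- rep("NULL", length(header))
classes[header %in% wanted] <- NA   # NA lets R guess the column type

slim <- read.delim(tmp, colClasses = classes, stringsAsFactors = FALSE)
print(names(slim))

unlink(tmp)  # remove the temporary file
```

Skipped columns are never materialized, which is the same idea behind passing a reduced set of columns through the cols argument on a file with millions of rows.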