Man pages for xmarquez/hathiTools
Access the Hathi Trust Bookworm and Extracted Features Files from R

add_imputed_dateAdd imputed date
browse_htidsBrowse a set of Hathi Trust IDs interactively at the Hathi...
cache_htidsCaches downloaded JSON Extracted Features files to another...
clear_cacheRemoves cached files for a set of Hathi Trust ids
download_hathifileDownloads the Hathi Trust big hathifile
dramaDrama Dataset
fictionFiction Dataset
find_cached_htidsFinds cached Extracted Features files for a set of HT ids
get_hathi_countsReads the downloaded extracted features file for a given...
get_hathi_metaReads the volume-level metadata of a single downloaded Hathi...
get_hathi_page_metaReads the page-level metadata of a single Hathi Trust...
get_workset_metaGet metadata for a set of Hathi Trust IDs
htid_to_rsyncConverts a list of htids to relative paths for rsync to...
iso639ISO639 language codes
load_raw_hathifileLoads the raw hathifile into memory
pipePipe operator
poetryPoetry Dataset
query_bookwormQueries the Hathi Trust Bookworm Server at...
read_cached_htidsRead Cached HTIDs
rsync_from_hathirsync Hathi Trust EFs from Hathi Trust
workset_builderBuilds a Workset of Hathi Trust vol IDs by querying the...
xmarquez/hathiTools documentation built on June 2, 2025, 5:12 a.m.