drama: Drama Dataset

dramaR Documentation

Drama Dataset

Description

A dataset containing the HTIDs of the volumes classified as "drama" in Ted Underwood, Boris Capitanu, Peter Organisciak, Sayan Bhattacharyya, Loretta Auvil, Colleen Fallaw, J. Stephen Downie (2015). Word Frequencies in English-Language Literature, 1700-1922 (0.2) Dataset. HathiTrust Research Center. doi:10.13012/J8JW8BSJ. Taken from the summary netadata file at http://data.analytics.hathitrust.org/genre/drama_metadata.csv

Usage

drama

Format

An object of class tbl_df (inherits from tbl, data.frame) with 17709 rows and 3 columns.

Details

htid

The Hathi Trust ID of the volume.

drama_prob

A confidence metric: the probability that more than 80% of the pages in the volume assigned to "drama" have been correctly classified.

drama_prop

The proportion of pages in the volume classified as "drama". Calculated from genrepages/totalpages in the original metadata file.

Source

https://wiki.htrc.illinois.edu/display/COM/Word+Frequencies+in+English-Language+Literature,+1700-1922


xmarquez/hathiTools documentation built on June 2, 2025, 5:12 a.m.