Selected fields of metadata about each of the Project Gutenberg
works. These were collected using the gitenberg Python package,
A tbl_df (see tibble or dplyr) with one row for each work in Project Gutenberg and the following columns:
Numeric ID, used to retrieve works from Project Gutenberg
Author, if a single one given. Given as last name first (e.g. "Doyle, Arthur Conan")
Project Gutenberg author ID
Language ISO 639 code, separated by / if multiple. Two letter code if one exists, otherwise three letter. See https://en.wikipedia.org/wiki/List_of_ISO_639-2_codes
Which collection or collections this is found in, separated by / if multiple
Generally one of three options: "Public domain in the USA." (the most common by far), "Copyrighted. Read the copyright notice inside this book for details.", or "None"
Whether there is a file containing digits followed by
.txt in Project Gutenberg for this record (as opposed to, for
example, audiobooks). If not, cannot be retrieved with
To find the date on which this metadata was last updated,
gutenberg_works, gutenberg_authors, gutenberg_subjects
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30
library(dplyr) library(stringr) gutenberg_metadata gutenberg_metadata %>% count(author, sort = TRUE) # look for Shakespeare, excluding collections (containing "Works") and translations shakespeare_metadata <- gutenberg_metadata %>% filter(author == "Shakespeare, William", language == "en", !str_detect(title, "Works"), has_text, !str_detect(rights, "Copyright")) %>% distinct(title) ## Not run: shakespeare_works <- gutenberg_download(shakespeare_metadata$gutenberg_id) ## End(Not run) # note that the gutenberg_works() function filters for English # non-copyrighted works and does de-duplication by default: shakespeare_metadata2 <- gutenberg_works(author == "Shakespeare, William", !str_detect(title, "Works")) # date last updated attr(gutenberg_metadata, "date_updated")
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.