austen_books: Tidy data frame of Jane Austen's 6 completed, published...


Description

Returns a tidy data frame of Jane Austen's 6 completed, published novels with two columns: text, which contains the text of the novels divided into elements of up to about 70 characters each, and book, which contains the titles of the novels as a factor in order of publication.

Usage

austen_books()

Details

Users should be aware that there are some differences in usage between the novels as made available by Project Gutenberg. For example, "anything" vs. "any thing", "Mr" vs. "Mr.", and using underscores vs. all caps to indicate italics/emphasis.
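As a rough illustration of these differences, the sketch below counts, per novel, the lines that contain each spelling variant and the lines that use underscore-delimited emphasis. The column names (`anything_one_word`, `any_thing_two_words`, `underscore_emphasis`) and the regular expressions are illustrative assumptions, not part of the package. Because the text is split into short lines, a phrase that wraps across two lines is not counted.

```r
library(janeaustenr)
library(dplyr)

# Count lines (not individual occurrences) matching each usage variant.
# Patterns and column names are hypothetical, chosen for illustration only.
usage_counts <- austen_books() %>%
  group_by(book) %>%
  summarise(
    anything_one_word   = sum(grepl("\\banything\\b", text, ignore.case = TRUE, perl = TRUE)),
    any_thing_two_words = sum(grepl("\\bany thing\\b", text, ignore.case = TRUE, perl = TRUE)),
    underscore_emphasis = sum(grepl("_[A-Za-z]+_", text, perl = TRUE))
  )

usage_counts
```

Comparing the columns across novels gives a quick sense of which conventions each Project Gutenberg text follows before any normalization is attempted.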

Value

A data frame with two columns: text and book
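A minimal sketch for inspecting the returned value, assuming the janeaustenr package is installed; the variable name `books` is arbitrary.

```r
library(janeaustenr)

books <- austen_books()

# Column names of the returned data frame
names(books)

# The book column is a factor; its levels are the novel titles
# in order of publication
levels(books$book)
```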

Examples

library(janeaustenr)
library(dplyr)

# Count the number of text lines in each novel
austen_books() %>%
  group_by(book) %>%
  summarise(total_lines = n())

