Tidied version of the book "The Art of Data Science" by Roger D. Peng and Elizabeth Matsui published in 2015 - 2017. This book is for sale at leanpub. It also has its own web page.
1 |
A tibble representing the book in tidy text format with single word as token. Stop words are removed.
Data has the following columns:
id <int> : Index of word inside the book.
book <chr> : Name of the book.
page <int> : Page number.
line <int> : Line number on page (empty lines are ignored).
word <chr> : Word.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.