Convert 'epub' Files to Text
The 'epub' file format is really just a structured 'ZIP' archive with metadata, graphics and (usually) 'HTML' text. Tools are provided to turn an 'epub' file into a tidy data frame.
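Since an 'epub' is a 'ZIP' archive of (mostly) 'HTML' files, the basic idea can be sketched in base R. This is only an illustration and not how pubcrawl is implemented: `strip_html()`, `chapters_from_dir()` and `epub_chapters()` are hypothetical helper names, tags are removed with crude regular expressions rather than a real HTML parser, and files come back in alphabetical order rather than the reading order an 'epub' declares in its package metadata.

```r
# Hypothetical helpers for illustration only; none of these are part of pubcrawl.

# Crudely strip markup from one HTML string (regex-based, not a real parser).
strip_html <- function(html) {
  # drop <script> and <style> blocks together with their contents
  txt <- gsub("(?is)<(script|style)\\b.*?</\\1>", " ", html, perl = TRUE)
  # drop any remaining tags
  txt <- gsub("<[^>]+>", " ", txt, perl = TRUE)
  # collapse runs of whitespace and trim the ends
  trimws(gsub("\\s+", " ", txt, perl = TRUE))
}

# Read every (X)HTML file under an already-extracted epub directory
# into a data frame with one row per file.
chapters_from_dir <- function(dir) {
  files <- list.files(dir, pattern = "\\.x?html?$", recursive = TRUE,
                      ignore.case = TRUE)
  text <- vapply(files, function(f) {
    lines <- readLines(file.path(dir, f), warn = FALSE)
    strip_html(paste(lines, collapse = "\n"))
  }, character(1), USE.NAMES = FALSE)
  data.frame(path = files, text = text, stringsAsFactors = FALSE)
}

# Unzip an epub into a temporary directory and convert its HTML files.
epub_chapters <- function(epub) {
  out <- tempfile("epub_")
  dir.create(out)
  utils::unzip(epub, exdir = out)
  chapters_from_dir(out)
}
```

Calling `epub_chapters()` on a path to an 'epub' file would return a `path`/`text` data frame. A real converter also has to deal with character entities, encodings and chapter ordering, which this sketch ignores.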
The following functions are implemented:
epub_to_text: Convert an epub file into a data frame of plaintext chapters
There are edge cases I've totally not covered yet. Feel free to jump in and make this a real, useful package!
hgr on CRAN so it's not a GH dep
devtools::install_github("hrbrmstr/pubcrawl")
options(width=120)
library(pubcrawl)
library(tidyverse)
# current version
packageVersion("pubcrawl")
epub_to_text("~/Data/R Packages.epub")
epub_to_text(system.file("extdat", "augustine.epub", package="pubcrawl")) %>% mutate(path = abbreviate(path))
Please note that this project is released with a Contributor Code of Conduct. By participating in this project you agree to abide by its terms.