rtika: R Interface to 'Apache Tika'

Extract text or metadata from over a thousand file types, using Apache Tika <https://tika.apache.org/>. Get either plain text or structured XHTML content.

Package details

AuthorSasha Goodman [aut, cre], The Apache Software Foundation [aut, cph], Julia Silge [rev] (Reviewed the package for rOpenSci, see https://github.com/ropensci/onboarding/issues/191), David Gohel [rev] (Reviewed the package for rOpenSci, see https://github.com/ropensci/onboarding/issues/191)
MaintainerSasha Goodman <[email protected]>
LicenseApache License 2.0 | file LICENSE
URL http://github.com/ropensci/rtika
Package repositoryView on CRAN
Installation Install the latest version of this package by entering the following in R:

Try the rtika package in your browser

Any scripts or data that you put into this service are public.

rtika documentation built on Nov. 15, 2018, 9:04 a.m.