R/boilerpipeR-package.R
In boilerpipeR: Interface to the boilerpipe Java library by Christian Kohlschutter (http://code.google.com/p/boilerpipe/)

#' \pkg{boilerpipeR} interfaces the boilerpipe Java library, created by Christian
#' Kohlschutter \url{http://code.google.com/p/boilerpipe/}. It implements robust heuristics
#' to extract the main content from HTML files, removing unessecary
#' elements like ads, banners and headers/footers.
#' 
#' @name boilerpipeR-package
#' @aliases boilerpipe
#' @docType package
#' @title Extract the main content from HTML files
#' @author Mario Annau \email{mario.annau@@gmail}
#' @keywords package
#' @seealso \code{\link{Extractor}} \code{\link{DefaultExtractor}} \code{\link{ArticleExtractor}}
#' @examples
#' \dontrun{
#' data(content)
#' extract <- DefaultExtractor(content)
#' cat(extract)
#' }
NULL

#' Wordpress generated Webpage (retrieved from Quantivity Blog \url{http://quantivity.wordpress.com}).
#' Content is saved as character and ready to be extracted.
#' @name content
#' @docType data
#' @author Mario Annau
#' @references \url{http://quantivity.wordpress.com}
#' @keywords data
#' @examples
#' #Data set has been generated as follows:
#' \dontrun{
#' library(RCurl)
#' url <- "http://quantivity.wordpress.com/2012/11/09/multi-asset-market-regimes/"
#' content <- getURL(url)
#' content <- iconv(content, "UTF-8", "ASCII//TRANSLIT")
#' save(content, file = "content.rda")
#' }
NULL

Any scripts or data that you put into this service are public.

boilerpipeR documentation built on May 2, 2019, 5:47 p.m.

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

boilerpipeR
Interface to the boilerpipe Java library by Christian Kohlschutter (http://code.google.com/p/boilerpipe/)

R/boilerpipeR-package.R
In boilerpipeR: Interface to the boilerpipe Java library by Christian Kohlschutter (http://code.google.com/p/boilerpipe/)

Try the boilerpipeR package in your browser

R Package Documentation

Browse R Packages

We want your feedback!

boilerpipeR Interface to the boilerpipe Java library by Christian Kohlschutter (http://code.google.com/p/boilerpipe/)

R/boilerpipeR-package.R In boilerpipeR: Interface to the boilerpipe Java library by Christian Kohlschutter (http://code.google.com/p/boilerpipe/)

Try the boilerpipeR package in your browser

R Package Documentation

Browse R Packages

We want your feedback!

boilerpipeR
Interface to the boilerpipe Java library by Christian Kohlschutter (http://code.google.com/p/boilerpipe/)

R/boilerpipeR-package.R
In boilerpipeR: Interface to the boilerpipe Java library by Christian Kohlschutter (http://code.google.com/p/boilerpipe/)