extract_text: Extract text from PDFs and HTML pages.
In neuropsychology/neuropsychology.R: Toolbox for Psychologists, Neuropsychologists and Neuroscientists


Scrape text from PDFs or HTML pages and count word frequencies.

extract_text(sources=".",
              type="pdf",
              word.length.min=4,
              word.length.max=Inf,
              freq.min=10,
              freq.max=Inf)

`sources`	The name of a file (ending with ".pdf"), a directory, an HTML link, or a list of links. If left at its default, all the PDFs in the current directory are scraped.
`type`	"pdf" or "html".
`word.length.min`	Keep only words with a minimum length of x characters.
`word.length.max`	Keep only words with a maximum length of x characters.
`freq.min`	Keep only words that appear more than x times.
`freq.max`	Keep only words that appear fewer than x times.

data

A data frame with two columns: the words and their frequencies.
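
To make the effect of the length and frequency arguments concrete, here is a minimal base R sketch that builds a comparable word/frequency table from a plain string. It is not the package's implementation: `count_words`, the column names `word` and `freq`, the tokenisation rule, and the choice of inclusive length bounds with strict frequency bounds are all assumptions made for illustration.

```r
# Hypothetical helper for illustration; NOT part of the neuropsychology package.
count_words <- function(text,
                        word.length.min = 4,
                        word.length.max = Inf,
                        freq.min = 10,
                        freq.max = Inf) {
  # Lowercase, then split on anything that is not a letter (assumed rule).
  words <- unlist(strsplit(tolower(text), "[^a-z]+"))
  words <- words[nzchar(words)]

  # Length filter; bounds are treated as inclusive here (an assumption).
  n <- nchar(words)
  words <- words[n >= word.length.min & n <= word.length.max]

  # Count occurrences of each remaining word.
  counts <- table(words)
  df <- data.frame(word = as.character(names(counts)),
                   freq = as.integer(counts),
                   stringsAsFactors = FALSE)

  # Frequency filter; "more than" and "fewer than" are read as strict.
  df <- df[df$freq > freq.min & df$freq < freq.max, , drop = FALSE]

  # Most frequent words first, ties broken alphabetically.
  df <- df[order(-df$freq, df$word), , drop = FALSE]
  rownames(df) <- NULL
  df
}

sample_text <- "Memory and attention: memory tasks tax attention, and memory fades."
result <- count_words(sample_text, word.length.min = 4, freq.min = 1)
print(result)
```

With these settings, short words such as "and" and "tax" are dropped by the length filter, and words that occur only once are dropped by the frequency filter.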

Dominique Makowski


require(neuropsychology)

# text <- extract_text()  # Run in a folder containing some PDFs.

neuropsychology/neuropsychology.R documentation built on May 23, 2019, 4:27 p.m.
