ft_collect: Collect article text from local files

View source: R/collect.R

ft_collectR Documentation

Collect article text from local files

Description

ft_collect grabs full text data from file paths in your ft_data object (result of call to ft_get()). ft_text is a convenience function to grab the nested text data and bring it up in the list for easier access

Usage

ft_collect(x, ...)

ft_text(x, ...)

## Default S3 method:
ft_text(x, ...)

## S3 method for class 'ft_data'
ft_text(x, ...)

Arguments

x

Input. An object of class ft_data

...

Further args, ignored.

Details

The result of this call is actual text you can read

Value

an object of class ft_data, but the data slot should have character string of text from the XML/plain text/PDF file

Examples

## Not run: 
# Get some data
x <- ft_get('10.1371/journal.pone.0086169')

# note that the data is not in the object, gives NULL
x$plos$data$data

# Collect data from the .xml file
y <- x %>% ft_collect()

# note how the data is now in the object
y$plos$data$data

# Let's get the actual 
## ft_collect() alone, replaces file pointers with parsed text, 
##  maintaining object structure
x %>% ft_collect() 
## pulls the text out of the object
x %>% ft_collect() %>% ft_text()

## End(Not run)

ropensci/fulltext documentation built on Sept. 12, 2022, 7:57 a.m.