generate.term.dataset: Generate frequencies of terms for each paper

View source: R/generate_term_dataset.R

generate.term.datasetR Documentation

Generate frequencies of terms for each paper

Description

Generate frequencies of terms for each paper

Usage

generate.term.dataset(cleaned_text, in_dir, keywords)

Arguments

cleaned_text

output of 'clean.text' - a list of cleaned text files

in_dir

directory with input text files

keywords

A set of keywords as characters (i.e. traits of interest) in a vector

Value

A list of frequencies of keywords for each paper

Examples

download.file("https://github.com/ajhelmstetter/papieRmache/raw/master/inst/extdata/test_pdfs.zip", destfile = "./test_pdfs.zip")
unzip("./test_pdfs.zip")
ct<-clean.text(in_dir = "./test_pdfs/",all_keywords=kw)
generate.term.dataset(cleaned_text = ct, in_dir = "./testpdfs/",keywords = c("bisse","musse"))


ajhelmstetter/papieRmache documentation built on March 30, 2024, 9:22 p.m.