create_dfm: Create a document-feature matrix

Description Usage Arguments Value

View source: R/deduplication_functions.R

Description

Given a character vector of document information and a language, creates a document-feature matrix.

Usage

1
create_dfm(elements, language)

Arguments

elements

a character vector of document information (e.g. document titles or abstracts)

language

the language to use for tokenizing documents

Value

a matrix with documents as rows and terms as columns


elizagrames/synthesisr documentation built on May 26, 2019, 10:34 a.m.