bootstrap_dfm: Bootstrap a dfm

Description Usage Arguments Details Value Author(s) Examples

Description

Create an array of resampled dfms.

Usage

1
bootstrap_dfm(x, n = 10, ..., verbose = quanteda_options("verbose"))

Arguments

x

a character or corpus object

n

number of resamples

...

additional arguments passed to dfm

verbose

if TRUE print status messages

Details

Function produces multiple, resampled dfm objects, based on resampling sentences (with replacement) from each document, recombining these into new "documents" and computing a dfm for each. Resampling of sentences is done strictly within document, so that every resampled document will contain at least some of its original tokens.

Value

A named list of dfm objects, where the first, dfm_0, is the dfm from the original texts, and subsequent elements are the sentence-resampled dfms.

Author(s)

Kenneth Benoit

Examples

1
2
3
4
5
6
# bootstrapping from the original text
set.seed(10)
txt <- c(textone = "This is a sentence.  Another sentence.  Yet another.", 
         texttwo = "Premiere phrase.  Deuxieme phrase.")
bootstrap_dfm(txt, n = 3, verbose = TRUE)
         

quanteda/quanteda documentation built on June 15, 2019, 8:36 a.m.