sample_docs: Sample Docs

Description Usage Arguments Value Examples

View source: R/sample_docs.R

Description

Output a weighted random sample of documents. Documents weighted by their length.

Usage

1
sample_docs(ocr_dir = NULL, output_dir = NULL, nsamp = NULL)

Arguments

ocr_dir

path to folder containing OCRd documents; required

output_dir

path to folder containing sampled OCRd documents; required

nsamp

number of sampled files; required

Value

a vector of names of the sampled files. It also creates a directory with sampled files

Examples

1
2
3
4
## Not run: 
sample_docs(ocr_dir="path_to_ocr_dir", output_dir = "path_to_output_dir_with_sample_docs")

## End(Not run)

soodoku/recognize documentation built on May 30, 2019, 6:30 a.m.