tokenize_folder: Tokenize_folder

Description Usage Arguments Value

View source: R/text_core.R

Description

Tokenize text files in 'path' in parallel using 'n_workers'

Usage

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
tokenize_folder(
  path,
  extensions = NULL,
  folders = NULL,
  output_dir = NULL,
  skip_if_exists = TRUE,
  output_names = NULL,
  n_workers = 6,
  rules = NULL,
  tok = NULL,
  encoding = "utf8"
)

Arguments

path

path

extensions

extensions

folders

folders

output_dir

output_dir

skip_if_exists

skip_if_exists

output_names

output_names

n_workers

number of workers

rules

rules

tok

tokenizer

encoding

encoding

Value

None


fastai documentation built on July 28, 2021, 5:06 p.m.