Description Usage Arguments Format Details Value Methods Author(s) See Also
Split
Splits a corpus into a training, test and optional validation set.
1 |
corpus |
Corpus object. |
name |
Character string indicating the name for the cross-validation set. |
train |
Numeric indicating the proportion of the Corpus to allocate to the training set. Acceptable values are between 0 and 1. The total of the values for the train, validation and test parameters must equal 1. |
validation |
Numeric indicating the proportion of the Corpus to allocate to the validation set. Acceptable values are between 0 and 1. The total of the values for the train, validation and test parameters must equal 1. |
test |
Numeric indicating the proportion of the Corpus to allocate to the test set. Acceptable values are between 0 and 1. The total of the values for the train, validation and test parameters must equal 1. |
stratify |
Logical. If TRUE (default), splits and sampling will be stratefied. |
seed |
Numeric used to initialize a pseudorandom number generator. |
An object of class R6ClassGenerator
of length 24.
Splits a corpus into a training, test and optional validation set. These corpora are combined into a single cross-validation set or CVSet object.
CVSet object
new()
Initializes an object of the Split class.
execute(x, train = 0.75, validation = 0, test = 0.25,
stratify = TRUE, seed = NULL)
Executes the corpus splits.
John James, jjames@datasciencesalon.org
Other CorpusStudio Family of Classes: CorpusStudio
,
KFold
, Sample0
,
Sample
, Segment
,
TokenizerNLP
, TokenizerQ
,
Tokenizer
, Token
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.