split.categories: Function split.categories changes dataframe from the one with...

Description Usage Arguments Value Examples

Description

Such shape of data is required by naiveBayes function and generally recognized. Additionaly some categories defined by used can be dropped from the data frame by passing a list invalid.cats.hash

Usage

1
2
3
## S3 method for class 'categories'
split(df, min.n = 500, invalid.cats.hash = emptyenv(),
  min.w = 2)

Arguments

df

data.frame input data frame with values and categories

min.n

int minimalna liczba wystapien danej kategorii

invalid.cats.hash

hashtable nazwy kategori ktore powinny zostac odrzucone

min.w

int minimalna liczba wystapien danego slowa

Value

splitted_vectors list list that contains $data - output data $fact - output factor corresponding to the data

Examples

1
split.categories(mydataframe, 100, create.hash(c("c.OtherApplicationsNEC", "c.OtherSciencesNEC")), 2)

vovkaOst/TextClassifier documentation built on May 3, 2019, 6:41 p.m.