Description Usage Arguments Format Details Inherit Methods Private fields See Also
This class allows you to preprocess the data of an instance to find the stop words that are in it. Optionally, you can decide whether to remove the data stop words or not.
1 2 3 4 | StopWordPipe$new(propertyName = "stopWord",
propertyLanguageName = "language",
alwaysBeforeDeps = list("GuessLanguagePipe"),
notAfterDeps = list("AbbreviationPipe"))
|
propertyName |
(character) Name of the property associated with the pipe. |
propertyLanguageName |
(character) Name of the language property. |
alwaysBeforeDeps |
(list) The dependences alwaysBefore (pipes that must be executed before this one). |
notAfterDeps |
(list) The dependences notAfter (pipes that cannot be executed after this one). |
An object of class R6ClassGenerator
of length 24.
This class needs files in json format that will contain the stop words to be located. For this it is necessary that the instance contains a property that indicates the language of the data to be able to correctly choose the list of stop words that apply to the data. The format of the file names of the resources has to be: xxx.json (Being xxx the value of the language property of the instance).
To indicate the path where the associated resources are located, the configuration file is used. It is necessary to indicate in the section called resourcesPath, the path of resourcesStopWordsPath.
The pipe will invalidate the instance in the moment that the resulting data is empty.
This class inherits from PipeGeneric
and implements the
pipe
abstract function.
pipe Function that preprocesses the instance to obtain/remove the stop words. The stop words found in the pipe are added to the list of properties of the Instance. If the removeStopWords parameter is TRUE, the instance data will be removed.
Usage
pipe(instance, removeStopWords = TRUE)
Value
The instance with the modifications that have occurred in the pipe.
Arguments
instance (Instance) Instance to preproccess.
removeStopWords (logical) Indicates if the stop words are removed or not.
findStopWord Function that checks if the stop word is in the data.
Usage
findStopWord(data, stopWord)
Value
TRUE or FALSE depending on whether the stop word is on the data.
Arguments
data (character) Text in which the stop word is searched.
stopWord (character) Indicates the stop word to find.
removeStopWord Function that removes the stop word in the data.
Usage
removeStopWord(stopWord, data)
Value
The data with stop word removed.
Arguments
stopWord (character) Indicates the stop word to remove.
data (character) Text in which stop words will be removed.
getPropertyLanguageName Getter of name of property language.
Usage
getPropertyLanguageName()
Value
Value of name of property language.
getPathResourcesStopWords Getter of path of stop words resources.
Usage
getPathResourcesStopWords()
Value
Value of path of stop words resources.
propertyLanguageName (character) The name of property about language.
pathResourcesStopWords (character) The path where are the resources.
PipeGeneric
, Instance
,
ResourceHandler
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.