| FindUrlPipe | R Documentation |
This class is responsible for detecting the URLs in the
data field of each Instance. Identified URLs are
stored in the URLs field of the Instance.
If required, it can also remove the URLs from the data.
The regular expressions indicated in the URLPatterns
variable are used to identify URLs.
FindUrlPipe will automatically invalidate the
Instance whenever the obtained data is empty.
This class inherits from GenericPipe and implements the
pipe abstract function.
bdpar::GenericPipe -> FindUrlPipe
URLPattern: A character value. The regular expression to detect URLs.
EmailPattern: A character value. The regular expression to detect emails.
new(): Creates a FindUrlPipe object.
FindUrlPipe$new(
  propertyName = "URLs",
  alwaysBeforeDeps = list(),
  notAfterDeps = list("FindUrlPipe"),
  removeUrls = TRUE,
  URLPatterns = list(self$URLPattern, self$EmailPattern),
  namesURLPatterns = list("UrlPattern", "EmailPattern")
)
propertyName: A character value. Name of the property associated with the GenericPipe.
alwaysBeforeDeps: A list value. The alwaysBefore dependencies (GenericPipes that must be executed before this one).
notAfterDeps: A list value. The notAfter dependencies (GenericPipes that cannot be executed after this one).
removeUrls: A logical value. Indicates whether the URLs are removed from the data.
URLPatterns: A list value. The regular expressions used to find URLs.
namesURLPatterns: A list value. The names of the regular expressions.
propertyLanguageName: A character value. Name of the language property.
pipe(): Preprocesses the Instance to obtain and, if requested, remove
the URLs. The URLs found in the data are added to the
list of properties of the Instance.
FindUrlPipe$pipe(instance)
instance: An Instance value. The Instance to preprocess.
Returns: The Instance with the modifications made by the pipe.
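The flow described for pipe() (find the matches, store them as a property, optionally remove them, and invalidate the Instance when nothing is left) can be sketched in base R. This is only an illustration under stated assumptions: a plain list stands in for an Instance, the field names data, properties and valid, the helper pipe_sketch and the simplified pattern are all hypothetical, and none of this is the package's own code.

```r
# Hypothetical stand-in for the pipe() flow; a plain list plays the role of an Instance.
pipe_sketch <- function(instance, patterns, remove_urls = TRUE) {
  # 1. Collect the matches of every pattern, keeping the pattern names.
  instance$properties$URLs <- lapply(patterns, function(p) {
    regmatches(instance$data, gregexpr(p, instance$data, perl = TRUE))[[1]]
  })
  # 2. Optionally strip the matches from the data.
  if (remove_urls) {
    for (p in patterns) {
      instance$data <- gsub(p, " ", instance$data, perl = TRUE)
    }
    instance$data <- trimws(instance$data)
  }
  # 3. Mark the stand-in as invalid when no data is left.
  if (!nzchar(instance$data)) instance$valid <- FALSE
  instance
}

# Simplified, illustrative pattern (NOT the package's URLPattern).
patterns <- list(UrlPattern = "https?://[^[:space:]]+")

kept <- pipe_sketch(
  list(data = "see http://example.com now", properties = list(), valid = TRUE),
  patterns
)
emptied <- pipe_sketch(
  list(data = "http://example.com", properties = list(), valid = TRUE),
  patterns
)
```

In this sketch, kept remains valid because some text survives the removal, while emptied is flagged as invalid because its data consisted only of a URL.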
findUrl(): Finds the URLs in the data.
FindUrlPipe$findUrl(pattern, data)
pattern: A character value. The regular expression used to find URLs.
data: A character value. The text in which to find URLs.
Returns: The list of URLs found.
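As a rough illustration of what a method like findUrl() does, the sketch below extracts every match of a pattern from a text using base R only. The helper name find_url_sketch and the simplified pattern are assumptions made for this example; they are neither the package's implementation nor its URLPattern field.

```r
# Hypothetical helper: return all substrings of `data` that match `pattern`.
find_url_sketch <- function(pattern, data) {
  # gregexpr() locates every match; regmatches() extracts the matched text.
  matches <- regmatches(data, gregexpr(pattern, data, perl = TRUE))
  as.list(matches[[1]])
}

# Simplified, illustrative pattern (NOT the package's URLPattern).
simple_url_pattern <- "https?://[^[:space:]]+"

msg <- "Docs at https://example.org/docs and http://example.com today"
found <- find_url_sketch(simple_url_pattern, msg)
none <- find_url_sketch(simple_url_pattern, "no links here")
```

Here found holds the two addresses in the order they appear in msg, and none is an empty list because the text contains no match.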
removeUrl(): Removes the URLs from the data.
FindUrlPipe$removeUrl(pattern, data)
pattern: A character value. The regular expression used to find URLs.
data: A character value. The text from which to remove URLs.
Returns: The data with the URLs removed.
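Removal can be illustrated in the same spirit with gsub(). The helper remove_url_sketch, the simplified pattern and the whitespace tidying step are assumptions of this sketch; how the package itself treats the space left behind by a removed URL is not stated here.

```r
# Hypothetical helper: delete every match of `pattern` from `data`.
remove_url_sketch <- function(pattern, data) {
  # Replace each match with a space so neighbouring words do not get glued together.
  without_urls <- gsub(pattern, " ", data, perl = TRUE)
  # Collapse runs of whitespace and trim the ends (an assumption of this sketch).
  trimws(gsub("[[:space:]]+", " ", without_urls))
}

# Simplified, illustrative pattern (NOT the package's URLPattern).
simple_url_pattern <- "https?://[^[:space:]]+"

cleaned <- remove_url_sketch(simple_url_pattern,
                             "Docs at https://example.org/docs are online")
only_url <- remove_url_sketch(simple_url_pattern, "http://example.com")
```

A text that consists only of a URL, as in only_url, ends up as an empty string, which is the situation in which FindUrlPipe invalidates the Instance.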
putNamesURLPattern(): Assigns the pattern names to the results of the URL patterns.
FindUrlPipe$putNamesURLPattern(resultOfURLPatterns)
resultOfURLPatterns: A list value. The list of URLs found.
Returns: The URLs found, labelled with the names of the URL patterns.
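One plausible reading of this step is that each per-pattern result receives the name of its pattern. The sketch below shows that idea with base R. The helper put_names_sketch is hypothetical, and unlike the documented method it takes the names as an explicit second argument instead of reading them from the object.

```r
# Hypothetical helper: attach one name per pattern to the per-pattern results.
put_names_sketch <- function(results, pattern_names) {
  # One name is expected for each result.
  stopifnot(length(results) == length(pattern_names))
  names(results) <- unlist(pattern_names)
  results
}

# Illustrative results, one entry per pattern.
results <- list(list("https://example.org"), list("user@example.org"))
named <- put_names_sketch(results, list("UrlPattern", "EmailPattern"))
```

After this call, the matches of each pattern can be looked up by name, for example named$UrlPattern or named$EmailPattern.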
getURLPatterns(): Gets the URL patterns.
FindUrlPipe$getURLPatterns()
Returns: The value of the URL patterns.
setURLPatterns(): Sets the URL patterns.
FindUrlPipe$setURLPatterns(URLPatterns)
URLPatterns: A list value. The new value of the URL patterns.
getNamesURLPatterns(): Gets the names of the URL patterns.
FindUrlPipe$getNamesURLPatterns()
Returns: The value of the names of the URL patterns.
setNamesURLPatterns(): Sets the names of the URL patterns.
FindUrlPipe$setNamesURLPatterns(namesURLPatterns)
namesURLPatterns: A list value. The new value of the names of the URL patterns.
clone(): The objects of this class are cloneable with this method.
FindUrlPipe$clone(deep = FALSE)
deep: Whether to make a deep clone.
AbbreviationPipe, ContractionPipe,
File2Pipe, FindEmojiPipe,
FindEmoticonPipe, FindHashtagPipe,
FindUserNamePipe, GuessDatePipe,
GuessLanguagePipe, Instance,
InterjectionPipe, MeasureLengthPipe,
GenericPipe, SlangPipe,
StopWordPipe, StoreFileExtPipe,
TargetAssigningPipe, TeeCSVPipe,
ToLowerCasePipe