RemoveStopwords: Remove Persian stop-words.

Description

Defines a list of Persian stopwords and removes them from a string.

Usage

RemoveStopwords(texts)

Arguments

texts

A string from which Persian stopwords should be removed.

Value

RemoveStopwords returns a string with Persian stopwords removed.

Author(s)

Safshekan, Nielsen

Examples

# Create Persian string with stopwords
x <- '\u0627\u0632 
\u062F\u0627\u0646\u0634\u06AF\u0627\u0647 
\u0622\u0645\u062F'

# Remove newline characters and fix half-spaces in the string.
x <- RemNewlineHalfspace(x)

# Remove all characters that are not Latin, Persian or punctuation, 
# and standardize Persian characters.
x <- RefineChars(x)

# Remove stopwords
RemoveStopwords(x)

Example output

[1] "دانشگاه آمد"
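
For readers who want a feel for what stopword removal involves, the following is a minimal base-R sketch of the general idea. It is not the package's implementation: the helper name `remove_stopwords_sketch` and the two-word stopword vector are assumptions made for illustration, and the list that RemoveStopwords defines internally is presumably much longer.

```r
# Hypothetical helper, NOT part of PersianStemmer: drops every token that
# appears in a caller-supplied stopword vector.
remove_stopwords_sketch <- function(text, stopwords) {
  # Split the input on runs of whitespace
  tokens <- strsplit(text, "[[:space:]]+")[[1]]
  # Keep non-empty tokens that are not in the stopword vector
  kept <- tokens[nzchar(tokens) & !(tokens %in% stopwords)]
  # Re-join the remaining tokens with single spaces
  paste(kept, collapse = " ")
}

# Illustrative stopword vector (assumed; contains only two Persian prepositions)
stopwords <- c("\u0627\u0632", "\u0628\u0647")

# The same three words as in the example above, already on one line
x <- "\u0627\u0632 \u062F\u0627\u0646\u0634\u06AF\u0627\u0647 \u0622\u0645\u062F"
remove_stopwords_sketch(x, stopwords)
```

The sketch matches whole whitespace-delimited tokens only, so it assumes the text has already been cleaned with RemNewlineHalfspace and RefineChars as in the example above.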

PersianStemmer documentation built on June 28, 2019, 5:03 p.m.