Description Usage Arguments Examples
Quanteda analysis of text vector
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 | qe_texts(
texts = NULL,
method = "word",
remove_words = c("\\.", "\\-", "\\#", "\\'", "\\,", "\\;", "\\_",
"\\DESCRIPTION:", "\\:", "\\SBIR", "\\I ", "\\II ", "\\III ", "PHASE"),
dfm_dictionary = NULL,
n_top_features = 10,
stem = F,
exclude_features = F,
remove_numbers = T,
remove_punct = T,
remove_symbols = T,
remove_separators = TRUE,
remove_twitter = T,
remove_hyphens = T,
collocation_size = 3,
include_textstat = T,
remove_url = FALSE,
stop_sources = c("smart", "snowball", "stopwords-iso"),
n_gram_tokens = 2,
include_dfm = F,
verbose = T
)
|
texts |
vector of text |
method |
what the unit for splitting the text, available alternatives are:
|
dfm_dictionary |
if not |
n_top_features |
if not |
stem |
if |
exclude_features |
if |
remove_numbers |
logical; if |
remove_separators |
logical; if |
remove_twitter |
logical; if |
remove_hyphens |
logical; if |
collocation_size |
integer collocation size for texstat parameter |
include_textstat |
if |
remove_url |
logical; if |
stop_sources |
stop word source
|
n_gram_tokens |
|
include_dfm |
if |
verbose |
if |
1 | qe_texts(texts = "HIGH SURFACE AREA NON-OXIDE CERAMIC ELECTRODES FOR ULTRACAPACITORS", n_gram_tokens = 1:4)
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.