text_to_words: Split string(s) of text 'x' into words.

Description Usage Arguments Details Value See Also Examples

View source: R/text_fun.R

Description

text_to_words splits a string of text x (consisting of one or more character strings) into a vector of its constituting words.

Usage

1

Arguments

x

A string of text (required), typically a character vector.

Details

text_to_words removes all (standard) punctuation marks and empty spaces in the resulting text parts, before returning a vector of the remaining character symbols (as its words).

Internally, text_to_words uses strsplit to split strings at punctuation marks (split = "[[:punct:]]") and blank spaces (split = "( ){1,}").

Value

A character vector (of words).

See Also

text_to_sentences for splitting text into a vector of sentences; text_to_chars for splitting text into a vector of characters; count_words for counting the frequency of words; strsplit for splitting strings.

Other text objects and functions: Umlaut, capitalize(), caseflip(), cclass, count_chars_words(), count_chars(), count_words(), l33t_rul35, map_text_chars(), map_text_coord(), map_text_regex(), metachar, read_ascii(), text_to_chars(), text_to_sentences(), transl33t()

Examples

1
2
3
# Default: 
x <- c("Hello!", "This is a 1st sentence.", "This is the 2nd sentence.", "The end.")
text_to_words(x)

ds4psy documentation built on May 12, 2021, 9:07 a.m.