word_diff_list: Differences In Word Use Between Groups


Description

Compare word use across the levels of one or more grouping variables. Either every possible "a" vs. "b" pair of groups is compared, or each group is compared against all others ("a" vs. all).

Usage

  word_diff_list(text.var, grouping.var, vs.all = FALSE,
    vs.all.cut = 1, stopwords = NULL, alphabetical = FALSE,
    digits = 2)

Arguments

text.var

The text variable.

grouping.var

The grouping variables. Takes a single grouping variable or a list of 1 or more grouping variables.

vs.all

logical. If TRUE, compares each group against all others ("a" vs. all comparison). If FALSE, compares each "a" vs. "b" pair (e.g., for groups "a", "b", and "c", the comparisons "a" vs. "b", "a" vs. "c", and "b" vs. "c" are considered).
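The two comparison modes can be sketched in base R. This is only an illustration of which comparisons each setting implies, not the function's internals; the group labels and the object names pairs and vs_all are made up for the example.

```r
# Hypothetical group labels, used only for illustration
groups <- c("a", "b", "c")

# vs.all = FALSE: one comparison per unordered pair of groups
pairs <- combn(groups, 2, paste, collapse = " vs. ")

# vs.all = TRUE: one comparison per group, each against all the others
vs_all <- paste(groups, "vs. all")

print(pairs)
print(vs_all)
```

With three groups both modes give three comparisons, but the pairwise count grows as n(n - 1)/2 while the vs. all count stays at n.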

vs.all.cut

If vs.all = TRUE, this argument controls the number of other groups that may share a word (default is 1).

stopwords

A vector of stop words to remove.

alphabetical

logical. If TRUE, orders the word lists alphabetically by word. If FALSE, orders first by frequency and then by word.

digits

The number of digits to be displayed in the proportion column (default is 2, per the Usage signature).

Value

A list of word data frames comparing the groups' word use against one another. Each data frame contains three columns:

word

The words unique to that group

freq

The number of times that group used that word

prop

The proportion of that group's overall word use dedicated to that particular word
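As a rough illustration of the three columns, the following base R sketch builds a comparable data frame for one group by hand. The word vectors a and b are invented, and the steps are an assumption about what the columns mean based on the descriptions above; the function itself may tokenize, remove stop words, and round differently.

```r
# Made-up word vectors for two groups (assumption: already tokenized)
a <- c("the", "cat", "sat", "the", "mat")
b <- c("the", "dog", "sat")

# Frequency of every word used by group "a"
tab <- table(a)

# Words that "a" uses and "b" does not
unique_a <- setdiff(names(tab), b)

res <- data.frame(
  word = unique_a,
  freq = as.integer(tab[unique_a]),
  # share of all of group "a"'s words, rounded as with digits = 2
  prop = round(as.integer(tab[unique_a]) / length(a), 2),
  stringsAsFactors = FALSE
)

# Order by frequency (descending), then by word, as with alphabetical = FALSE
res <- res[order(-res$freq, res$word), ]
print(res)
```

Here "cat" and "mat" are the only words unique to the first group; each occurs once among five words.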

Examples

out1 <- with(DATA, word_diff_list(text.var = state,
    grouping.var = list(sex, adult)))
lapply(unlist(out1, recursive = FALSE), head, n=3)

out2 <- with(DATA, word_diff_list(state, person))
lapply(unlist(out2, recursive = FALSE), head, n=3)

out3 <- with(DATA, word_diff_list(state, grouping.var = list(sex, adult),
    vs.all=TRUE, vs.all.cut=2))


out4 <- with(mraja1, word_diff_list(text.var = dialogue,
    grouping.var = list(mraja1$sex, mraja1$fam.aff)))


out5 <- word_diff_list(mraja1$dialogue, mraja1$person)

out6 <- word_diff_list(mraja1$dialogue, mraja1$fam.aff, stopwords = Top25Words)

out7 <- word_diff_list(mraja1$dialogue, mraja1$fam.aff, vs.all=TRUE, vs.all.cut=2)
lapply(out7, head, n=3)

trinker/qdap2 documentation built on May 31, 2019, 9:47 p.m.