Description Usage Arguments Value See Also Examples
View source: R/tidy_colo_dtm.R
Converts non-zero elements of a
DocumentTermMatrix
/TermDocumentMatrix
into a tidy data set made of collocating words.
1 2 3 | tidy_colo_tdm(x, ...)
tidy_colo_dtm(x, ...)
|
x |
A
|
... |
Ignored. |
Returns a tidied data.frame.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 | data(simple_dtm)
tidied <- tidy_colo_dtm(simple_dtm)
tidied
unique_pairs(tidied)
## Not run:
if (!require("pacman")) install.packages("pacman")
pacman::p_load_current_gh('trinker/gofastr', 'trinker/lexicon')
pacman::p_load(tidyverse, magrittr, ggstance)
my_dtm <- with(
presidential_debates_2012,
q_dtm(dialogue, paste(time, tot, sep = "_"))
)
tidy_colo_dtm(my_dtm) %>%
tbl_df() %>%
filter(!term_1 %in% c('i', lexicon::sw_onix) &
!term_2 %in% lexicon::sw_onix
) %>%
filter(term_1 != term_2) %>%
unique_pairs() %>%
filter(n > 15) %>%
complete(term_1, term_2, fill = list(n = 0)) %>%
ggplot(aes(x = term_1, y = term_2, fill = n)) +
geom_tile() +
scale_fill_gradient(low= 'white', high = 'red') +
theme(axis.text.x = element_text(angle = 45, hjust = 1))
## End(Not run)
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.