View source: R/tidy_colo_dtm.R
tidy_colo_tdm | R Documentation |
DocumentTermMatrix
/TermDocumentMatrix
into Collocating Words in Tidy FormConverts non-zero elements of a
DocumentTermMatrix
/TermDocumentMatrix
into a tidy data set made of collocating words.
tidy_colo_tdm(x, ...)
tidy_colo_dtm(x, ...)
x |
A
|
... |
Ignored. |
Returns a tidied data.frame.
unique_pairs
data(simple_dtm)
tidied <- tidy_colo_dtm(simple_dtm)
tidied
unique_pairs(tidied)
## Not run:
if (!require("pacman")) install.packages("pacman")
pacman::p_load_current_gh('trinker/gofastr', 'trinker/lexicon')
pacman::p_load(tidyverse, magrittr, ggstance)
my_dtm <- with(
presidential_debates_2012,
q_dtm(dialogue, paste(time, tot, sep = "_"))
)
tidy_colo_dtm(my_dtm) %>%
tbl_df() %>%
filter(!term_1 %in% c('i', lexicon::sw_onix) &
!term_2 %in% lexicon::sw_onix
) %>%
filter(term_1 != term_2) %>%
unique_pairs() %>%
filter(n > 15) %>%
complete(term_1, term_2, fill = list(n = 0)) %>%
ggplot(aes(x = term_1, y = term_2, fill = n)) +
geom_tile() +
scale_fill_gradient(low= 'white', high = 'red') +
theme(axis.text.x = element_text(angle = 45, hjust = 1))
## End(Not run)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.