Description Usage Arguments Details Value Examples
Plot a cloud comparing the frequencies of words across documents.
1 2 |
term.matrix |
A term frequency matrix whose rows represent words and whose columns represent documents. |
scale |
A vector of length 2 indicating the range of the size of the words. |
max.words |
Maximum number of words to be plotted. least frequent terms dropped |
random.order |
plot words in random order. If false, they will be plotted in decreasing frequency |
rot.per |
proportion words with 90 degree rotation |
colors |
color words from least to most frequent |
use.r.layout |
if false, then c++ code is used for collision detection, otherwise R is used |
title.size |
Size of document titles |
... |
Additional parameters to be passed to text (and strheight,strwidth). |
Let p_{i,j} be the rate at which word i occurs in document j, and p_j be the average across documents(∑_ip_{i,j}/ndocs). The size of each word is mapped to its maximum deviation ( max_i(p_{i,j}-p_j) ), and its angular position is determined by the document where that maximum occurs.
nothing
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 | if(require(tm)){
data(SOTU)
corp <- SOTU
corp <- tm_map(corp, removePunctuation)
corp <- tm_map(corp, removePunctuation)
corp <- tm_map(corp, tolower)
corp <- tm_map(corp, removeNumbers)
corp <- tm_map(corp, function(x)removeWords(x,stopwords()))
term.matrix <- TermDocumentMatrix(corp)
term.matrix <- as.matrix(term.matrix)
colnames(term.matrix) <- c("SOTU 2010","SOTU 2011")
comparison.cloud(term.matrix,max.words=40,random.order=FALSE)
commonality.cloud(term.matrix,max.words=40,random.order=FALSE)
}
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.