plot.LexCA: Plot of LexCA objects
In Xplortext: Statistical Analysis of Textual Data

plot.LexCA

R Documentation

Plot of LexCA objects

Description

Plots textual correspondence analysis (CA) graphs from a LexCA object.

Usage

## S3 method for class 'LexCA'
plot(x, selDoc="ALL", selWord="ALL", selSeg=NULL, selDocSup=NULL,
  selWordSup=NULL, quanti.sup=NULL, quali.sup=NULL, maxDocs=20, eigen=FALSE, 
  title=NULL, axes=c(1,2), col.doc="blue", col.word="red", col.doc.sup="darkblue", 
  col.word.sup="darkred", col.quanti.sup = "blue", col.quali.sup="darkgreen", 
  col.seg="cyan4", col="grey", cex=1, xlim=NULL, ylim=NULL, shadowtext=FALSE,
  habillage="none", unselect=1, label="ALL", autoLab=c("auto", "yes", "no"), 
  new.plot=TRUE, graph.type = c("classic", "ggplot"),...)

Arguments

`x`	object of LexCA class
`selDoc`	vector with the active documents to plot (indexes, names or rules; see details; by default "ALL")
`selWord`	vector with the active words to plot (indexes, names or rules; see details; by default "ALL")
`selSeg`	vector with the supplementary repeated segments to plot (indexes, names or rules; see details; by default NULL)
`selDocSup`	vector with the supplementary documents to plot (indexes, names or rules; see details; by default NULL)
`selWordSup`	vector of the supplementary words to plot (indexes, names or rules; see details; by default NULL)
`quanti.sup`	vector of the supplementary quantitative variables to plot (indexes, names or rules; see details; by default NULL)
`quali.sup`	vector with the supplementary categorical variables/categories to plot (indexes, names or rules; see details; by default NULL). The selected categories (through the variables or directly) are plotted
`maxDocs`	limit to the number of active documents in the lexical table when selecting the words to be plotted for being characteristic of the selected documents (by default 20)
`eigen`	if TRUE, the eigenvalues barplot is drawn (by default FALSE); no other elements can be simultaneously selected
`title`	title of the graph (by default NULL and the title is automatically assigned)
`axes`	length-2 vector indicating the axes considered in the graph (by default c(1,2))
`col.doc`	color for the point-documents (by default "blue")
`col.word`	color for the point-words (by default "red")
`col.doc.sup`	color for the supplementary point-documents (by default "darkblue")
`col.word.sup`	color for the supplementary point-words (by default "darkred")
`col.quanti.sup`	color for the quanti.sup variables (by default "blue")
`col.quali.sup`	color for the categorical supplementary point-categories, (by default "darkgreen")
`col.seg`	color for the supplementary point-repeated segments, (by default "cyan4")
`col`	color for the bars in the eigenvalues barplot (by default "grey")
`cex`	text and symbol size is scaled by cex, in relation to size 1 (by default 1)
`xlim`	range for 'x' values on the graph, defaulting to the finite values of 'x' range (by default NULL)
`ylim`	range for the 'y' values on the graph, defaulting to the the finite values of 'y' range (by default NULL)
`shadowtext`	if TRUE, shadow on the labels (rectangles are written under the labels which may lead to difficulties to modify the graph with another program) (by default FALSE)
`habillage`	index or name of the categorical variable used to differentiate the documents by colors given according to the category; by default "none")
`unselect`	either a value between 0 and 1 or a color. In the first case, transparency level of the unselected objects (if unselect=1 the transparency is total and the elements are not represented; if unselect=0 the elements are represented as usual but without any label); in the case of a color (e.g. unselect="grey60"), the non-selected points are given this color (by default 1)
`label`	a list of character for the variables which are labelled (by default ALL and all the drawn variables are labelled). You can label all the active variables by putting "var" and/or all the supplementary variables by putting "quanti.sup" and/or a list with the names of the variables which should be labelled. Value should be one of "all", "none", "row", "row.sup", "col", "col.sup", "quali.sup" or NULL.
`autoLab`	if autoLab="auto", autoLab turns to be equal to "yes" if there are less than 50 elements and equal to "no" otherwise; if "yes", the labels are moved, as little as possible, to avoid overlapping (time-consuming if many elements); if "no" the labels are placed quickly but may overlap
`new.plot`	if TRUE, a new graphical device is created (by default FALSE)
`graph.type`	a string that gives the type of graph used: "ggplot" or "classic" (by default classic)
`...`	further arguments passed from other methods...

Details

The argument autoLab = "yes" is time-consuming if many overlapping labels. Furthermore, the visualization of the words cloud can result distorted because of the apparent greater dispersion of the words labels. An alternative would be reducing the character size of the words labels to reduce overlapping (e.g. cex=0.7).

selDoc, selWord, selSeg, selDocSup, selWordSup, quanti.sup and quali.sup allow for selecting all or part of the elements of the corresponding type, using either labels, indexes or rules.

The syntax is the same for all types.

1. Using labels:

selDoc = c("doc1","doc5"): only the documents with labels doc1 and doc5 are plotted.
quali.sup=c("varcateg1","category12"): only the categories (all of them) of 
   categorical variable labeled "varcateg1" and the category labeled "category12"
   are plotted.

2.- Using indexes:

selDoc = c(1:5): documents 1 to 5 are plotted.
quali.sup=c(1:5,7): categories 1 to 5 and 7 are plotted. The numbering of the
   categories have to be consulted in the LexCA numerical results.

3.- Using rules: Rules are based on the coordinates (coord), the contribution (contrib or meta; concerning only active elements) or the square cosine (cos2).
Somes examples are given hereafter:

selDoc="coord 10": only the 10 documents with the highest coordinates, as globally
   computed on the 2 axes, are plotted.
selWord="contrib 10": the words with a contribution to the inertia, of any of 
   the 2 axes.
selWord="meta 3": the words with a contribution over 3 times the average word 
   contribution on any of the two axes are plotted. Only active words or documents 
   can be selected.
selDocSup="cos2 .85": the supplementary documents with a cos2 over 0.85, as summed
   on the 2 axes, are plotted.
selWord="char 0.05": only the characteristic words of the documents selected in 
   SelDoc are plotted. The selection of the words follow the rationale used in 
   function LexChar using as limit for the p-value the value given, here.0.05.

Author(s)

Ramón Alvarez-Esteban ramon.alvarez@unileon.es, Mónica Bécue-Bertaut, Josep-Antón Sánchez-Espigares

References

Husson F., Lê S., Pagés J. (2011). Exploratory Multivariate Analysis by Example Using R. Chapman & Hall/CRC. \Sexpr[results=rd]{tools:::Rd_expr_doi("10.1201/b10345")}.

Examples

data(open.question)
res.TD<-TextData(open.question,var.text=c(9,10), var.agg="Age_Group", Fmin=10, Dmin=10,
        remov.number=TRUE, stop.word.tm=TRUE)
res.CA <- LexCA(res.TD, graph=FALSE)
plot(res.CA, selDoc="contrib 30", selWord="coord 20")

Xplortext documentation built on April 12, 2025, 2:13 a.m.