zeta.craig: Compare two subcorpora using Craig's Zeta
In computationalstylistics/stylo: Stylometric Multivariate Analyses

zeta.craig

R Documentation

Compare two subcorpora using Craig's Zeta

Description

This is a function for comparing two sets of texts; unlike keywords analysis, it this method the goal is to split input texts into equal-sized slices, and to check the appearance of particular words over the slices. Number of slices in which a given word appeared in the subcorpus A and B is then compared using Craig's formula, which is based on original Zeta as developed by Burrows (Craig and Kinney 2009, Burrows 2007).

Usage

zeta.craig(input.data, filter.threshold)

Arguments

`input.data`	a matrix of two columns.
`filter.threshold`	this parameter (default 0.1) gets rid of words of weak discrimination strength; the higher the number, the less words appear in the final wordlists. It does not normally exceed 0.5. In original Craig's Zeta, no threshold is used: instead, the results contain the fixed number of 500 top avoided and 500 top preferred words.

Value

The function returns a list of two elements: the first contains words (or other units, like n-grams) statistically preferred by the authors of the primary subcorpus, while the second element contains avoided words. Since the applied measure is symmetrical, the preferred words are ipso facto avoided by the secondary authors, and vice versa.

Author(s)

Maciej Eder

References

Burrows, J. F. (2007). All the way through: testing for authorship in different frequency strata. "Literary and Linguistic Computing", 22(1): 27-48.

Craig, H. and Kinney, A. F., eds. (2009). Shakespeare, Computers, and the Mystery of Authorship. Cambridge: Cambridge University Press.

Examples

## Not run: 
zeta.craig(input.data, filter.threshold)

## End(Not run)

computationalstylistics/stylo documentation built on Jan. 4, 2025, 1:56 p.m.

computationalstylistics/stylo index

README.md

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

computationalstylistics/stylo
Stylometric Multivariate Analyses

zeta.craig: Compare two subcorpora using Craig's Zeta
In computationalstylistics/stylo: Stylometric Multivariate Analyses

Compare two subcorpora using Craig's Zeta

Description

Usage

Arguments

Value

Author(s)

References

See Also

Examples

Related to zeta.craig in computationalstylistics/stylo...

R Package Documentation

Browse R Packages

We want your feedback!

computationalstylistics/stylo Stylometric Multivariate Analyses

zeta.craig: Compare two subcorpora using Craig's Zeta In computationalstylistics/stylo: Stylometric Multivariate Analyses

Compare two subcorpora using Craig's Zeta

Description

Usage

Arguments

Value

Author(s)

References

See Also

Examples

Related to zeta.craig in computationalstylistics/stylo...

R Package Documentation

Browse R Packages

We want your feedback!

computationalstylistics/stylo
Stylometric Multivariate Analyses

zeta.craig: Compare two subcorpora using Craig's Zeta
In computationalstylistics/stylo: Stylometric Multivariate Analyses