Description Usage Arguments Details Value See Also Examples
In the zipfR library, spc objects are used to represent
a word frequency spectrum (either an observed spectrum or the expected
spectrum of a LNRE model at a given sample size).
With the spc constructor function, an object can be initialized
directly from the specified data vectors. It is more common to read
an observed spectrum from a disk file with read.spc or
compute an expected spectrum with lnre.spc, though.
spc objects should always be treated as read-only.
1 2 |
m |
integer vector of frequency classes m (if omitted,
|
Vm |
vector of corresponding class sizes V_m (may be fractional for expected frequency spectrum E[V_m]) |
VVm |
optional vector of estimated variances Var[V_m] (for expected frequency spectrum only) |
N, V |
total sample size N and vocabulary size V of
frequency spectrum. While these values are usually determined
automatically from |
VV |
variance Var[V] of expected
vocabulary size. If |
m.max |
highest frequency class m listed in incomplete
spectrum. If |
expected |
set to |
A spc object is a data frame with the following variables:
mfrequency class m, an integer vector
Vmclass size, i.e. number V_m of types in frequency class m (either observed class size from a sample or expected class size E[V_m] based on a LNRE model)
VVmoptional: estimated variance V[V_m] of expected class size (only meaningful for expected spectrum derived from LNRE model)
The following attributes are used to store additional information about the frequency spectrum:
m.maxif non-zero, the frequency spectrum is
incomplete and lists only frequency classes up to m.max
N, Vsample size N and vocabulary size V
of the frequency spectrum. For a complete frequency spectrum,
these values could easily be determined from m and
Vm, but they are essential for an incomplete spectrum.
VVvariance of expected vocabulary size; only present
if hasVariances is TRUE. Note that VV may
have the value NA is the user failed to specify it.
expectedif TRUE, frequency spectrum lists
expected class sizes E[V_m] (rather than observed
sizes V_m). Note that the VVm variable is only
allowed for an expected frequency spectrum.
hasVariancesindicates whether or not the VVm
variable is present
An object of class spc representing the specified frequency
spectrum. This object should be treated as read-only (although such
behaviour cannot be enforced in R).
read.spc, write.spc,
spc.vector, sample.spc,
spc2tfl, tfl2spc,
lnre.spc, plot.spc
Generic methods supported by spc objects are
print, summary, N,
V, Vm, VV, and
VVm.
Implementation details and non-standard arguments for these methods
can be found on the manpages print.spc,
summary.spc, N.spc, V.spc,
etc.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 | ## load Brown imaginative prose spectrum and inspect it
data(BrownImag.spc)
summary(BrownImag.spc)
print(BrownImag.spc)
plot(BrownImag.spc)
N(BrownImag.spc)
V(BrownImag.spc)
Vm(BrownImag.spc,1)
Vm(BrownImag.spc,1:5)
## compute ZM model, and generate PARTIAL expected spectrum
## with variances for a sample of 10 million tokens
zm <- lnre("zm",BrownImag.spc)
zm.spc <- lnre.spc(zm,1e+7,variances=TRUE)
## inspect extrapolated spectrum
summary(zm.spc)
print(zm.spc)
plot(zm.spc,log="x")
N(zm.spc)
V(zm.spc)
VV(zm.spc)
Vm(zm.spc,1)
VVm(zm.spc,1)
## generate an artificial Zipfian-looking spectrum
## and take a look at it
zipf.spc <- spc(round(1000/(1:1000)^2))
summary(zipf.spc)
plot(zipf.spc)
## see manpages of lnre, and the various *.spc mapages
## for more examples of spc usage
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.