datasets: Data sets provided with the MPIA package


Description

The package comes with three data sets. As with all corpus data, the corpora provided in the Domain objects have to be utilised with care: they are all general purpose and may not contain professional vocabulary required for a particular application. If words are not contained in the vocabulary, they will simply be disregarded in any analysis (garbage in, garbage out!).

The first one, generic.domain, contains a small standard English corpus, suitable for many try-out applications.

The second one, essays.domain, contains a small German domain corpus d. Complementary essay data is provided in three supplementary vectors: essays.scores holds the scores of the essays (assigned by human raters), essays.students holds randomly assigned first names (with the number of the collection they are from in brackets), and essays contains character vectors with the actual essay text. These serve demonstration purposes only; call demo("essay-scoring") to see how they are used.

Finally, docs, im, and space contain the objects resulting from demo("sna-lsa-mpia"), for easier referencing in tutorials.

All these data sets are also used in the package tests.

Usage

Details

Note that generic.domain contains an object d.
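
The note above can be checked with a minimal sketch along the following lines. It assumes that the mpia package is installed and that the data set can be loaded by name with the base R function data(); the environment name env is purely illustrative. Loading into a separate environment makes it easy to see which object names the data set actually creates without cluttering the workspace.

```r
# Illustrative sketch: load a data set into a scratch environment and
# list the names of the objects it created. 'env' is a hypothetical name.
env <- new.env()

# Assumption: the mpia package is installed; the block is skipped otherwise.
if (requireNamespace("mpia", quietly = TRUE)) {
  data("generic.domain", package = "mpia", envir = env)
  # According to the note above, this listing should include an object d.
  print(ls(env))
}
```

Calling data() without the envir argument would place the object in the calling environment instead, which is the more common usage in interactive sessions.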

Value

Fetches the corresponding data set from the package's data directory.

Author(s)

Fridolin Wild <wild@brookes.ac.uk>

References

Fridolin Wild (2016): Learning Analytics in R with SNA, LSA, and MPIA. Springer: Berlin.

See Also

dimcalc_raw()

Examples

dimcalc_var()(c(5,4,3,2,1))

mpia documentation built on May 2, 2019, 4:18 p.m.