linguistic.relatedness: Hartigan (1975) Relatedness Values of Selected Words

Description Usage Format Details Source References Examples

Description

Frequencies with which a pair is judged more highly related than other pairs, over many triads and subjects. This is Table 10.4 in Chapter 10 of Hartigan (1975) on page 184.

Usage

1

Format

A data frame with 6 observations on the following 7 variables.

word

a character vector for the

the

a numeric vector for the frequency with which words are related to 'the'

boy

a numeric vector for the frequency with which words are related to 'boy'

has

a numeric vector for the frequency with which words are related to 'has'

lost

a numeric vector for the frequency with which words are related to 'lost'

a

a numeric vector for the frequency with which words are related to 'a'

dollar

a numeric vector for the frequency with which words are related to 'dollar'

Details

This is an unusual data set to be used with the triads-leader algorithm.

Source

Levelt, W. J. M (1967). Psychological representations of syntactic structures, in The Structure and Psychology of Language, T. G. Bever and W. Weksel, eds, Holt, Rinehart and Winston, New York.

SPAETH2 Cluster Analysis Datasets http://people.sc.fsu.edu/~jburkardt/datasets/spaeth2/spaeth2.html

References

Hartigan, J. A. (1975). Clustering Algorithms, John Wiley, New York.

Examples

1

cluster.datasets documentation built on May 2, 2019, 3:39 p.m.