linguistic.relatedness: Hartigan (1975) Relatedness Values of Selected Words
In cluster.datasets: Cluster Analysis Data Sets

Frequencies with which a pair is judged more highly related than other pairs, over many triads and subjects. This is Table 10.4 in Chapter 10 of Hartigan (1975) on page 184.

data(linguistic.relatedness)

A data frame with 6 observations on the following 7 variables.

word: a character vector for the word labelling each row
the: a numeric vector for the frequency with which words are related to 'the'
boy: a numeric vector for the frequency with which words are related to 'boy'
has: a numeric vector for the frequency with which words are related to 'has'
lost: a numeric vector for the frequency with which words are related to 'lost'
a: a numeric vector for the frequency with which words are related to 'a'
dollar: a numeric vector for the frequency with which words are related to 'dollar'
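
As a minimal sketch of working with this format, the snippet below loads the data frame and reshapes the six numeric columns into a matrix with the words as row names. It assumes the cluster.datasets package is installed and that the columns are named as listed above; the object names `value_cols` and `rel` are illustrative only and are not part of the package.

```r
# Assumption: the cluster.datasets package is installed,
# e.g. via install.packages("cluster.datasets").
library(cluster.datasets)
data(linguistic.relatedness)

# Inspect the structure: a 'word' column plus six numeric columns.
str(linguistic.relatedness)

# Illustrative names (not from the package): select every column except
# 'word' and convert to a numeric matrix keyed by word.
value_cols <- setdiff(names(linguistic.relatedness), "word")
rel <- as.matrix(linguistic.relatedness[, value_cols])
rownames(rel) <- as.character(linguistic.relatedness$word)

dim(rel)
```

Selecting columns by name rather than by position keeps the sketch valid even if `word` is not stored as the first column.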

This is an unusual data set to be used with the triads-leader algorithm.

Levelt, W. J. M. (1967). Psychological representations of syntactic structures, in The Structure and Psychology of Language, T. G. Bever and W. Weksel, eds., Holt, Rinehart and Winston, New York.

SPAETH2 Cluster Analysis Datasets http://people.sc.fsu.edu/~jburkardt/datasets/spaeth2/spaeth2.html

Hartigan, J. A. (1975). Clustering Algorithms, John Wiley, New York.