Description Usage Arguments Details Value Author(s) References See Also Examples
This function aggregates vectorial data in the form of a matrix for the iterative application of Variability-based Neighbor Clustering (VNC, Gries and Hilpert 2008, 2012) and is part of the diahclust
function.
1 | aggregate_data(x, y)
|
x |
an object of class |
y |
a data matrix where each column represents a vector containing the changing syntactic features extracted from a text. |
aggregate_data
is called inside the diahclust
function and is in charge of the aggregation of the data for the iterative application of the vnc
function. Data points which belong to a single cluster are aggregated by averaging the corresponding syntactic vectors in the underlying dataset with aggregate_data
. To keep track of the texts and time stages which form clusters across the iterations, the names of the new vectors consist of the sequence of the names of the aggregated vectors.
A data matrix containing the aggregated vectors of change.
Christin Schätzle
Stefan Th. Gries and Martin Hilpert. 2008. The identification of stages in diachronic data: variability-based neighbour clustering. Corpora, 3(1):59–81. Stefan Th. Gries and Martin Hilpert. 2012. Variability-based neighbor clustering: A bottom-up approach to periodization in historical linguistics. In Nevalainen Terttu and Elizabeth Closs Traugott, editors, The Oxford Handbook of the History of English, pages 134–144. Oxford University Press, Oxford. Christin Schätzle and Hannah Booth. 2019. DiaHClust: an iterative hierarchical clustering apprach for identifying stages in language change. to appear.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 | icelandic=data(icelandic)
icelandic.cor=cor(icelandic[,-1]) #[,-1] because rows are labeled
icelandic.dist=dist(icelandic.cor)
icelandic.vnc=vnc(icelandic.dist, method="average")
plot(icelandic.vnc, hang=-1) #plotting the resulting dendrogram
#cluster validation for identification of time stages
optimal=optimal_clust(icelandic.vnc, icelandic.dist)
#aggregation of data for DiaHClust
icelandic_aggregated=aggregate_data(optimal, icelandic[,-1])
#function is automatically called inside diahclust
icelandic.diahclust=diahclust(optimal, icelandic[,-1], method="average")
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.