Generalized Principal Component of Symbolic Interval Variables.


This package implements an extension of principal component analysis (PCA) for multiple data tables. The tables must contain the same number of interval variables and the same observations. The package can handle big data in the sense that the variation in a massive data set can be summarized by intervals [a, b] spread over multiple tables. If only one data table is supplied, the package performs a PCA of interval data.
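To illustrate how intervals can summarize a large raw data set, here is a minimal base-R sketch. It does not use GPCSIV itself. The data, the column names (`city`, `temperature`) and the grouping are invented for the example: 3000 raw measurements are reduced to one interval [min, max] per observation.

```r
# Illustration only: simulated data, not taken from the package.
set.seed(1)
raw <- data.frame(
  city = rep(c("A", "B", "C"), each = 1000),
  temperature = c(rnorm(1000, 10, 2), rnorm(1000, 20, 3), rnorm(1000, 15, 1))
)

# One minimum and one maximum per city; both results are sorted by city.
mins <- aggregate(temperature ~ city, data = raw, FUN = min)
maxs <- aggregate(temperature ~ city, data = raw, FUN = max)

# Interval-valued table: the minimum column is followed by the maximum column.
intervals <- data.frame(
  temperature_min = mins$temperature,
  temperature_max = maxs$temperature,
  row.names = as.character(mins$city)
)
print(intervals)
```

Each row of `intervals` now describes one observation by the range of its raw values instead of by 1000 individual measurements.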


Package: GPCSIV
Type: Package
Version: 1.0
Date: 2013-06-06
License: GPL (>= 2)

Each dataset can be stored in a CSV, Excel, Access, or text file, among other formats. The only constraint is that, for each variable, the maximum column must follow the minimum column. The Resdata class returns two lists of data frames (a list of minimums and a list of maximums). These lists of data frames are the inputs of the gpca function.
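The column layout described above can be sketched in base R as follows. This is not the package's Resdata implementation: `split_min_max` is a hypothetical helper, the two tables are made-up data, and the sketch assumes that the columns strictly alternate minimum, maximum, minimum, maximum. It only shows the shape of the two lists (minimums and maximums) that the text describes.

```r
# Hypothetical helper, not part of GPCSIV: split a table whose columns
# alternate min, max, min, max, ... into two data frames.
split_min_max <- function(tab) {
  stopifnot(ncol(tab) > 0, ncol(tab) %% 2 == 0)
  mins <- tab[, seq(1, ncol(tab), by = 2), drop = FALSE]
  maxs <- tab[, seq(2, ncol(tab), by = 2), drop = FALSE]
  # Every interval must satisfy min <= max.
  stopifnot(all(as.matrix(mins) <= as.matrix(maxs)))
  list(min = mins, max = maxs)
}

# Two made-up tables with the same observations and the same interval variables.
table1 <- data.frame(
  temp_min = c(10, 12, 8),  temp_max = c(18, 21, 15),
  hum_min  = c(40, 35, 50), hum_max  = c(60, 55, 70),
  row.names = c("obs1", "obs2", "obs3")
)
table2 <- data.frame(
  temp_min = c(11, 13, 9),  temp_max = c(19, 20, 16),
  hum_min  = c(42, 30, 48), hum_max  = c(65, 50, 72),
  row.names = c("obs1", "obs2", "obs3")
)

# One list of minimum data frames and one list of maximum data frames.
parts    <- lapply(list(table1, table2), split_min_max)
list_min <- lapply(parts, `[[`, "min")
list_max <- lapply(parts, `[[`, "max")
```

In practice each table would be read from its file first, for example with `read.csv`, before being split.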


Brahim Brahim and Sun Makosso-Kallyth
Maintainer: Brahim Brahim


Billard, L. and E. Diday (2006). Symbolic Data Analysis: Conceptual Statistics and Data Mining. Berlin: Wiley Series in Computational Statistics.

Diday, E. and M. Noirhomme-Fraiture (2008). Symbolic Data Analysis and the SODAS Software. Chichester: Wiley Interscience.

Makosso-Kallyth, S. (2013). Analysis of m sets of symbolic interval variables. Revue des Nouvelles Technologies de l'Information, vol. RNTI-E25, pp. 97-108.
