List of parameters defining the SSP2003 classifier for identification of breast cancer molecular subtypes (Sorlie et al 2003).
1 2 3
List of parameters for SSP2003:
Gene expression centroids for each subtype.
Mapping for centroids.
Method of correlation used to compute distance to the centroids.
Method used to compute the centroids.
Method of standardization for gene expressions.
Minimum number of samples within each cluster allowed during the fitting of the model.
Three versions of the model are provided, each of ones differs by the gene expressions standardization method since it has an important impact on the subtype classification:
Use of the official centroids without scaling of the gene expressions.
Use of the official centroids with traditional scaling of the gene expressions (see
Use of the official centroids with robust scaling of the gene expressions (see
ssp2003.robust has been shown to reach the best concordance with the traditional clinical parameters (ER IHC, HER2 IHC/FISH and histological grade). However the use of this model is recommended only when the dataset is representative of a global population of breast cancer patients (no sampling bias, the 5 subtypes should be present).
T. Sorlie and R. Tibshirani and J. Parker and T. Hastie and J. S. Marron and A. Nobel and S. Deng and H. Johnsen and R. Pesich and S. Geister and J. Demeter and C. Perou and P. E. Lonning and P. O. Brown and A. L. Borresen-Dale and D. Botstein (2003) "Repeated Observation of Breast Tumor Subtypes in Independent Gene Expression Data Sets", Proceedings of the National Academy of Sciences, 1(14):8418–8423
1 2 3 4