hack_cinsarc: Hack the CINSARC classification

View source: R/hack_cinsarc.R

hack_cinsarcR Documentation

Hack the CINSARC classification

Description

Given a gene expression matrix and a 0-1 vector indicating the distant metastasis status of samples, hack_cinsarc() classifies samples into one of two risk classes, C1 or C2, using the CINSARC signature as implemented in Chibon et al., 2010.

Usage

hack_cinsarc(expr_data, dm_status)

Arguments

expr_data

A normalized gene expression matrix (or data frame) with gene symbols as row names and samples as columns.

dm_status

A numeric vector specifying whether a sample has either (1) or not (0) developed distant metastasis.

Details

CINSARC (Complexity INdex in SARComas) is a prognostic 67-gene signature related to mitosis and control of chromosome integrity. It was developed to improve metastatic outcome prediction in soft tissue sarcomas over the FNCLCC (Fédération Francaise des Centres de Lutte Contre le Cancer) grading system.

Value

A tibble with one row for each sample in expr_data and two columns: sample_id and cinsarc_class.

Algorithm

The CINSARC method implemented in hacksig makes use of leave-one-out cross validation (LOOCV) to classify samples into C1/C2 risk groups (see Lesluyes & Chibon, 2020). First, gene expression values are centered by their mean across samples. Then, for each iteration of the LOOCV, mean normalized gene values are computed by metastasis group (i.e. compute the metastatic centroids). Then, one minus the Spearman's correlation between centered samples and metastatic centroids are computed. Finally, if a sample is more correlated to the non-metastatic centroid, then it is assigned to the C1 class (low risk). Conversely, if a sample is more correlated to the metastatic centroid, then it is assigned to the C2 class (high risk).

Source

codeocean.com/capsule/4933686/tree/v4

References

Chibon, F., Lagarde, P., Salas, S., Pérot, G., Brouste, V., Tirode, F., Lucchesi, C., de Reynies, A., Kauffmann, A., Bui, B., Terrier, P., Bonvalot, S., Le Cesne, A., Vince-Ranchère, D., Blay, J. Y., Collin, F., Guillou, L., Leroux, A., Coindre, J. M., & Aurias, A. (2010). Validated prediction of clinical outcome in sarcomas and multiple types of cancer on the basis of a gene expression signature related to genome complexity. Nature medicine, 16(7), 781–787. doi: 10.1038/nm.2174.

Lesluyes, T., & Chibon, F. (2020). A Global and Integrated Analysis of CINSARC-Associated Genetic Defects. Cancer research, 80(23), 5282–5290. doi: 10.1158/0008-5472.CAN-20-0512.

Examples

# generate random distant metastasis outcome
set.seed(123)
test_dm_status <- sample(c(0, 1), size = ncol(test_expr), replace = TRUE)

hack_cinsarc(test_expr, test_dm_status)

hacksig documentation built on March 18, 2022, 6:44 p.m.