MACS-data: Real Dataset: MACS Cohort Study

Description Usage Format Acknowledgments Author(s) Source References See Also Examples


Publicly available dataset from the Multicenter AIDS Cohort Study (MACS) available at ( The dataset provides longitudinal account of viral tropism in relation to the HIV full spectrum of rates of HIV-1 disease progression (Shepherd, et al. 2008). To our knowledge, this cohort provides a unique dataset with well characterized clinical information for analyzing associations between host genetic variation and viral tropism as well as disease progression. Here, we determined whether copy number variation in beta-defensin and its interactions with certain polymorphisms in chemokine receptors and ligand genes are associated, either alone or jointly, with clinical events in HIV-seropositive patients, such as time to HIV change of tropism or time to AIDS diagnosis (See Dazard et al. (2017) for additional descriptions of the dataset and materials).




The dataset consists of a numeric data.frame containing n=50 complete observations (samples) by rows and p=7 covariates by columns, not including the censoring indicator and (censored) time-to-event variables.

The variables included in the MACS cohort study were 5 genetic variants (DEFB4/103A CNV [1-5], CCR2 SNP [190G>A], CCR5 [SNP -2459G>A, ORF], CXCL12 SNP [801G>A]) and 2 non-genetic variables, taken as two additional covariates. All input variables were categorical with no more than three levels (experimental groups) each. We used genetic variables with original and aggregated categories as follows: DEFB CNV [CNV = 2 or CNV > 2]; CCR2 SNP [GG or GA], CCR5 SNP [GG or GA]; CCR5 ORF [WT or D32], CXCL12 SNP [GG or GA]. The first covariate was the two-level disease progression Group variable [Fast, Slow], and the second was the three-level Race/Ethnicity variable [White, Hispanic, Black]. For each observation i \in \{1,...,n\}, we denote the j-th variable by the n-dimensional vector {\bf x}_{j} = (x_{1,j},...,x_{n,j})^{T}, where j \in \{1,...,p\}. Here, p denotes the number of variables. Hereafter, we denoted the p=7 included variables as follows:

The time-to-event outcomes included in the MACS cohort study, generically denoted E, were the time-to-X4-Emergence (denoted XE) and the time-to-AIDS-Diagnosis (denoted AD), whether each was observed or not during each patient's follow-up time. The corresponding event-free (EF) ("survival") probability function S(t) of time-to-event E := XE (X4-Emergence) or E := AD (AIDS-Diagnosis), were called X4-Emergence-Free (E := XEF) or AIDS-Diagnosis-Free (E := ADF) probability.

The dataset comes as a compressed Rda data file.


This work made use of the High Performance Computing Resource in the Core Facility for Advanced Research Computing at Case Western Reserve University. We are thankful to Ms. Janet Schollenberger, Senior Project Coordinator, CAMACS, as well as Dr. Jeremy J. Martinson, Sudhir Penugonda, Shehnaz K. Hussain, Jay H. Bream, and Priya Duggal, for providing us the data related to the samples analyzed in the present study. Data in this manuscript were collected by the Multicenter AIDS Cohort Study (MACS) at ( with centers at Baltimore, Chicago, Los Angeles, Pittsburgh, and the Data Coordinating Center: The Johns Hopkins University Bloomberg School of Public Health. The MACS is funded primarily by the National Institute of Allergy and Infectious Diseases (NIAID), with additional co-funding from the National Cancer Institute (NCI), the National Heart, Lung, and Blood Institute (NHLBI), and the National Institute on Deafness and Communication Disorders (NIDCD). MACS data collection is also supported by Johns Hopkins University CTSA. This study was supported by two grants from the National Institute of Health: NIDCR P01DE019759 (Aaron Weinberg, Peter Zimmerman, Richard J. Jurevic, Mark Chance) and NCI R01CA163739 (Hemant Ishwaran). The work was also partly supported by the National Science Foundation grant DMS 1148991 (Hemant Ishwaran) and the Center for AIDS Research grant P30AI036219 (Mark Chance).


Jean-Eudes Dazard <[email protected]>

Maintainer: Jean-Eudes Dazard <[email protected]>


See real data application in Dazard et al., 2017.


See Also

Multicenter AIDS Cohort Study (MACS) websitea at


   # Loading the library and its dependencies

   # Help on MACS dataset
   data("MACS", package="IRSF")

jedazard/IRSF documentation built on Oct. 19, 2017, 11:49 p.m.