View source: R/groupComparison.R
groupComparison | R Documentation |
Whole plot testing
groupComparison(
contrast.matrix,
data,
save_fitted_models = TRUE,
log_base = 2,
use_log_file = TRUE,
append = FALSE,
verbose = TRUE,
log_file_path = NULL,
numberOfCores = 1
)
contrast.matrix |
comparison between conditions of interests. |
data |
name of the (output of dataProcess function) data set. |
save_fitted_models |
logical, if TRUE, fitted models will be added to the output. |
log_base |
base of the logarithm used in dataProcess. |
use_log_file |
logical. If TRUE, information about data processing will be saved to a file. |
append |
logical. If TRUE, information about data processing will be added to an existing log file. |
verbose |
logical. If TRUE, information about data processing wil be printed to the console. |
log_file_path |
character. Path to a file to which information about data processing will be saved. If not provided, such a file will be created automatically. If 'append = TRUE', has to be a valid path to a file. |
numberOfCores |
Number of cores for parallel processing. When > 1, a logfile named 'MSstats_groupComparison_log_progress.log' is created to track progress. Only works for Linux & Mac OS. Default is 1. |
contrast.matrix : comparison of interest. Based on the levels of conditions, specify 1 or -1 to the conditions of interests and 0 otherwise. The levels of conditions are sorted alphabetically. Command levels(QuantData$FeatureLevelData$GROUP_ORIGINAL) can illustrate the actual order of the levels of conditions. The underlying model fitting functions are lm and lmer for the fixed effects model and mixed effects model, respectively. The input of this function is the quantitative data from function (dataProcess).
A list with the following components:
A 'data.frame' containing the results of the statistical testing for each protein. The columns include:
The name of the protein for which the comparison is made.
The label of the comparison, typically derived from the 'contrast.matrix'.
The log2 fold change between the conditions being compared. The base of the logarithm is specified by the 'log_base' parameter.
The standard error of the log2 fold change estimate.
The t-statistic value for the comparison.
The degrees of freedom associated with the t-statistic.
The p-value for the statistical test of the comparison.
The adjusted p-value using the Benjamini-Hochberg method for controlling the false discovery rate.
Any issues encountered during the comparison. NA indicates no issues. "oneConditionMissing" occurs when data for one of the conditions being compared is entirely missing for a particular protein.
The percentage of missing features for a given protein across all runs. This column is included only if missing values were imputed.
The percentage of features that were imputed for a given protein across all runs. This column is included only if missing values were imputed.
A 'data.frame' containing quality control data used to fit models for group comparison. The columns include:
Identifier for the specific MS run.
Identifier for the protein.
Summarized intensity for the protein in a given run.
Original run identifier before any processing.
Experimental group identifier.
Subject identifier within the experimental group.
Total number of feature measurements for the protein in the given group.
Number of features measured for the protein in the given run.
Percentage of missing feature values for the protein in the given run.
Logical indicator of whether more than 50 percent of the features values are missing for the protein in the given run.
Number of features for which values were imputed due to missing or censored data for the protein in the given run.
Contains the differences between the observed values and the values predicted by the fitted model.
The predicted values obtained from the model for a protein measurement for a given run in the dataset.
A list of fitted models for each protein. This is included only if 'save_fitted_models' is set to TRUE. Each element of the list corresponds to a protein and contains the fitted model object.
# Consider quantitative data (i.e. QuantData) from yeast study with ten time points of interests,
# three biological replicates, and no technical replicates.
# It is a time-course experiment and we attempt to compare differential abundance
# between time 1 and 7 in a set of targeted proteins.
# In this label-based SRM experiment, MSstats uses the fitted model with expanded scope of
# Biological replication.
QuantData <- dataProcess(SRMRawData, use_log_file = FALSE)
head(QuantData$FeatureLevelData)
levels(QuantData$ProteinLevelData$GROUP)
comparison <- matrix(c(-1,0,0,0,0,0,1,0,0,0),nrow=1)
row.names(comparison) <- "T7-T1"
groups = levels(QuantData$ProteinLevelData$GROUP)
colnames(comparison) <- groups[order(as.numeric(groups))]
# Tests for differentially abundant proteins with models:
# label-based SRM experiment with expanded scope of biological replication.
testResultOneComparison <- groupComparison(contrast.matrix=comparison, data=QuantData,
use_log_file = FALSE)
# table for result
testResultOneComparison$ComparisonResult
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.