View source: R/preprocess_MaxQuant.R
preprocess_MaxQuant | R Documentation |
Performs a standard preprocessing pipeline on MSnSet
objects (Gatto et al., 2012) originating from MaxQuant (Cox and Mann, 2008) peptides.txt files.
By default, intensity values are log2 transformed and then quantile normalized. Next, the smallestUniqueGroups
function is applied,
which removes proteins groups for which any of its member proteins is present in a smaller protein group. Then, contaminants and reverse sequences are removed.
Next, irrelevant columns are dropped. Then, peptide sequences that are identified only once in a single mass spec run are removed because with only 1 identification, the model will be perfectly confounded. Finally, potential experimental annotations are added to the data frame.
preprocess_MaxQuant(MSnSet, accession = "Proteins", exp_annotation = NULL, type_annot = NULL, logtransform = TRUE, base = 2, normalisation = "quantiles", weights = NULL, smallestUniqueGroups = TRUE, useful_properties = c("Proteins", "Sequence", "PEP"), filter = c("Potential.contaminant", "Reverse"), filter_symbol = "+", minIdentified = 2, remove_only_site = FALSE, file_proteinGroups = NULL, colClasses = "keep", droplevels = TRUE, printProgress = FALSE, shiny = FALSE, message = NULL)
MSnSet |
An |
accession |
A character indicating the column that contains the the protein identifiers. This is only used if |
exp_annotation |
Either the path to the file which contains the experiment annotation or a data frame containing the experiment annotation. Exactly one colum in the experiment annotation should contain the mass spec run names. Annotation in a file can be both a tab-delimited text document or an Excel file. For more details, see |
type_annot |
If |
logtransform |
A logical value indicating whether the intensities should be log-transformed. Defaults to |
base |
A positive or complex number: the base with respect to which logarithms are computed. Defaults to 2. |
normalisation |
A character vector of length one that describes how to normalise the |
weights |
Only used when |
smallestUniqueGroups |
A logical indicating whether protein groups for which any of its member proteins is present in a smaller protein group should be removed from the dataset. Defaults to |
useful_properties |
The columns of the |
filter |
A vector of names corresponding to the columns in the |
filter_symbol |
A character indicating the symbol in the columns corresponding to the |
minIdentified |
A numeric value indicating the minimal number of times a peptide sequence should be identified in the dataset in order not to be removed. Defaults to 2. |
remove_only_site |
A logical indicating wheter proteins that are only identified by peptides carrying one or more modification sites should be removed from the data. This requires the extra input of a proteinGroups.txt file in the |
file_proteinGroups |
The name of the proteinGroups.txt file, which is used to remove proteins that are only identified by peptides carrying one or more modification sites. Only used when |
colClasses |
character. Only used when the |
droplevels |
A logical indicating if levels of factors that disappeared during preprocessing should be removed from the data. Defaults to |
printProgress |
A logical indicating whether the R should print a message before performing each preprocessing step. Defaults to |
shiny |
A logical indicating whether this function is being used by a Shiny app. Setting this to |
message |
Only used when |
A preprocessed MSnSet
object that is ready to be converted into a protdata
object.
Gatto L, Lilley KS. MSnbase - an R/Bioconductor package for isobaric tagged mass spectrometry data visualization, processing and quantitation. Bioinformatics. 2012 Jan 15;28(2):288-9. https://doi.org/10.1093/bioinformatics/btr645. PubMed PMID:22113085.
Cox, J. and Mann, M. MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification. Nat Biotechnol, 2008, 26, pp 1367-72. http://www.nature.com/nbt/journal/v26/n12/full/nbt.1511.html.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.