intIRTeasyRunFromRaw: The easyrun function for integrating multiple modalities of...

View source: R/intIRTeasyRunFromRaw.R

intIRTeasyRunFromRawR Documentation

The easyrun function for integrating multiple modalities of high throughput assays using raw data.

Description

This function performs data dichotomization, IRT fitting on individual assay, latent trait estimation for integrated data and significance assessment of latent trait by permutation. An option for parallel computing is implemented to speed up the computation.

Usage

intIRTeasyRunFromRaw(platforms, platformsCtr, 
	assayType = c("Expr", "Methy", "CN"), 
	model = 3, guessing = FALSE, permutationMethod = NULL, 
	fold = 1, nPerm = 200, echo = TRUE, 
	parallel = FALSE, ...)

Arguments

platforms

A list of matrices of the raw data for tumor samples. The matrices should have equal row number corresponding to the same set of genes. The columns representing the tumor samples can differ.

platformsCtr

A list of matrices of the raw data for normal control samples. The matrices should have equal row number corresponding to the same set of genes. The column number of each matrix can differ. When normal control is not available, i.e. in CN data, use NA instead.

assayType

A vector of assay types. Candidates can only be a combination of "Expr", "Methy", "CN" in the order of the assays specified in the input platforms. When assays other then these three types, we recommend the user to dichotomize the data first and use the intIRTeasyRun function.

model

The model type as described in fitOnSinglePlat. 1: Rasch model where all item discrination are set to 1; 2: all item discrimation are set to be equal but not necessarily as 1; 3: the 2PL model where no constraint is put on the item difficulty and discrimination parameter.

guessing

A logical variable indicating whether to include guessing parameter in the model.

permutationMethod

What permutation method to use. It can only be 'gene sampling', 'sample label permutation' or NULL. if NULL, no permutation is performed

fold

The fold of sampling to calculate permuted score as used in calculatePermutedScoreByGeneSampling(). Only relevant when permutationMethod=gene sampling is used.

nPerm

Number of permutations for sample label permutation. It is effective only when permutationMethod='sample label permutation'.

echo

A logical variable indicating whether to print out the progress information.

parallel

Logical indicating whether to use parallel computing with foreach package as backend.

...

Additional parameters for dichotomizing binary data.

Value

A list quite similar to the results returned by intIRTeasyRun. The following elements are included:

fits

Model fits for each platform as returned by fitOnSinglePlat function

estimatedScoreMat

A matrix of estimated latent traits. The first several columns correspond to the individual assays; the last column represents the integrated latent trait with all data.

permutedScoreMat

A matrix of latent trait estimates after permuting the binary matrix within columns. This is only available if permutationMethod='gene sampling' is used. The first several columns correspond to the individual assays; the last column represents the integrated data.

dscrmnList

A list of discrimination parameters. Each element contains all of the discrimination parameters as a vector for each assay. The last element contains the discrimination parameters for the integrated data which is formed by combining discrimination parameters from each assay sequentially.

dffcltList

Same format as dscrmnList except it contains difficulty parameter.

gussngList

Same format as dscrmnList except it contains guessing parameter. Be default, this is just all 0's.

permutedScoreMatWithLabelPerm

A matrix of latent trait estimates using sample label permutation. This is only available if permutationMethod='sample label permutation' is used. The first several columns correspond to the individual assays; the last column represents the integrated data.

Author(s)

Pan Tong (nickytong@gmail.com), Kevin R Coombes (krc@silicovore.com)

References

Tong P, Coombes KR. integIRTy: a method to identify altered genes in cancer accounting for multiple mechanisms of regulation using item response theory. Bioinformatics, 2012 Nov 15; 28(22):2861–9.

See Also

intIRTeasyRun, fitOnSinglePlat, calculatePermutedScoreByGeneSampling

Examples

data(OV)
# 
controlList <- list(Expr_N, Methy_N, CN_N)
tumorList <- list(Expr_T, Methy_T, CN_T)
# not run as it takes time
#runFromRaw <- intIRTeasyRunFromRaw(platforms=tumorList, 
#		platformsCtr=controlList, 
#		assayType=c("Expr", "Methy", "CN"), 
#		permutationMethod="gene sampling")

integIRTy documentation built on May 3, 2022, 9:08 a.m.