metagenomeSeq: Statistical analysis for sparse high-throughput sequencing

Documented in fitLogNormal

#' Computes a log-normal linear model and permutation based p-values.
#' 
#' Wrapper to perform the permutation test on the t-statistic. This is the original
#' method employed by metastats (for non-sparse large samples). We include CSS normalization
#' though (optional) and log2 transform the data. In this method the null distribution is not assumed to be a t-dist.
#' 
#' 
#' @param obj A MRexperiment object with count data.
#' @param mod The model for the count distribution.
#' @param useCSSoffset Boolean, whether to include the default scaling
#' parameters in the model or not.
#' @param B Number of permutations.
#' @param coef The coefficient of interest.
#' @param sl The value to scale by (default=1000).
#'
#' @return Call made, fit object from lmFit, t-statistics and p-values for each feature.
#' @export
#' @examples
#' 
#' # This is a simple demonstration
#' data(lungData)
#' k = grep("Extraction.Control",pData(lungData)$SampleType)
#' lungTrim = lungData[,-k]
#' k = which(rowSums(MRcounts(lungTrim)>0)<30)
#' lungTrim = cumNorm(lungTrim)
#' lungTrim = lungTrim[-k,]
#' smokingStatus = pData(lungTrim)$SmokingStatus
#' mod = model.matrix(~smokingStatus)
#' fit = fitLogNormal(obj = lungTrim,mod=mod,B=1)
#' 
fitLogNormal <- function(obj,mod,useCSSoffset=TRUE,B=1000,coef=2,sl=1000){
    if(class(obj)=="MRexperiment"){
        mat = MRcounts(obj,norm=FALSE,log=FALSE)
        mat = log2(mat + 1)
    } else if(class(obj) == "matrix") {
        mat = obj
    } else {
        stop("Object needs to be either a MRexperiment object or matrix")
    }

	if(useCSSoffset==TRUE){
		if(any(is.na(normFactors(obj)))){
            stop("Calculate the normalization factors first!")
        }
		mmCount=cbind(mod,log2(normFactors(obj)/sl +1))}
	else{ 
       	mmCount=mod
   	}
    
    # fit of the data
	fitRes = limma::lmFit(mat,mmCount)	

    # The t-statistic
    tt <- fitRes$coef[,coef] / fitRes$stdev.unscaled[,coef] / fitRes$sigma

    perms = replicate(B,sample(mmCount[,coef]))
    mmCount1=mmCount[,-coef]
    nc = ncol(mmCount)

    tobs<- sapply(1:B,function(i){
        # This code forces the covariate of interest to be a factor (might not apply)
        mmCountPerm = cbind(mmCount1,factor(perms[,i]))
        fit = limma::lmFit(mat,mmCountPerm)
        ttObs <- fit$coef[,nc] / fit$stdev.unscaled[,nc] / fit$sigma
        ttObs
    })
    p = rowMeans(abs(tobs)>=abs(tt))

	dat = list(call=match.call(),fit=fitRes,t = tt,p = p,type="perm")
	return(dat)
}

HCBravoLab/metagenomeSeq documentation built on Dec. 20, 2024, 4:06 p.m.

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

HCBravoLab/metagenomeSeq
Statistical analysis for sparse high-throughput sequencing

R/fitLogNormal.R
In HCBravoLab/metagenomeSeq: Statistical analysis for sparse high-throughput sequencing

Defines functions fitLogNormal

Documented in fitLogNormal

R Package Documentation

Browse R Packages

We want your feedback!

HCBravoLab/metagenomeSeq Statistical analysis for sparse high-throughput sequencing

R/fitLogNormal.R In HCBravoLab/metagenomeSeq: Statistical analysis for sparse high-throughput sequencing

Defines functions fitLogNormal

Documented in fitLogNormal

R Package Documentation

Browse R Packages

We want your feedback!

HCBravoLab/metagenomeSeq
Statistical analysis for sparse high-throughput sequencing

R/fitLogNormal.R
In HCBravoLab/metagenomeSeq: Statistical analysis for sparse high-throughput sequencing