metagene_cds: Get average ribosome footprint alignment values at the 5' and...
In celalp/ribofootprintR: Fast data extraction and visualization from .bam files for ribosome footprint profiling experiments

Usage Arguments Details Value Author(s) See Also Examples

1	metagene_cds(data, genedf, before_start = 50, after_start = 100, before_stop = 100, after_stop = 50, norms = "total", cores = 1, plot = F)

`data`	list returned by read_profiling_data
`genedf`	data frame that contains the gene name, and the nucleotide coordinates of start and stop locations as well as number of nucleotides and codon of the ORF.
`before_start`	integer, the number of nucleotide to calculate before the start nucleotide, defaults to 50
`after_start`	integer, the number of nucleotide to calculate after the start nucleotide, defaults to 100
`before_stop`	integer, the number of nucleotide to calculate before the stop nucleotide, defaults to 100
`after_stop`	integer, the number of nucleotide to calculate before the stop nucleotide, defaults to 50
`norms`	character, normalization, character either "total", "rna" or "none". Defaults to "total"
`cores`	integer, number of threads to use. See details.
`plot`	logical, whether to plot or not, defaults to F

If norms is set to "total"" returns the average fraction of reads that map to the region specified. If norms are set to "rna" then the number of reads that map to each codon are divided by the RNA-Seq coverage. "none" returns the average number of reads per codon.

Currently PSOCK multithreadding is not supported. For Windows set cores to 1. For operating systems that support forking the number of threads that are used can be set by the cores argument. If the cores>1 then the data is processed by mclapply instead of lapply.

a data frame with normalized nucleotide values.

Alper Celik

metagene_bins

##---- Should be DIRECTLY executable !! ----
##-- ==>  Define data, use random,
##--	or do  help(data=index)  for the standard data sets.

## The function is currently defined as
function (data, genedf, before_start = 50, after_start = 100,
    before_stop = 100, after_stop = 50, norms = "total", cores = 1,
    plot = F)
{
    returner <- function(nucleotide, frame, df) {
        a <- df[df$nucleotide == nucleotide & df$frame == frame,
            ]
        b <- sum(a$norm, na.rm = T)
        b
    }
    if (is.null(data[["rna"]]) & norms == "rna") {
        warning("No RNA-Seq data setting norms to total")
        norms = "total"
    }
    gene_names <- genedf$gene
    get_cds <- function(gene_name, direction, norm = norms, genes = genedf) {
        gene <- get_gene(gene_name, data = data, seq = NA, genedf = genes)
        if (is.null(gene) == T) {
            NULL
        }
        else {
            coord <- genes[genes$gene == gene_name, ]
            if (direction == "five") {
                cdss <- data.frame(nucleotide = c((before_start *
                  -1):after_start))
                cdss <- left_join(cdss, gene, by = "nucleotide")
            }
            else if (direction == "three") {
                cdss <- data.frame(nucleotide = c((coord$end -
                  coord$start - before_stop):(coord$end - coord$start +
                  after_stop)))
                cdss <- left_join(cdss, gene, by = "nucleotide")
                cdss$nucleotide <- cdss$nucleotide - coord$end +
                  coord$start
            }
            if (norm == "total") {
                cdss$norm <- as.numeric(cdss$freq/sum(gene$freq,
                  na.rm = T))
            }
            else if (norm == "rna") {
                cdss$norm <- as.numeric(cdss$freq/cdss$coverage)
            }
            else if (norm == "none") {
                cdss$norm <- as.numeric(cdss$freq)
            }
            cdss$norm[!is.finite(cdss$norm)] <- NA
            cdss
        }
    }
    if (cores > 1) {
        fives <- do.call("rbind", mclapply(gene_names, get_cds,
            "five", mc.cores = cores))
    }
    else {
        fives <- do.call("rbind", lapply(gene_names, get_cds,
            "five"))
    }
    x <- unique(fives[, c(1, 3)])
    x <- na.omit(x)
    counts <- mdply(x, returner, fives)
    fives <- counts
    colnames(fives)[3] <- "value"
    fives$codon <- ceiling(fives$nucleotide/3)
    if (cores > 1) {
        threes <- do.call("rbind", mclapply(gene_names, get_cds,
            "three", mc.cores = cores))
    }
    else {
        threes <- do.call("rbind", lapply(gene_names, get_cds,
            "three"))
    }
    x <- unique(threes[, c(1, 3)])
    x <- na.omit(x)
    counts <- mdply(x, returner, threes)
    threes <- counts
    colnames(threes)[3] <- "value"
    threes$codon <- ceiling(threes$nucleotide)
    fives$location <- "from_start"
    threes$location <- "from_stop"
    cds <- rbind(fives, threes)
    cds
    fives$location <- "from_start"
    threes$location <- "from_stop"
    cds <- rbind(fives, threes)
    if (plot) {
        print(ggplot(cds, aes(x = codon, y = value, fill = frame)) +
            geom_bar(stat = "identity", position = "dodge") +
            facet_grid(. ~ location, scales = "free_x"))
    }
    invisible(cds)
  }

celalp/ribofootprintR documentation built on May 12, 2019, 12:04 p.m.

celalp/ribofootprintR index

README.md

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

celalp/ribofootprintR
Fast data extraction and visualization from .bam files for ribosome footprint profiling experiments

metagene_cds: Get average ribosome footprint alignment values at the 5' and...
In celalp/ribofootprintR: Fast data extraction and visualization from .bam files for ribosome footprint profiling experiments

Usage

Arguments

Details

Value

Author(s)

See Also

Examples

Related to metagene_cds in celalp/ribofootprintR...

R Package Documentation

Browse R Packages

We want your feedback!

celalp/ribofootprintR Fast data extraction and visualization from .bam files for ribosome footprint profiling experiments

metagene_cds: Get average ribosome footprint alignment values at the 5' and... In celalp/ribofootprintR: Fast data extraction and visualization from .bam files for ribosome footprint profiling experiments

Usage

Arguments

Details

Value

Author(s)

See Also

Examples

Related to metagene_cds in celalp/ribofootprintR...

R Package Documentation

Browse R Packages

We want your feedback!

celalp/ribofootprintR
Fast data extraction and visualization from .bam files for ribosome footprint profiling experiments

metagene_cds: Get average ribosome footprint alignment values at the 5' and...
In celalp/ribofootprintR: Fast data extraction and visualization from .bam files for ribosome footprint profiling experiments