View source: R/adjustSignaturesForRegionSet.R
adjustSignaturesForRegionSet | R Documentation |
'adjustSignaturesForRegionSet()' takes a set of signatures that have been orginally defined with respect to the nucleotide frequencies within a specific reference genome or region (e.g., by deriving them from whole genome mutation data) and adjusts or normalizes them to the often different nucleotide frequencies of another specific subset of genomic regions.
adjustSignaturesForRegionSet(signatures, regionsTarget, regionsOriginal=NULL, refGenome=BSgenome.Hsapiens.UCSC.hg19::BSgenome.Hsapiens.UCSC.hg19)
signatures |
(Mandatory) Signatures to be adjusted to the nucleotide
frequencies of the genomic regions defined by the parameter |
regionsTarget |
(Mandatory) |
regionsOriginal |
(Optional) |
refGenome |
(Optional) Reference genome sequence from which to
compute the nucleotide frequencies. Default: |
This may be useful, for example, to perform signature refitting (using
decomposeTumorGenomes
) for mutation data from targetted
sequencing (e.g., only a subset of genes), whole exome sequencing (only
exonic regions), or other limited subsets of the genome with particular
nucleotide frequencies.
For Alexandrov-type signatures, the important frequencies are those of the whole sequence patterns (e.g., trinucleotides) whose central base can be mutated. Therefore, adjustment factors for individual mutation types (e.g., A[C>T]G) are computed by comparing the corresponding sequence pattern frequencies (e.g., ACG) between the original reference regions (e.g., whole genome) and the target regions (e.g., target regions of whole exome sequencing).
In the Shiraishi-type signature model, individual bases of the sequence patterns are considered as independent features. Thus, to compute nucleotide frequencies for such signatures, the frequencies of the sequence patterns (computed as for Alexandrov-type signatures) are broken down to single-nucleotide frequencies for the individual positions of the patterns.
In both cases, after the appropriate adjustment of individual features, signatures are re-normalized such that overall probabilites sum up to 1.
A set of adjusted mutational signatures in the same format as those
specified for signatures
.
Rosario M. Piro
Politecnico di Milano
Maintainer: Rosario
M. Piro
E-Mail: <rmpiro@gmail.com> or <rosariomichael.piro@polimi.it>
http://rmpiro.net/decompTumor2Sig/
Krueger, Piro (2019) decompTumor2Sig: Identification of mutational
signatures active in individual tumors. BMC Bioinformatics
20(Suppl 4):152.
decompTumor2Sig
decomposeTumorGenomes
### get Alexandrov signatures from COSMIC signatures <- readAlexandrovSignatures() ### get gene annotation for the default reference genome (hg19) library(TxDb.Hsapiens.UCSC.hg19.knownGene) txdb <- TxDb.Hsapiens.UCSC.hg19.knownGene::TxDb.Hsapiens.UCSC.hg19.knownGene ### get a GRanges object for gene promoters (-2000 to +200 bases from TSS) ### [taking only the first 1000 for testing purpose] library(GenomicRanges) regionsTarget <- promoters(txdb, upstream=2000, downstream=200)[seq(1000)] ### assume these signatures were derived only from mutation data from ### exons on chromosome X [not true; just for illustrative purpose] filter <- list(tx_chrom = c("chrX")) regionsOriginal <- exons(txdb, filter=filter) ### adjust signatures according to nucleotide frequencies in the target ### subset of the genome sign_adj <- adjustSignaturesForRegionSet(signatures, regionsTarget, regionsOriginal)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.