convertGenomicBasesToCodingAA | R Documentation |
Functions that attempt to convert DNA to/from AA given a local, possibly indel-modified, context.
convertGenomicBasesToCodingAminoAcids(seqID, position, end, strand = "+",
dnaQuery, genomeDNA, geneMap = NULL, exonMap = NULL)
convertGenomicDNAtoCodingDNA(geneID, genomeDNA = NULL)
convertAApositionToGenomicDNAposition(geneID, AAposition, AAlength)
seqID , geneID |
a single character string giving the SeqID or GeneID that is the reference location in the genome |
position , end |
location in reference chromosomal units of the start and stop of the DNA sequence of interest |
strand |
the coding strand to use, for interpreting that region of DNA |
dnaQuery |
the DNA sequence for that region, as called by any non-reference means, such as by a SNP/Indel caller |
genomeDNA |
the DNA sequence for that region, from the genomic reference |
geneMap , exonMap |
the current species GeneMap and ExonMap, or the relevant subset thereof, needed to interpret the DNA |
AAposition , AAlength |
the location and extent of the coding amino acid sequence of interest |
These functions try to map between DNA and AA given possibly modified genomic information, such as INDEL and SNP calling data. They are not guaranteed to work in all possible contexts! They are used mostly for adding DNA-to-AA notation to SNP plots, etc.
For convertGenomicBasesToCodingAA, a list of 3 elements. Two are character vectors of
the same length as the query DNA, having empty strings and amino acid calls that represent the
peptide/protein fragment in the same physical layout as the given DNA. The third is an integer
vector of amino acid ordinals giving the relative position in the final protein sequence. Insertions
and/or deletions may cause elements of the query to be other than a single character.
genomic |
the AA sequence of the reference genome in that region |
query |
the AA sequence of the query sequence in that region |
aa.ordinal |
the protein sequence locations of the query AA sequence in that region |
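The shape of this return value can be illustrated with a minimal base R sketch. It does not call the package: `sketchAAlayout` and `geneticCode` are hypothetical names, the input is assumed to be in frame on the plus strand, and placing each amino acid call at the middle base of its codon (with `NA` ordinals elsewhere) is an assumption about the layout, not something this page specifies.

```r
# Standard genetic code, codons ordered T, C, A, G with the third base varying fastest
bases <- c("T", "C", "A", "G")
grid <- expand.grid(b3 = bases, b2 = bases, b1 = bases, stringsAsFactors = FALSE)
geneticCode <- strsplit(
  "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG", "")[[1]]
names(geneticCode) <- paste0(grid$b1, grid$b2, grid$b3)

# Hypothetical helper: lay out AA calls along an in-frame DNA string.
# Assumption: the call sits on the middle base of each codon.
sketchAAlayout <- function(dna, firstOrdinal = 1L) {
  n <- nchar(dna)
  codonStarts <- seq(1, n - 2, by = 3)
  codons <- substring(dna, codonStarts, codonStarts + 2)
  aa <- rep("", n)                    # one element per DNA base
  ordinal <- rep(NA_integer_, n)      # NA where no AA call is placed
  mid <- codonStarts + 1
  aa[mid] <- unname(geneticCode[codons])
  ordinal[mid] <- firstOrdinal + seq_along(codons) - 1L
  list(aa = aa, aa.ordinal = ordinal)
}

# Reference versus a query carrying one SNP (A to G at base 5)
ref <- sketchAAlayout("ATGAAACCC")
qry <- sketchAAlayout("ATGAGACCC")
result <- list(genomic = ref$aa, query = qry$aa, aa.ordinal = qry$aa.ordinal)
```

In this toy case `result$genomic` and `result$query` differ only at the fifth element, where the lysine codon AAA has become the arginine codon AGA.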
For convertGenomicDNAtoCodingDNA, a character string that is the cDNA sequence for the given gene, after removing
any introns and accounting for strand by possible reverse complementation.
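The intron removal and strand handling described here can be sketched in base R as follows. This is an illustration of the idea only, not the package's implementation: `sketchCodingDNA` and its exon coordinate arguments are hypothetical, and coordinates are assumed to be 1-based and inclusive.

```r
# Hypothetical helper: splice exons out of a genomic string and
# reverse-complement the result for minus-strand genes.
sketchCodingDNA <- function(genomeDNA, exonStarts, exonEnds, strand = "+") {
  # Pull out each exon in genomic order and join them, dropping introns
  ord <- order(exonStarts)
  exons <- substring(genomeDNA, exonStarts[ord], exonEnds[ord])
  cdna <- paste(exons, collapse = "")
  if (strand == "-") {
    # Complement every base, then reverse the whole string
    comp <- chartr("ACGTacgt", "TGCAtgca", cdna)
    cdna <- paste(rev(strsplit(comp, "")[[1]]), collapse = "")
  }
  cdna
}

# Toy gene: exon 1 at 1-6, a 4-base intron, exon 2 at 11-16
genome <- "ATGAAAGTAGCCCTAA"
plusCDNA  <- sketchCodingDNA(genome, c(1, 11), c(6, 16), strand = "+")
minusCDNA <- sketchCodingDNA(genome, c(1, 11), c(6, 16), strand = "-")
```

Here `plusCDNA` is the two exons joined, and `minusCDNA` is the reverse complement of that same spliced string.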
For convertAApositionToGenomicDNAposition, a list of 2 elements:
SEQ_POSITION |
the location in the genomic DNA that is the start of the first codon |
SEQ_END |
the location in the genomic DNA that is the end of the last codon |
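The arithmetic behind such a mapping can be sketched in base R for the simplest case. This is not the package's code: `sketchAAtoGenomic` and its exon arguments are hypothetical, and the sketch assumes a plus-strand gene whose exons are given in genomic order with 1-based, inclusive coordinates and whose first coding base is the first exon base.

```r
# Hypothetical helper: map an amino acid range onto genomic coordinates
# for a plus-strand gene (minus-strand handling is deliberately left out).
sketchAAtoGenomic <- function(AAposition, AAlength, exonStarts, exonEnds) {
  # Genomic coordinate of every coding base, in transcript order
  codingPos <- unlist(mapply(seq, exonStarts, exonEnds, SIMPLIFY = FALSE))
  # Amino acid k occupies coding bases 3k-2 through 3k
  firstBase <- (AAposition - 1) * 3 + 1
  lastBase <- (AAposition + AAlength - 1) * 3
  list(SEQ_POSITION = codingPos[firstBase], SEQ_END = codingPos[lastBase])
}

# Exons at 1-6 and 11-16; amino acids 2 and 3 straddle the intron
res <- sketchAAtoGenomic(AAposition = 2, AAlength = 2,
                         exonStarts = c(1, 11), exonEnds = c(6, 16))
```

Because the second requested codon lies in the second exon, the returned span crosses the intron, so `SEQ_END - SEQ_POSITION + 1` can exceed three times `AAlength`.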
These functions are only as valid as the underlying genome and annotation. They are not foolproof.
Bob Morrison