geno2proteo-package: Finding the DNA and Protein Sequences of Any Genomic or...

geno2proteo-packageR Documentation

Finding the DNA and Protein Sequences of Any Genomic or Proteomic Loci

Description

Using the DNA sequence and gene annotation files provided in 'ENSEMBL' <https://www.ensembl.org/index.html>, the functions implemented in the package try to find the DNA sequences and protein sequences of any given genomic loci, and to find the genomic coordinates and protein sequences of any given protein locations, which are the frequent tasks in the analysis of genomic and proteomic data.

Details

The DESCRIPTION file: This package was not yet installed at build time.
Index: This package was not yet installed at build time.

~~ An overview of how to use the package and the most important functions ~~

The package needs three data files. One contains the genetic table, another one contains the DNA sequences of a certain genome, and the third one contains the gene annotations in the same format as the GTF file used in ENSEMBL. The standard genetic table was provided in this package. The other two data files can be downloaded from ENSEMBL web site. For details about those data files and how to obtain them, see the introduction document of this package. Of course you can create your own data files, as long as you observe the format of the files which are specified in the introduction document as well.

The package also needs Perl installed and being available for use in order to run some of the functions.

Once you have the three data files, you need to run the function generatingCDSaaFile to generate another data file, which will be used by some of the functions in this package.

Four main functions were implemented in this package so far. The function genomicLocToProteinSequence will find the protein sequences and DNA sequences of the coding regions within a list of genomic loci given as input. genomicLocToWholeDNASequence will obtain the whole DNA sequences of any genomic loci given in the input data. proteinLocsToGenomic will find the genomic coordinates for a list of sections in proteins as input. proteinLocsToProteinSeq will find the the protein sequences of a list of the protein sections.

Author(s)

Yaoyong Li

Maintainer: Yaoyong Li<liyaoyong85@gmail.com>


geno2proteo documentation built on June 13, 2022, 5:08 p.m.