proteinLocsToProteinSeq: Obtaining the amino acid sequences of a list of protein...
In geno2proteo: Finding the DNA and Protein Sequences of Any Genomic or Proteomic Loci

View source: R/proteinLocsToProteinSeq.R

proteinLocsToProteinSeq

R Documentation

Obtaining the amino acid sequences of a list of protein sections

Description

Given a list of protein sections, each defined by the ENSEMBL ID of a protein and the start and end coordinates of the section along that protein's amino acid sequence, the function returns the amino acid sequence of each section.

Usage

proteinLocsToProteinSeq(inputLoci, CDSaaFile)

Arguments

inputLoci

A data frame containing the coordinates of the protein sections in the protein sequences. The 1st column must be the ENSEMBL ID of either the protein or the transcript that the protein corresponds to (or the equivalent of an ENSEMBL ID if you have created your own gene annotation GTF file). You must use only one of the two ID formats (protein ID or transcript ID) within a single function call; the two cannot be mixed. The 2nd and 3rd columns give the coordinates of the first and last amino acids of the section in the protein sequence. Any further columns are optional and are not used by the function.
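The layout described above can be sketched in base R. This is only an illustration under stated assumptions: the IDs, coordinates and column names below are made up, and `check_input_loci` is a hypothetical helper that is not part of geno2proteo. Its ID-type test assumes standard ENSEMBL-style identifiers (a `P` or `T` before the digits), so it would not recognise IDs from a custom GTF file.

```r
# Made-up input with three required columns plus one optional column.
# The IDs are placeholders in ENSEMBL transcript style, not real entries.
inputLoci <- data.frame(
  id    = c("ENST00000000001", "ENST00000000002", "ENST00000000003"),
  start = c(10L, 55L, 120L),   # first amino acid of each section
  end   = c(42L, 80L, 151L),   # last amino acid of each section
  note  = c("domain A", "domain B", "domain C"),  # optional, unused
  stringsAsFactors = FALSE
)

# Hypothetical sanity check (an assumption, not the package's own validation).
check_input_loci <- function(loci) {
  stopifnot(is.data.frame(loci), ncol(loci) >= 3)
  ids <- as.character(loci[[1]])
  # Assumes ENSEMBL-style IDs: ENS + optional species code + P/T + digits
  is_protein    <- grepl("^ENS[A-Z]*P[0-9]+", ids)
  is_transcript <- grepl("^ENS[A-Z]*T[0-9]+", ids)
  if (any(is_protein) && any(is_transcript)) {
    stop("Column 1 mixes protein and transcript IDs; use one type per call.")
  }
  if (!is.numeric(loci[[2]]) || !is.numeric(loci[[3]])) {
    stop("Columns 2 and 3 must be numeric coordinates.")
  }
  if (any(loci[[2]] > loci[[3]])) {
    stop("Each start coordinate should not exceed its end coordinate.")
  }
  invisible(TRUE)
}

check_input_loci(inputLoci)
```

Running a check like this before the call may help catch mixed ID types early, since the documentation requires a single ID type per call.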

CDSaaFile

The data file generated by the package's function generatingCDSaaFile, containing the genomic locations, DNA sequences and protein sequences of all coding regions in the specific genome used in your analysis.

Value

The function returns a data frame containing the original protein locations specified in the input, followed by one added column for the amino acid sequences of the protein sections.
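To make the shape of the return value concrete, here is a toy mock-up in base R. It is not the package's implementation: the sequences and IDs are invented, the column name `aa_seq` is a placeholder (the real column name is not stated here), and the coordinates are assumed to be 1-based and inclusive at both ends.

```r
# Invented full-length sequences keyed by placeholder IDs (not real proteins).
toy_sequences <- c(
  PROT_A = "MKTAYIAKQRQISFVKSHFSRQ",
  PROT_B = "MSDNELKQAVLETWGHPRS"
)

# Toy input in the documented layout: ID, first residue, last residue.
toy_input <- data.frame(
  id    = c("PROT_A", "PROT_B"),
  start = c(3L, 1L),
  end   = c(8L, 5L),
  stringsAsFactors = FALSE
)

# Mimic the documented output layout: the input columns come first,
# followed by one extra column holding the section sequences.
toy_result <- toy_input
toy_result$aa_seq <- unname(
  substr(toy_sequences[toy_input$id], toy_input$start, toy_input$end)
)

print(toy_result)
```

Under these assumptions each extracted section has length `end - start + 1`, which gives a quick consistency check on real output as well.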

Author(s)

Yaoyong Li

Examples

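    # Locate the example data shipped with the package
    dataFolder <- system.file("extdata", package = "geno2proteo")

    # Tab-separated file listing the protein sections (ID, start, end)
    inputFile_loci <- file.path(dataFolder,
        "transId_pfamDomainStartEnd_chr16_Zdomains_22examples.txt")

    # Data file of the kind produced by generatingCDSaaFile
    CDSaaFile <- file.path(dataFolder,
        "Homo_sapiens.GRCh37.74_chromosome16_35Mlong.gtf.gz_AAseq.txt.gz")

    # Read the sections into a data frame, keeping IDs as character strings
    inputLoci <- read.table(inputFile_loci, sep = "\t", stringsAsFactors = FALSE)

    # Retrieve the amino acid sequence of each section
    ProtSeqNow <- proteinLocsToProteinSeq(inputLoci = inputLoci,
                                          CDSaaFile = CDSaaFile)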

geno2proteo documentation built on June 13, 2022, 5:08 p.m.
