R/GC.R

Defines functions SequenceAnalysis.GC

Documented in SequenceAnalysis.GC

#' @title SequenceAnalysis.GC
#' @description GC content : percentage of nucleotide g and c
#' @details Calculate GC content for input sequence or by passing UniProt ID it will catch the sequence online from UniProt database and then calculate GC content for it.
#' @author Babak Khorsand
#' @export SequenceAnalysis.GC
#' @param Nucleotide_Sequence Nucleotide Sequence
#' @param UniprotKB UniProt ID of desired sequence
#' @param CDS if TRUE GC content of CDS Region will be calculated
#' @return GC content
#' @examples
#' SequenceAnalysis.GC("actagtcacgatcag")
#' SequenceAnalysis.GC(UniprotKB="O15131")
#' SequenceAnalysis.GC(UniprotKB="O15131",CDS=TRUE)
SequenceAnalysis.GC = function(Nucleotide_Sequence=NULL,UniprotKB=NULL,CDS=FALSE)
{
  GC=NULL
  if (is.null(Nucleotide_Sequence))
  {
    if (is.null(UniprotKB))
    {
      stop("Nucleotide Sequence or UniprotKB must be set")
    }else
    {
      Nucleotide_Sequence=SequenceAnalysis.GetNucleotideSequence(UniprotKB)
      if (CDS)
      {
        Nucleotide_Sequence=Nucleotide_Sequence[[2]]
        if (Nucleotide_Sequence=="N/A")
          Nucleotide_Sequence=Nucleotide_Sequence[[3]]
      }else
      {
        Nucleotide_Sequence=Nucleotide_Sequence[[3]]
      }
    }
  }
  if (Nucleotide_Sequence!="N/A")
  {
    Nucleotide_Sequence=tolower(Nucleotide_Sequence)
    GC=round((table(strsplit(Nucleotide_Sequence,""))["g"]+table(strsplit(Nucleotide_Sequence,""))["c"])/nchar(Nucleotide_Sequence),3)
  }
  if (is.null(GC))
    GC="N/A"
  names(GC)="GC"
  return(GC)
}

Try the SequenceAnalysis package in your browser

Any scripts or data that you put into this service are public.

SequenceAnalysis documentation built on May 30, 2017, 8 a.m.