computeMLC | R Documentation |
This function compute the most-like CDS (MLC) region of one RNA. Methods based on the longest open reading frame (ORF) and maximum subarray sum (MSS) are supported.
computeMLC(oneRNA, mode = c("ORF", "MSS"))
oneRNA |
one RNA loaded by function |
mode |
can be |
A data frame. The MLC region, length and coverage of the MLC will be returned. MSS-based method is based on [1] and [2]. ORF extraction function is borrowed from our previous work [3].
[1] Yang C, Yang L, Zhou M, et al. LncADeep: an ab initio lncRNA identification and functional annotation tool based on deep learning. Bioinformatics. 2018; 34(22):3825-3834.
[2] Sun L, Luo H, Bu D, et al. Utilizing sequence intrinsic composition to classify protein-coding and long non-coding transcripts. Nucleic acids research. 2013; 41(17):e166-e166.
[3] Han S, Liang Y, Ma Q, et al. LncFinder: an integrated platform for long non-coding RNA identification utilizing sequence intrinsic composition, structural information and physicochemical property. Briefings in bioinformatics. 2019; 20(6):2009-2027.
# Use "read.fasta" function of package "seqinr" to read a FASTA file:
seqRNA <- seqinr::read.fasta(file =
"http://www.ncbi.nlm.nih.gov/WebSub/html/help/sample_files/nucleotide-sample.txt")
# Compute the MLC region using both "ORF" and "MSS" methods:
MLC_list <- lapply(seqRNA, computeMLC, mode = c("ORF", "MSS"))
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.