demo_dataset: A demo of dataset

Description Usage Format Source

Description

This dataset contains the features of 20 lncRNA sequences and 20 protein-coding sequences.

Usage

1

Format

A data frame with 40 rows and 20 variables:

Label

the class of the sequences

ORF.Max.Len

the length of the longest ORF

ORF.Max.Cov

the coverage of the longest ORF

Seq.lnc.Dist

Log-Distance.lncRNA

Seq.pct.Dist

Log-Distance.protein-coding transcripts

Seq.Dist.Ratio

Distance-Ratio.sequence

Signal.Peak

Signal as 1/3 position

SNR

Signal to noise ratio

Signal.Min

the minimum value of the top 10% power spectrum

Signal.Q1

the quantile Q1 of the top 10% power spectrum

Signal.Q2

the quantile Q2 of the top 10% power spectrum

Signal.Max

the maximum value of the top 10% power spectrum

Dot_lnc.dist

Log-Distance.acguD.lncRNA

Dot_pct.dist

Log-Distance.acguD.protein-coding transcripts

Dot_Dist.Ratio

Distance-Ratio.acguD

SS.lnc.dist

Log-Distance.acgu-ACGU.lncRNA

SS.pct.dist

Log-Distance.acgu-ACGU.protein-coding transcripts

SS.Dist.Ratio

Distance-Ratio.acgu-ACGU

MFE

Minimum free energy

UP.PCT

Percentage of Unpair-Pair

Source

Sequences are selected from GENCODE.


LncFinder documentation built on Dec. 11, 2021, 9:39 a.m.