test.train.cds: test.train.cds function

View source: R/test.train.cds.R

test.train.cdsR Documentation

test.train.cds function

Description

This function allows you to split a fast format sequence file into two sets: test and train. As a result, the function outputs a list consisting of two elements "test" and "train" containing sequence names.

Usage

test.train.cds(cds.fa, percent_train = 0.6)

Arguments

cds.fa

the variable containing sequences in fasta format (from read.fasta() function from seqinr library)

percent_train

the ratio value to divide the input set (he default is 0.6, which means that 60% of the sequence is to be allocated to the training set)

Examples

cds <- read.fasta("lncRNA/Mus_musculus.GRCm39.cds.all.fa", seqtype = "DNA", as.string = T, set.attributes = F)
cds_tt <- test.train.cds(cds.fa = cds, percent_train = 0.6)

prodakt/lncRna documentation built on June 1, 2025, 1:05 p.m.