View source: R/test.train.cds.R
test.train.cds | R Documentation |
This function allows you to split a fast format sequence file into two sets: test and train. As a result, the function outputs a list consisting of two elements "test" and "train" containing sequence names.
test.train.cds(cds.fa, percent_train = 0.6)
cds.fa |
the variable containing sequences in fasta format (from read.fasta() function from seqinr library) |
percent_train |
the ratio value to divide the input set (he default is 0.6, which means that 60% of the sequence is to be allocated to the training set) |
cds <- read.fasta("lncRNA/Mus_musculus.GRCm39.cds.all.fa", seqtype = "DNA", as.string = T, set.attributes = F)
cds_tt <- test.train.cds(cds.fa = cds, percent_train = 0.6)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.