R/data.R

#' CRISPR data
#' 
#' Example training dataset consisting of a sequence of nucleotides of CRISPR loci
#' Filtered for unambiguous characters and contains only characters in the vocabulary \{A,G,G,T
#' \}.
#' Can be loaded to workspace via `data(crispr_sample)`.
#' @format Large character of 442.41 kB
#' @usage data(crispr_sample)
#' @references \url{https://github.com/philippmuench}
"crispr_sample"

#' Parenthesis data
#' 
#' Training dataset of synthetic parenthesis language.
#' Can be loaded to workspace via `data(parenthesis)`.
#' @format Large character of 1.00 MB
#' @usage data(parenthesis)
#' @references \url{https://github.com/philippmuench}
"parenthesis"

#' Ecoli subset
#' 
#' Subset of the E. coli genome for evaluation.
#' Can be loaded to workspace via `data(ecoli_small)`.
#' @format character 326.73 kB
#' @usage data(ecoli_small)
#' @references \url{https://www.science.org/doi/10.1126/science.277.5331.1453}
"ecoli_small"
GenomeNet/deepG documentation built on Dec. 24, 2024, 12:11 p.m.