training: Training data

training R Documentation

Training data

Description

The data used to train the predictive model.

Usage

training
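
As a minimal sketch of how the dataset might be loaded, assuming the iCodon package is installed and ships `training` as a regular package dataset: the helper name `load_training` is hypothetical and not part of the package. It returns NULL when the package is not available.

```r
# Hypothetical helper (not part of iCodon): load the `training` data frame
# if the package is installed, otherwise return NULL.
load_training <- function() {
  if (!requireNamespace("iCodon", quietly = TRUE)) {
    return(NULL)
  }
  env <- new.env()
  # Assumption: `training` is available through data() for this package.
  utils::data("training", package = "iCodon", envir = env)
  env$training
}

training_df <- load_training()
if (!is.null(training_df)) {
  # Inspect the column types and the first few rows.
  str(training_df)
  head(training_df)
}
```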

Format

A data frame with 67,775 rows and 8 variables:

gene_id

Ensembl gene ID

specie

The species, one of: human, fish, mouse, or xenopus

cell_type

The cell type, one of: 293t, embryo mz, hela, mES cells, k562, or RPE

datatype

The experimental technique used to obtain the mRNA stability measurements (decay rate)

decay_rate

The decay rate (variable to predict), scaled with respect to each experiment

utrlenlog

Length of the 5' UTR, in log scale

coding

Coding DNA sequence, in frame

cdslenlog

Length of the coding sequence, in log scale
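
Because `coding` holds the coding sequence in frame, it can be read as consecutive triplets. The sketch below illustrates this on made-up toy sequences. The helper `split_codons`, the toy data frame, and its placeholder gene ids are assumptions for illustration and do not come from the package or the dataset.

```r
# Toy rows with made-up values that mimic two of the documented columns.
toy <- data.frame(
  gene_id = c("gene_a", "gene_b"),          # placeholders, not real Ensembl ids
  coding  = c("ATGGCCTAA", "ATGAAAGGGTGA"), # invented in-frame sequences
  stringsAsFactors = FALSE
)

# Hypothetical helper: split one in-frame sequence into consecutive codons.
split_codons <- function(cds) {
  # An in-frame sequence should have a length that is a multiple of 3.
  stopifnot(nchar(cds) >= 3, nchar(cds) %% 3 == 0)
  starts <- seq(1, nchar(cds) - 2, by = 3)
  substring(cds, starts, starts + 2)
}

# One character vector of codons per gene.
codons <- lapply(toy$coding, split_codons)
names(codons) <- toy$gene_id

# Codon counts for the first toy gene.
table(codons[["gene_a"]])
```

The same helper could be applied to the `coding` column of `training`, provided every sequence there has a length divisible by three.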

Details

The decay rate measurements were obtained from the following papers:

  1. Bazzini, Ariel A., et al. "Codon identity regulates mRNA stability and translation efficiency during the maternal‐to‐zygotic transition."

  2. Wu, Qiushuang, et al. "Translation affects mRNA stability in a codon-dependent manner in human cells." eLife 8 (2019): e45396.

  3. Herzog, Veronika A., et al. "Thiol-linked alkylation of RNA to assess expression dynamics." Nature Methods 14.12 (2017): 1198.


santiago1234/iCodon documentation built on Nov. 2, 2023, 2:03 p.m.