genes: Genetics datasets

Description

Eight genetics datasets (b1, b2, b3, g1, g2, w1, w2, w3) are used in this paper.

Usage

Format

Each of the eight datasets contains 40 columns. The first column, count, gives the number of times a sequence was found in different RNA sequences. Columns 2 through 40 contain the 20 nucleotides to the left and the 19 nucleotides to the right of a particular nucleotide.
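The layout described above can be illustrated with a small synthetic data frame. This is only a sketch: the column names left_20, ..., right_19, the random bases, and the Poisson counts are assumptions made for illustration and are not taken from the package. Only the count column name and the total of 40 columns come from the description above.

```r
# Toy stand-in with the same shape as one of the eight datasets.
# All values below are synthetic; the real data ship with the package.
set.seed(1)
n_rows <- 5
bases <- c("A", "C", "G", "T")

# Hypothetical names for the 39 nucleotide columns:
# 20 positions to the left, then 19 positions to the right.
nuc_names <- c(paste0("left_", 20:1), paste0("right_", 1:19))

# One randomly drawn base per row and position.
nucs <- as.data.frame(
  matrix(sample(bases, n_rows * length(nuc_names), replace = TRUE),
         nrow = n_rows, dimnames = list(NULL, nuc_names))
)

# Put the count column first, as in the documented format.
toy <- cbind(count = rpois(n_rows, lambda = 3), nucs)

ncol(toy)      # total number of columns (1 count + 39 nucleotides)
names(toy)[1]  # name of the first column
```

The same checks (ncol() and names()) could be applied to b1 or any of the other datasets once the package is installed and the data are loaded.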

The datasets beginning with b are from humans and originated with the Burge group (http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2593745/).

The datasets beginning with g are from mice and are due to the Grimmond group (http://www.nature.com/nmeth/journal/v5/n7/full/nmeth.1223.html).

The datasets beginning with w are also from mice and are due to the Wold group (http://www.nature.com/nmeth/journal/v5/n7/full/nmeth.1226.html).

Source

These data were downloaded from Prof. Jun Li and then processed using his package mseq.


dajmcdon/cplr documentation built on May 14, 2019, 3:29 p.m.