GPL200.3pUTR.Words: Data: A probeset:3'-UTR word annotation matrix


Description

This is a binary probeset annotation matrix with Affymetrix probeset IDs in rows and predicted 3'-UTR regulatory words (oligonucleotides) in columns. A value of 1 indicates that the word appears more often than expected by random chance in the 3'-UTR of the gene to which the probeset has been mapped. Because 3'-UTRs vary in length between genes, the number of observed occurrences of each word in each 3'-UTR is divided by the number of expected occurrences. If this ratio is greater than 0.5, the word is scored as 'present' (1); otherwise it is scored as 'absent' (0). The 3'-UTR regulatory word dictionary was compiled using the MobyDick algorithm (see https://www.ncbi.nlm.nih.gov/pubmed/10977067).
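The scoring rule described above can be illustrated with a minimal sketch. This is not the package's own code: the function name `score_words`, the probeset IDs, the words, and all counts are invented for illustration, and the expected counts are simply assumed to be given rather than derived from any particular background model.

```r
# Toy observed word counts: rows are probesets, columns are candidate words.
# All IDs, words, and numbers here are made up for illustration.
observed <- matrix(
  c(3, 0,
    1, 2),
  nrow = 2, byrow = TRUE,
  dimnames = list(c("probeset_A", "probeset_B"), c("TTTGTA", "AATAAA"))
)

# Toy expected counts for the same probeset/word pairs (assumed given).
expected <- matrix(
  c(2.0, 1.5,
    4.0, 1.0),
  nrow = 2, byrow = TRUE,
  dimnames = dimnames(observed)
)

# Hypothetical helper: score a word as present (1) when the
# observed/expected ratio exceeds the threshold, and absent (0) otherwise.
score_words <- function(observed, expected, threshold = 0.5) {
  ratio <- observed / expected
  (ratio > threshold) * 1L  # logical matrix converted to integer 0/1
}

scored <- score_words(observed, expected)
print(scored)
```

With these toy numbers the ratios are 1.5, 0, 0.25 and 2, so only the first word for `probeset_A` and the second word for `probeset_B` are scored as present.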

Usage

GPL200.3pUTR.Words

Format

A binary matrix with probeset IDs in rows and words in columns.
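A matrix of this shape can be queried with ordinary base R indexing. The sketch below uses a small toy matrix as a stand-in for the real data; the row names, column names, and the two helper functions are hypothetical and are not part of the package.

```r
# Toy stand-in for a probeset-by-word binary matrix (names are invented).
words <- matrix(
  c(1L, 0L, 1L,
    0L, 0L, 1L,
    1L, 1L, 0L),
  nrow = 3, byrow = TRUE,
  dimnames = list(
    c("probeset_A", "probeset_B", "probeset_C"),
    c("word_1", "word_2", "word_3")
  )
)

# Hypothetical helper: probesets whose 3'-UTR is annotated with a given word.
probesets_with_word <- function(m, word) {
  rownames(m)[m[, word] == 1]
}

# Hypothetical helper: words annotated to a given probeset.
words_in_probeset <- function(m, probeset) {
  colnames(m)[m[probeset, ] == 1]
}

# Number of probesets annotated with each word.
word_frequency <- colSums(words)

print(probesets_with_word(words, "word_1"))
print(words_in_probeset(words, "probeset_B"))
print(word_frequency)
```

Assuming the real matrix carries probeset IDs as row names and words as column names, as the format description suggests, the same calls should apply to it with `words` replaced by the loaded object.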


MPCary/DEXDATA.Celegans documentation built on May 4, 2019, 2:35 p.m.