ch22Promoters Data Set

Share:

Description

Toy data set consisting of promoter regions of 150 random genes from the human chromosome 22 (obsolete)

Usage

1

Format

A data set with 150 DNA sequences. Each string is a nucleotide sequence that corresponds to the promoter region of a gene from the human chromosome no. 22 (according to the human genome assembly hg18). The sequences start 999 bases upstream of the transcription start site (TSS) and end with the TSS itself. The names attribute contains the RefSeq IDs of the genes.

In previous version of the apcluster package, this was an R object that can be loaded via data(ch22Promoters). For better compatibility with the kebabs package, the data set has been moved to a plain text file (in FASTA format) that can be loaded from inst/examples (see examples below).

Examples

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
## load Biostrings package
library(Biostrings)

## load data set
filepath <- system.file("examples", "ch22Promoters.fasta",
                        package="apcluster")
ch22Promoters <- readDNAStringSet(filepath)

## display sequences
ch22Promoters