getFeatures: Obtain feature sets from UCSC genome browser tables

Description Usage Arguments

View source: R/annotate.r

Description

Given a vector of table names from the UCSC genome browser that all contain "chrom", "chromStart", and "chromEnd" fields, converts them to input suitable for the goldmine() "features" argument (changes column names and adjusts 0-based start coordinates to 1-based).

Usage

1
2
getFeatures(tables = c("wgEncodeRegDnaseClusteredV3",
  "wgEncodeRegTfbsClusteredV3"), genome, cachedir, sync = TRUE)

Arguments

tables

A vector of table names from UCSC (default: DnaseI and TFBS from encode for hg19).

genome

The UCSC name specific to the genome of the query coordinates (e.g. "hg19", "hg18", "mm10", etc)

cachedir

A path to a directory where a local cache of UCSC tables are stored. If equal to NULL (default), the data will be downloaded to temporary files and loaded on the fly. Caching is highly recommended to save time and bandwidth.

sync

If TRUE, then check if newer versions of UCSC tables are available and download them if so. If FALSE, skip this check. Can be used to freeze data versions in an analysis-specific cachedir for reproducibility.


jeffbhasin/goldmine documentation built on Nov. 13, 2019, 9:11 a.m.