getProfiles: Get profiles of ChIP-signal over annotated features

Description Usage Arguments Value Author(s) See Also Examples

View source: R/getProfiles.R

Description

This function associates the measured ChIP signals to annotated features and stores the profile of each feature in a list. Each profile is divided in three parts. The first entry is ”upstream”, which saves the signal upstream of start. Then follows ”region”, which is from start to end and then ”downstream”, which stores the signals downstream of end.

Usage

1
getProfiles(eSet, probeAnno, gffAnno, upstream, downstream, feature="ORF", borderNames, method, sameLength=T, fill=T, distance=8, spacing=4)

Arguments

eSet

an ExpressionSet, containing on sample.

probeAnno

a probeAnno object for the given ExpressionSet

gffAnno

a data frame containing the annotation of the features of interest

upstream

how many basepairs upstream of the feature start (feature start on the crick strand is end in gffAnno) should be taken.

downstream

how many basepairs downtream of the feature start (feature end on the crick strand is start in gffAnno) should be taken.

feature

name of the features (e.g. ORF, transcript, rRNA, ...)

borderNames

names of the borders, flaking the feature (e.g. c("start", "stop"))

method

Two methods are available. "middle", just takes the middle position of each probe and its corresponding value. This method should be used if the whole genome is tiled in an high resolution. "basewise" calculates for each base the mean of all probes overlapping with this position.

fill

if "middle" is chosen the distance of the taken values equals the probe spacing on the chip. To avoid errors, because of regions lacking of probes, one can fill up these regions with NAs.

distance

if method "middle" and fill==TRUE are chosen, distance is the max distance of no value occuring before filling in one NA.

spacing

probe spacing on the chip. Only used for filling up with NAs in method "middle".

sameLength

if method "middle" is chosen it can occur that the length of the upstream/downstream region vary a little. If sameLength==TRUE, upstream/downstream regions get all the same length.

Value

a list with the following entries

ID

the ID/name of the sample

upstream

number of basepairs, taken upstream of the feature

downstream

number of basepairs, taken upstream of the feature

method

method used

borderNames

names of the borders

feature

feature type (e.g. "ORF")

profile

a list which contains all profiles of the features in the gffAnno. Each entry consists of a list with the elements "upstream", "region", "downstream".

Author(s)

Benedikt Zacher zacher@lmb.uni-muenchen.de

See Also

fill,fillNA,mapFeatures,getIntensities,getFeature, fill,getProfilesByBase

Examples

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
20
21
## 
# dataPath <- system.file("extdata", package="Starr")
# bpmapChr1 <- readBpmap(file.path(dataPath, "Scerevisiae_tlg_chr1.bpmap"))

# cels <- c(file.path(dataPath,"Rpb3_IP_chr1.cel"), file.path(dataPath,"wt_IP_chr1.cel"), 
# 	file.path(dataPath,"Rpb3_IP2_chr1.cel"))
# names <- c("rpb3_1", "wt_1","rpb3_2")
# type <- c("IP", "CONTROL", "IP")
# rpb3Chr1 <- readCelFile(bpmapChr1, cels, names, type, featureData=TRUE, log.it=TRUE)

# ips <- rpb3Chr1$type == "IP"
# controls <- rpb3Chr1$type == "CONTROL"

# rpb3_rankpercentile <- normalize.Probes(rpb3Chr1, method="rankpercentile")
# description <- c("Rpb3vsWT")
# rpb3_rankpercentile_ratio <- getRatio(rpb3_rankpercentile, ips, controls, description, fkt=median, featureData=FALSE)

# probeAnnoChr1 <- bpmapToProbeAnno(bpmapChr1)
# transcriptAnno <- read.gffAnno(file.path(dataPath, "transcriptAnno.gff"), feature="transcript")

# profile <- getProfiles(rpb3_rankpercentile_ratio, probeAnnoChr1, transcriptAnno, 500, 500, feature="transcript", borderNames=c("TSS", "TTS"), method="basewise", sameLength=T, fill=T, distance=8, spacing=4)

Starr documentation built on April 28, 2020, 7:52 p.m.