reg2gff: Converting Information about Enriched Regions into GFF Format
In humburg/tileHMM: Hidden Markov Models for ChIP-on-Chip Analysis

Description Usage Arguments Details Value Author(s) References See Also Examples

Creates a GFF formatted file with information about enriched regions.

1
2
3

reg2gff(regions, post, probe.pos, file = NULL, score.fun = mean, 
    source = "tHMM", feature.type = "posterior_prob", 
    class = "ChIP_region", name = "tHMM")

`regions`	A matrix indicating enriched regions (see `region.position`).
`post`	A vector with posterior probabilities for each probe.
`probe.pos`	A `data.frame` with columns "`chromosome`" and "`position`" providing genomic coordinates for each probe.
`file`	Name of GFF file to create.
`score.fun`	Function used to calculate score of enriched regions.
`source`	Entry for ‘source’ field of GFF file.
`feature.type`	Entry for ‘feature’ field of GFF file.
`class`	Class of documented feature. This is used in the ‘attribute’ field together with `name`.
`name`	Name of documented feature. This is used in the ‘attribute’ field together with `class`.

post should provide scores for each probe. These scores are then summarised for each region by applying score.fun to the probe scores in each region. The default for score.fun is mean but any function that accapts a single numeric vector as its argument and returns a scalar can be used.

A GFF formated data.frame is returned invisibly. Usually this function is called for its side effect.

Peter Humburg

GFF specification: http://www.sanger.ac.uk/Software/formats/GFF/GFF_Spec.shtml

region.position

## create two state HMM with t distributions
state.names <- c("one","two")
transition <- c(0.1, 0.2)
location <- c(1, 2)
scale <- c(1, 1)
df <- c(4, 6)
model <- getHMM(list(a=transition, mu=location, sigma=scale, nu=df),
    state.names)

## obtain observation sequence from model
obs <- sampleSeq(model, 100)

## make up some genomic probe coordinates
pos <- data.frame(chromosome = rep("chr1", times = 100), 
    position = seq(1, 4000, length = 100))

## compute most likely state sequence for obs
vit.res <- viterbi(model, obs)

## find regions attributed to state "one"
reg.pos <- region.position(vit.res$stateSeq, region="one")

## calculate posterior probability for state "one"
post <- posterior(obs, model, log=FALSE)[1,]

## create gff annotations
gff <- reg2gff(reg.pos, post, pos)