get_overlaps: Find overlapping intervals.

Description Usage Arguments See Also Examples

View source: R/main.R

Description

Takes the genomic intervals of genes and TEs as input. Besides, the user also requires to provide three additional parameters: (i) the maximum distance allowed between the start sites of genes and TEs, (ii) whether genes and TEs must be located in same strand and (iii) TE family or subfamily name (e.g. SINE, LINE). This function helps to detect TEs that are localized within the upstream of genes of interest.

Usage

1
get_overlaps(g, r, strand, distance, repeat_type)

Arguments

g

It is a data frame containing gene annotations retrieved via get_intervals() function.

r

It is a data frame containing reformatted repeat annotations created via rm_format() function.

strand

This argument takes one of the two values: either "same" or "strandness". In case "same" option is used, gene and TE must be in same strand.

distance

It is a integer value that can be negative or positive. This argument is used for determining the maximum distance allowed between the start sites of genes and TEs.

repeat_type

This argument takes a character string and it is used to filter a specific repeat type for the downstream analysis.

See Also

.

Examples

1
2
#Find TEs that are located within the upstream of the given gene list.
TEffectR::get_overlaps(g = genes, r = rm, strand = "same", distance = 1000, repeat_type = "LINE")

karakulahlab/Tool1 documentation built on Feb. 2, 2020, 11:05 a.m.