rlregions: R-loop Regions

Description Usage Arguments Details Value Source Structure Examples

Description

R-loop regions (RL regions) are consensus sites of R-loop formation derived from a meta-analysis of the R-loop mapping experiments in RLBase. RLHub includes information about these regions (rlregions_meta), the overlap of genomic features with RL regions (rlregions_annot), and the read count matrices for each sample in RLBase within each RL region (rlregions_counts).

Usage

1
2
3
4
5

Arguments

quiet

If TRUE, messages are suppressed. Default: FALSE.

Details

R-loop regions (RL regions) are consensus sites of R-loop formation derived from a meta-analysis of the R-loop mapping experiments in RLBase. In RLHub, we provide access to:

Value

A list of tbl objects. See details.

Source

RL Regions - rlregions_meta

RL Regions were derived during this step of the RLBase-data protocol.

Here is a brief summary of the key processing steps.

  1. Human RLBase samples which were labeled as "POS", classified as "POS" (see rlfs_res), and which have at least 5000 peaks were selected.

  2. Then, these peaks were then randomly downsampled to 5000 ranges each and aggregated.

  3. Then, the hg38 genome was binned into 10bp bins and the number of aggregated peaks was overlapped with each bin and counted, producing a bedGraph.

  4. Then, macs3 was implemented to call peaks on the bedGraph file.

  5. Finally, the resulting ranges were combined to create summary ranges.

Note: steps 2-4 were performed separately on datasets from dRNH (catalytically-dead RNaseH1) R-loop mapping experiments and S9.6-based experiments due to the noted differences in R-loop mapping between these modalities. In step 5, they were combined to create summary ranges, and the original source (dRNH or S9.6) was noted in the resulting metadata.

RL Regions anno - rlregions_annot

Once RL regions were generated, they were overlapped with the annotations found in annots_full_hg38(). This was done separately for dRNH-derived, S9.6-derived, and combined RL regions.

RL Regions Counts - rlregions_counts

Once RL regions were generated, Rsubread::featureCounts() was used to count the number of overlapping reads from the .bam alignment files for each sample in RLBase (rlbase_samples) within each RL region (only "combined" regions used).

These data were then VST-transformed and TPM was also calculated.

Structure

RL Regions - rlregions_meta

rlregions_meta() is a tbl with the following structure:

rlregion location is_rlfs source confidence_level medSignalVal medPVal medQVal avgSignalVal avgPVal avgQVal numPeaks nSamples nStudies nModes nTissues ip_types nIPTypes pct_case avgNumPeaks medNumPeaks geneIDs allGenes mainGenes is_repeat
All_RL64926 chrM:0-16569:. TRUE dRNH S96 53.21 1.962881 20.92420 17.01782 2.563848 653.89461 649.8001 52373 298 44 16 36 S9.6,dRNH,RNA-m6A,None 4 75.16779 80117.79 45063 NA NA NA TRUE
All_RL19159 chr5:49656610-49661950:. FALSE dRNH S96 36.55 2.826610 19.16160 15.24025 3.282115 108.41481 104.6716 107393 261 43 15 34 S9.6,RNA-m6A,None,dRNH 4 77.01149 73058.96 43815 NA NA NA TRUE
All_RL50251 chr16:46386300-46391280:. FALSE dRNH S96 31.68 3.258560 25.21965 21.00418 3.428755 87.33233 83.3639 133750 243 43 16 33 RNA-m6A,S9.6,dRNH,None 4 76.13169 65721.60 38878 NA NA NA TRUE
... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ...

Column description:

RL Regions anno - rlregions_annot

rlregions_annot() is a tbl with the following structure:

rlregion annotation
All_RL4615 CNA__3__4
All_RL4615 CNA__1__8
All_RL4615 G4Qexp__G4Q_PDS_NaK_GSE63874__95848
... ...

Column description:

RL Regions Counts - rlregions_counts

rlregions_counts() is a SummarizedExperiment with the following structure:

experiment label strand_specific paired_end mode prediction discarded numPeaks bam bam_avail
SRX113814 POS FALSE FALSE DRIVE NEG TRUE 337 ../RLBase-data/rlbase-data/rlpipes-out/bam/SRX113814/SRX113814_hg38.bam TRUE
SRX113812 POS FALSE FALSE DRIP NEG TRUE 78 ../RLBase-data/rlbase-data/rlpipes-out/bam/SRX113812/SRX113812_hg38.bam TRUE
SRX2675003 POS TRUE FALSE R-ChIP POS FALSE 10331 ../RLBase-data/rlbase-data/rlpipes-out/bam/SRX2675003/SRX2675003_hg38.bam TRUE
... ... ... ... ... ... ... ... ... ...

Column description:

Examples

1
2
3
4
5
rlregions <- rlregions_meta()

rlregionsAnno <- rlregions_annot()

rlregionsCounts <- rlregions_counts()

Bishop-Laboratory/RLHub documentation built on Jan. 20, 2022, 3:47 p.m.