rlbase_samples: RLBase Sample Manifest

Description Usage Arguments Details Value Examples

View source: R/rlbase_samples.R

Description

A tbl containing metadata about each sample in RLBase.

Usage

1

Arguments

quiet

If TRUE, messages are suppressed. Default: FALSE.

Details

Source

RLBase samples were curated by hand in Excel from searching keywords such as "R-loops" and "RNA:DNA hybrids" in GEO, SRA, and PubMed. Where R-loop mapping data was publically available, entries were added to the Excel spreadsheet such that every sample (SRX.../ERX.../GSM...) had it's own line. Information was noted for each sample, such as the "mode" (the type of R-loop mapping it was) and the "Condition" (e.g., "RNaseH1", "WKKD", etc). When genomic input controls were available, they were manually matched to the experimental samples for which they could serve as a background control during peak calling.

The up-to-date excel sheet is found here.

Throughout the process of analyzing the data (see RLBase-data), additional metadata was added to the sample sheet (see structure for full account).

Structure

rlbase_samples is a tbl with the structure:

rlsample label condition mode lab tissue genotype other PMID group family ip_type strand_specific moeity bisulfite_seq file_type experiment_original control_original study name paired_end read_length control eff_genome_size genome prediction discarded numPeaks expsamples exp_matchCond coverage_s3 peaks_s3 fastq_stats_s3 bam_stats_s3 report_html_s3 rlranges_rds_s3 rlfs_rda_s3
SRX1070676 POS S96 DRIPc Fred Chedin NT2 WT NT 27373332 rl DRIP S9.6 TRUE RNA FALSE public GSM1720613 NA SRP059800 GSM1720613: NT2 DRIPc-seq, rep 1; Homo sapiens; OTHER FALSE 50 NA 2706186140 hg38 POS FALSE 34092 SRX1070685,SRX1070686 WT_NT_SRP059800_NT2 coverage/SRX1070676_hg38.bw peaks/SRX1070676_hg38.broadPeak fastq_stats/SRX1070676_hg38__fastq_stats.json bam_stats/SRX1070676_hg38__bam_stats.txt reports/SRX1070676_hg38.html rlranges/SRX1070676_hg38.rds rlfs_rda/SRX1070676_hg38.rlfs.rda
SRX1070677 POS S96 DRIPc Fred Chedin NT2 WT NT 27373332 rl DRIP S9.6 TRUE RNA FALSE public GSM1720614 NA SRP059800 GSM1720614: NT2 DRIPc-seq, rep 2; Homo sapiens; OTHER FALSE 50 NA 2706186140 hg38 POS FALSE 22117 SRX1070685,SRX1070686 WT_NT_SRP059800_NT2 coverage/SRX1070677_hg38.bw peaks/SRX1070677_hg38.broadPeak fastq_stats/SRX1070677_hg38__fastq_stats.json bam_stats/SRX1070677_hg38__bam_stats.txt reports/SRX1070677_hg38.html rlranges/SRX1070677_hg38.rds rlfs_rda/SRX1070677_hg38.rlfs.rda
SRX1070678 POS S96 DRIP Fred Chedin NT2 WT NT 27373332 rl DRIP S9.6 FALSE DNA FALSE public GSM1720615 NA SRP059800 GSM1720615: NT2 DRIP-seq, 1; Homo sapiens; OTHER FALSE 50 NA 2706186140 hg38 POS FALSE 73924 SRX1070685,SRX1070686 WT_NT_SRP059800_NT2 coverage/SRX1070678_hg38.bw peaks/SRX1070678_hg38.broadPeak fastq_stats/SRX1070678_hg38__fastq_stats.json bam_stats/SRX1070678_hg38__bam_stats.txt reports/SRX1070678_hg38.html rlranges/SRX1070678_hg38.rds rlfs_rda/SRX1070678_hg38.rlfs.rda
... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ...

Column description:

Value

A tbl.

Examples

1
rlsamples <- rlbase_samples()

Bishop-Laboratory/RLHub documentation built on Jan. 20, 2022, 3:47 p.m.