preTesting: NMDer workflow: Check and match input annotations

Description Usage Arguments Value

View source: R/prep.preTesting.R

Description

This program checks the chromosome ID and gene ID of query, reference and genome objects and if requested, attempts to match these entries

Usage

1
2
3
4
5
6
7
8
9
preTesting(
  inputGRanges,
  basicGRanges,
  genome,
  correct_chrom,
  correct_gene_id,
  primary_gene_id,
  secondary_gene_id
)

Arguments

inputGRanges

Query GRanges object

basicGRanges

Reference GRanges object

genome

Genome sequence as a Biostring object

correct_chrom

Supplementary feature. If TRUE, program will attempt to match chromosome names of query and reference to fasta genome to ensure consistent naming across input files.

correct_gene_id

Supplementary feature to attempt to match gene IDs in query file to reference file. This is key in grouping query transcripts to reference gene families for comparison.

Matching is done at three levels with increasing accuracy:

1. Crudely intersecting query coordinates with reference. Invoked by setting match_geneIDs to TRUE sprintf('Remaining number of non-standard gene_ids: 2. Trim ensembl-style gene IDs and attempt matching. Invoked by providing name of gene ID header (typically 'gene_id') from gtf file to primary_gene_id argument

3. Replace query gene ID with a secondary gene ID and attempt matching. Invoked by providing name of secondary gene ID header (for example 'ref_gene_id') from gtf file to secondary_gene_id argument

primary_gene_id

See 'correct_gene_id' for details

secondary_gene_id

See 'correct_gene_id' for details

Value

Query GRanges object that have been matched


fursham-h/ponder documentation built on Dec. 27, 2019, 12:15 a.m.