retrieve_seqs: Retrieve Sequences

Description Usage Arguments

View source: R/sequences.R

Description

Retrieve Sequences

Usage

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
retrieve_seqs(
  experiment,
  data_type = c("tss", "tsr", "shift"),
  samples = "all",
  genome_assembly = NULL,
  threshold = NULL,
  dominant = FALSE,
  fixed_size = FALSE,
  extend_upstream = NULL,
  extend_downstream = NULL,
  return_format = "biostrings",
  output_dir = NULL
)

Arguments

experiment

TSRexploreR object.

data_type

Get sequences around TSSs ('tss'), TSRs ('tsr'), or TSS shifting results ('shift').

samples

A vector of sample names to analyze.

genome_assembly

Genome assembly in FASTA or BSgenome format.

threshold

TSSs or TSRs with a score below this value will not be considered.

dominant

If TRUE, will only consider the highest-scoring TSS per gene, transcript, or TSR or highest-scoring TSR per gene or transcript.

fixed_size

Whether all returned sequences should have the same fixed size starting from the center of the ranges.

extend_upstream

Distance to extend ranges upstream.

extend_downstream

Distance to extend ranges downstream.

return_format

Either 'table' or 'biostrings'.

output_dir

If set, the results will be saved to this directory. If return format is set to 'biostrings' the sequences will be saved as FASTA files. If return format is a table, it will be saved as a tab delimited table.


rpolicastro/tsrexplorer documentation built on Oct. 17, 2021, 3:02 p.m.