align_rna: Alignment of RNA-seq reads

Description Usage Arguments Value Examples

View source: R/align_rna.R

Description

Alignment of RNA-seq reads

Usage

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
align_rna(threads = 1, plot = TRUE, out.dir = ".", bigwig = FALSE,
  hisat2 = "hisat2", samtools = "samtools", sambamba = "sambamba",
  idx = NULL, reads.dir = NULL, reads1 = NULL, reads2 = NULL,
  fastq = TRUE, fasta = FALSE, softClipPenalty = NULL,
  noSoftClip = FALSE, knownSplice = NULL, strand = NULL, tmo = FALSE,
  secondary = FALSE, maxAlign = NULL, nomixed = FALSE,
  nodiscordant = FALSE, rgid = NULL, quiet = FALSE,
  non_deterministic = TRUE, memory = "1G", remove.mitochondrial = "ChrM",
  hash_table = 262144, overflow_size = 2e+05, io_buffer = 128,
  scale = 2e+07)

Arguments

threads

A positive integer specifying the number of sorting and compression threads

plot

Logical. If TRUE, a PNG summarising the alignment statistics is produced

out.dir

Character string. Directory to save results in. If the directory does not exist, it will be created

bigwig

Logical indicating whether bigwig files should be created. This is memory and time-consuming.

hisat2

The path to hisat2 (if not in executable path).

samtools

The path to samtools (if not in executable path).

sambamba

The path to sambamba (if not in executable path).

idx

The basename of the index for the reference genome. The basename is the name of any of the index files up to but not including the final .1.ht2, etc.

reads.dir

Character string. Directory containing the raw reads. If this is specified, then reads1 and reads2 should be set as NULL.

reads1

Character vector of mate1 reads. If specified, then reads.dir must be NULL.

reads2

Character vector of mate2 reads. If specified, then reads.dir must be NULL. Must be the same length as mate1. If single-end sequencing, then should be left as NULL.

fastq

Logical indicating if reads are FASTQ files.

fasta

Logical indicating if reads are FASTA files.

softClipPenalty

Sets the maximum (MX) and minimum (MN) penalties for soft-clipping per base, both integers. Must be given in the format "MX,MN".

noSoftClip

Logical indicating whether to disallow soft-clipping.

knownSplice

Path to text file containing known splice sites.

strand

Specify strand-specific information. Default is unstranded.

tmo

Logical indicating whether to report only those reads aligning to known transcripts.

secondary

Logical indicating whether to report secondary alignments.

maxAlign

Integer indicating the maximum number of distinct primary alignments to search for each read.

nomixed

By default, when hisat2 cannot find a concordant or discordant alignment for a pair, it then tries to find alignments for the individual mates. If TRUE, this option disables that behavior.

nodiscordant

By default, hisat2 looks for discordant alignments if it cannot find any concordant alignments. If true, this option disables that behavior.

rgid

Character string, to which the read group ID is set.

quiet

If TRUE, print nothing except alignments and serious errors.

non_deterministic

When set to TRUE, HISAT2 re-initializes its pseudo-random generator for each read using the current time.#' @param rgid

memory

String specifying maximum memory per thread; suffix K/M/G recognized.

remove.mitochondrial

Character string. If set, this will remove reads mapping to the mitochondrial genome. The string should match the reference name for the mitochindrial genome in the alignment file. Examples include "ChrM", "M" and "MT".

hash_table

Size of hash table for finding read pairs (default is 262144 reads); will be rounded down to the nearest power of two. For best performance should be > (average coverage) * (insert size).

overflow_size

Size of the overflow list where reads, thrown out of the hash table, get a second chance to meet their pairs (default is 200000 reads); increasing the size reduces the number of temporary files created.

io_buffer

Controls sizes of the two buffers (in MB) used for reading and writing BAM during the second pass (default is 128).

scale

The number of reads to scale to. If NULL no scaling performed.

Value

Raw and filtered BAM files +/- bigwig files

Examples

1
2
3
4
5
6
7
## Not run: 
align_rna(threads = 2, plot = TRUE, out.dir = ".", bigwig = FALSE,
          hisat2 = "hisat2", samtools = "samtools", sambamba = "sambamba",
          idx = "../prana/data-raw/index/UCSC.hg19",
          reads.dir = "../prana/data-raw/seqFiles", fastq = TRUE)

## End(Not run)

anilchalisey/parseR documentation built on May 7, 2019, 7:45 a.m.