SangerAlignment | R Documentation |
the wrapper function for SangerAlignment
SangerAlignment(
printLevel = "SangerAlignment",
inputSource = "ABIF",
processMethod = "REGEX",
ABIF_Directory = NULL,
FASTA_File = NULL,
REGEX_SuffixForward = NULL,
REGEX_SuffixReverse = NULL,
CSV_NamesConversion = NULL,
geneticCode = GENETIC_CODE,
TrimmingMethod = "M1",
M1TrimmingCutoff = 1e-04,
M2CutoffQualityScore = NULL,
M2SlidingWindowSize = NULL,
baseNumPerRow = 100,
heightPerRow = 200,
signalRatioCutoff = 0.33,
showTrimmed = TRUE,
refAminoAcidSeq = "",
minReadsNum = 2,
minReadLength = 20,
minFractionCall = 0.5,
maxFractionLost = 0.5,
acceptStopCodons = TRUE,
readingFrame = 1,
processorsNum = 1
)
inputSource |
The input source of the raw file. It must be |
ABIF_Directory |
The parent directory of all of the reads contained in ABIF format you wish to analyse. In SangerAlignment, all reads in subdirectories will be scanned recursively. |
FASTA_File |
If |
REGEX_SuffixForward |
The suffix of the filenames for forward reads in regular expression, i.e. reads that do not need to be reverse-complemented. For forward reads, it should be |
REGEX_SuffixReverse |
The suffix of the filenames for reverse reads in regular expression, i.e. reads that need to be reverse-complemented. For revcerse reads, it should be |
CSV_NamesConversion |
The file path to the CSV file that provides read names that follow the naming regulation. If |
geneticCode |
Named character vector in the same format as |
TrimmingMethod |
TrimmingMethod The read trimming method for this SangerRead. The value must be |
M1TrimmingCutoff |
The trimming cutoff for the Method 1. If |
M2CutoffQualityScore |
The trimming cutoff quality score for the Method 2. If |
M2SlidingWindowSize |
The trimming sliding window size for the Method 2. If |
baseNumPerRow |
It defines maximum base pairs in each row. The default value is |
heightPerRow |
It defines the height of each row in chromatogram. The default value is |
signalRatioCutoff |
The ratio of the height of a secondary peak to a primary peak. Secondary peaks higher than this ratio are annotated. Those below the ratio are excluded. The default value is |
showTrimmed |
The logical value storing whether to show trimmed base pairs in chromatogram. The default value is |
refAminoAcidSeq |
An amino acid reference sequence supplied as a string or an AAString object. If your sequences are protein-coding DNA seuqences, and you want to have frameshifts automatically detected and corrected, supply a reference amino acid sequence via this argument. If this argument is supplied, the sequences are then kept in frame for the alignment step. Fwd sequences are assumed to come from the sense (i.e. coding, or "+") strand. The default value is |
minReadsNum |
The minimum number of reads required to make a consensus sequence, must be 2 or more. The default value is |
minReadLength |
Reads shorter than this will not be included in the readset. The default |
minFractionCall |
Minimum fraction of the sequences required to call a consensus sequence for SangerContig at any given position (see the ConsensusSequence() function from DECIPHER for more information). Defaults to 0.75 implying that 3/4 of all reads must be present in order to call a consensus. |
maxFractionLost |
Numeric giving the maximum fraction of sequence information that can be lost in the consensus sequence for SangerContig (see the ConsensusSequence() function from DECIPHER for more information). Defaults to 0.5, implying that each consensus base can ignore at most 50 percent of the information at a given position. |
acceptStopCodons |
The logical value |
readingFrame |
|
processorsNum |
The number of processors to use, or NULL (the default) for all available processors. |
minFractionCallSA |
Minimum fraction of the sequences required to call a consensus sequence for SangerAlignment at any given position (see the ConsensusSequence() function from DECIPHER for more information). Defaults to 0.75 implying that 3/4 of all reads must be present in order to call a consensus. |
maxFractionLostSA |
Numeric giving the maximum fraction of sequence information that can be lost in the consensus sequence for SangerAlignment (see the ConsensusSequence() function from DECIPHER for more information). Defaults to 0.5, implying that each consensus base can ignore at most 50 percent of the information at a given position. |
A SangerAlignment instance.
Kuan-Hao Chao
rawDataDir <- system.file("extdata", package = "sangeranalyseR")
parentDir <- file.path(rawDataDir, "Allolobophora_chlorotica", "RBNII")
REGEX_SuffixForward <- "_[0-9]*_F.ab1$"
REGEX_SuffixReverse <- "_[0-9]*_R.ab1$"
sangerAlignment <- SangerAlignment(
inputSource = "ABIF",
ABIF_Directory = parentDir,
REGEX_SuffixForward = REGEX_SuffixForward,
REGEX_SuffixReverse = REGEX_SuffixReverse,
refAminoAcidSeq = "SRQWLFSTNHKDIGTLYFIFGAWAGMVGTSLSILIRAELGHPGALIGDDQIYNVIVTAHAFIMIFFMVMPIMIGGFGNWLVPLMLGAPDMAFPRMNNMSFWLLPPALSLLLVSSMVENGAGTGWTVYPPLSAGIAHGGASVDLAIFSLHLAGISSILGAVNFITTVINMRSTGISLDRMPLFVWSVVITALLLLLSLPVLAGAITMLLTDRNLNTSFFDPAGGGDPILYQHLFWFFGHPEVYILILPGFGMISHIISQESGKKETFGSLGMIYAMLAIGLLGFIVWAHHMFTVGMDVDTRAYFTSATMIIAVPTGIKIFSWLATLHGTQLSYSPAILWALGFVFLFTVGGLTGVVLANSSVDIILHDTYYVVAHFHYVLSMGAVFAIMAGFIHWYPLFTGLTLNNKWLKSHFIIMFIGVNLTFFPQHFLGLAGMPRRYSDYPDAYTTWNIVSTIGSTISLLGILFFFFIIWESLVSQRQVIYPIQLNSSIEWYQNTPPAEHSYSELPLLTN",
TrimmingMethod = "M1",
M1TrimmingCutoff = 0.0001,
M2CutoffQualityScore = NULL,
M2SlidingWindowSize = NULL,
baseNumPerRow = 100,
heightPerRow = 200,
signalRatioCutoff = 0.33,
showTrimmed = TRUE,
processorsNum = 2)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.