summarise_abi_file: Create a summary of a single ABI sequencing file

View source: R/summarise_abi_file.R

summarise_abi_fileR Documentation

Create a summary of a single ABI sequencing file

Description

Takes a single ABI sequencing file and returns a summary of the file. The summary includes basic quality control metric of the sequence.

Usage

summarise_abi_file(
  seq.abif,
  trim.cutoff = 1e-04,
  secondary.peak.ratio = 0.33,
  output.folder = NA,
  prefix = "seq",
  processors = NULL
)

Arguments

seq.abif

an abif.seq s4 object from the sangerseqR package

trim.cutoff

the cutoff at which you consider a base to be bad. This works on a logarithmic scale, such that if you want to consider a score of 10 as bad, you set cutoff to 0.1; for 20 set it at 0.01; for 30 set it at 0.001; for 40 set it at 0.0001; and so on. Contiguous runs of bases below this quality will be removed from the start and end of the sequence. Default is 0.0001.

secondary.peak.ratio

the ratio of the height of a secondary peak to a primary peak. Secondary peaks higher than this ratio are annotated. Those below the ratio are not.

output.folder

If output.folder is NA (the default) no files are written. If a valid folder is provided, two files are written to that folder: a .csv file of the secondary peaks (see description below) and a .pdf file of the chromatogram.

prefix

If output.folder is specified, this is the prefix which will be appended to the .csv and the .pdf file. The default is "seq".

processors

Number of processors to use, or NULL (the default) for all available processors

Value

A numeric vector including:

  1. raw.length: the length of the untrimmed sequence, note that this is the sequence after conversion to a sangerseq object, and then the recalling the bases with MakeBaseCalls from the sangerseqR package

  2. trimmed.length: the length of the trimmed sequence, after trimming using trim.mott from this package and the parameter supplied to this function

  3. trim.start: the start position of the good sequence, see trim.mott for more details

  4. trim.finish: the finish position of the good sequence, see trim.mott for more details

  5. raw.secondary.peaks: the number of secondary peaks in the raw sequence, called with the secondary.peaks function from this package and the parameters supplied to this function

  6. trimmed.secondary.peaks: the number of secondary peaks in the trimmed sequence, called with the secondary.peaks function from this package and the parameters supplied to this function

  7. raw.mean.quality: the mean quality score of the raw sequence

  8. trimmed.mean.quality: the mean quality score of the trimmed sequence

  9. raw.min.quality: the minimum quality score of the raw sequence

  10. trimmed.min.quality: the minimum quality score of the trimmed sequence

Examples

## Read abif using sangerseqR package
abi_seq <- sangerseqR::read.abif(
    system.file("/extdata/sorted_sangerseq/E18_C1/A1_3_IgG_Inner.ab1",
        package = "scifer"
    )
)

## Summarise using summarise_abi_file()
summarise_abi_file(abi_seq)


rodrigarc/RepertoiR documentation built on Dec. 19, 2024, 6:14 p.m.