| single_train | R Documentation | 
Main function to train a SINGLE model in a set of reads of a reference / wild type sequence. To get the input data you will need to run before a minimap2 alignment and samtools counts.
single_train(
  bamfile,
  output = "results",
  refseq_fasta,
  rates.matrix = NULL,
  mean.n.mutations = NULL,
  pos_start = NULL,
  pos_end = NULL,
  verbose = TRUE,
  save_partial = FALSE,
  save_final = FALSE
)
bamfile | 
 File containing the counts per position returned by samtools mpileup  | 
output | 
 String. Prefix for output files  | 
refseq_fasta | 
 Fasta file containing reference sequence  | 
rates.matrix | 
 Mutation rate matrix: 4x5 matrix, each row/col representing a nucleotide (col adds deletion), and the values is the mutational rate from row to col.  | 
mean.n.mutations | 
 Mean number of mutations expected (one number).  | 
pos_start | 
 Numeric. Position to start analyzing, counting starts from 1 and it refers to reference used for minimap2 alignment.  | 
pos_end | 
 Numeric. Position to stop analyzing, counting starts from 1 and it refers to reference used for minimap2 alignment.  | 
verbose | 
 Logical.  | 
save_partial | 
 Logical. Should partial results be saved in files?  | 
save_final | 
 Logical. Should final fits be saved in a file?  | 
Before running single_train_function you have to align your INPUT data to a REFERENCE using minimap2 and count the nucleotides per position using samtools using these lines:
minimap2 -ax map-ont --sam-hit-only  REFERENCE.fasta INPUT.fastq >ALIGNMENT.sam
samtools view -S -b ALIGNMENT.sam > ALIGNMENT.bam
samtools sort ALIGNMENT.bam -o ALIGNMENT.sorted.bam 
samtools mpileup -Q 0 ALIGNMENT.sorted.bam > COUNTS.txt
Creates file output_prefix_single_results.txt with SINGLE training results.
refseq_fasta<- system.file("extdata", "ref_seq.fasta", package = "single")
train_reads_example <- system.file("extdata", "train_seqs_500.sorted.bam",
                                   package = "single")
train <- single_train(bamfile=train_reads_example,
                   refseq_fasta=refseq_fasta,
                   rates.matrix=mutation_rate,mean.n.mutations=5.4,
                   pos_start=1,pos_end=10)
print(head(train))
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.