clustalRNA: alignment of RNA sequences for the spike protein of some...
In geantonicelli/firstPackage: set of wrapper functions for phylogenetic analysis

Description Usage Format Details Source

14 different sequences of some sarbecoviruses closely related to SARSCoV2 were retrieved from public databases and compared using CLUSTAL omega (1.2.4) alignment software with the output in clustal format (see examples section for the complete protocol for retrieving, saving and processing the data)

1	clustalRNA

a list of class alignment with 4 elements

nb: number of sequences compared
nam: sequence name (accession number)
seq: a list of RNA sequences
com: NA

spikeRNA_annot <- read.csv(system.file('extdata', 'spike_annot.csv', package='firstPackage'))
accessions <- spikeRNA_annot$accession
download manually each sequence from 'genbank' and save as spike.fasta and perform an alignment with clustal omega with following commands (it needs to be installed locally or use a web interface)
clustalo –threads=4 -v –outfmt=clustal -t RNA -i ./data/sequences/spike.fasta -o ./data/sequences/spike_align.aln
clustalRNA_load <- load_alignment(system.file('extdata', 'spike_align.aln', package='firstPackage'), 'clustal')

the genbank database of the National Center for Biotechnology Information or the Universal Protein Resource UniProt database

geantonicelli/firstPackage documentation built on Aug. 24, 2020, 3:14 a.m.