ClustalW: ClustalW alignment



An approach for performing multiple alignments of large numbers of amino acid or nucleotide sequences is described. The method is based on first deriving a phylogenetic tree from a matrix of all pairwise sequence similarity scores, obtained using a fast pairwise alignment algorithm. See details on
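The idea of building a tree from a matrix of pairwise scores can be sketched in base R. This is only an illustration, not ClustalW's own procedure: plain edit distance (`adist`) stands in for the fast pairwise alignment scores, average-linkage clustering (`hclust`) stands in for the tree-building step, and the four toy sequences are made up.

```r
# Toy sequences; names and contents are invented for illustration only.
seqs <- c(s1 = "ACGTACGTAC", s2 = "ACGTACGTTC",
          s3 = "TTGTACCTAC", s4 = "TTGAACCTAC")

# Matrix of all pairwise edit distances (assumption: a stand-in for the
# pairwise alignment scores that ClustalW computes).
d <- adist(seqs)
dimnames(d) <- list(names(seqs), names(seqs))

# Average-linkage (UPGMA-style) clustering used here as a simple guide tree.
guide_tree <- hclust(as.dist(d), method = "average")

# Cutting the tree into two groups shows which sequences would be
# aligned with each other first in a progressive alignment.
groups <- cutree(guide_tree, k = 2)
print(groups)
```

The most similar sequences end up in the same group, which is the order a progressive aligner would follow when merging alignments.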


ClustalW(, file.path="", type="DNA", aln.filetype="CLUSTALW", 
         args=NULL,,, print.curl=FALSE,   
         shared.username=NULL, suppress.Warnings=FALSE)


Name of file to be evaluated on the Discovery Environment (DE), see details for supported input formats.


Optional path to a user's subdirectory on the DE; default path is empty, which leads to the home directory


Two options: "PROTEIN" or "DNA". This defines the type of sequences in the file.


ClustalW aligns the sequences; this option selects the file type of the resulting alignment file. There are seven options: CLUSTALW, FASTA, PHYLIP_INT, NEXUS, GCG, GDE, and PIR.


Optional additional arguments (i.e. flags). ClustalW has much additional functionality that does not fit into this wrapper function; see details. This option lets users pass anything that is not otherwise included (e.g. args="-ITERATION=TREE", to iterate at each step); see details.
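A small sketch of how such a string might be assembled from named options. `build_clustalw_args` is a hypothetical helper, not part of rPlant, and it assumes that several flags can be passed space-separated in one string; `-NUMITER` is used here only as an illustrative second ClustalW flag.

```r
# Hypothetical helper (not part of rPlant): turn a named list of options
# into a single command-line style string for the 'args' input.
build_clustalw_args <- function(flags) {
  stopifnot(is.list(flags), !is.null(names(flags)), all(nzchar(names(flags))))
  # Each entry becomes "-NAME=value"; entries are joined with spaces.
  paste0("-", toupper(names(flags)), "=", unlist(flags), collapse = " ")
}

extra_args <- build_clustalw_args(list(iteration = "TREE", numiter = 3))
print(extra_args)
```

The resulting string could then be supplied as the args input in place of a hand-typed one.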

The name to give the output file

The name to give the job being submitted


Prints the curl statement that can be used in the terminal, if curl is installed on your computer


With iPlant you have the ability to share folders with other users. If someone has shared a folder with you and you want to run a job with them, enter their username for this input.


Turns off the warnings, which speeds up run time. Use with caution: if the inputs are incorrect, the errors will not be caught.


The only supported input file format is FASTA.
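Before uploading, a file can be given a quick local sanity check. The sketch below uses a hypothetical helper, `looks_like_fasta`, which is not part of rPlant and only tests the basic FASTA layout (a ">" header line followed by sequence lines); it does not validate the sequences themselves.

```r
# Hypothetical helper (not part of rPlant): TRUE when the first non-empty
# line is a ">" header and at least one non-header line is present.
looks_like_fasta <- function(path) {
  lines <- readLines(path, warn = FALSE)
  lines <- lines[nzchar(trimws(lines))]
  length(lines) >= 2 && startsWith(lines[1], ">") && any(!startsWith(lines, ">"))
}

# Write a tiny made-up FASTA file to a temporary location and check it.
tmp <- tempfile(fileext = ".fasta")
writeLines(c(">seq1", "ACGTACGT", ">seq2", "ACGTTCGT"), tmp)
print(looks_like_fasta(tmp))
```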

Additional arguments for args are listed in the ClustalW documentation. The args input is a single text string containing the flags and their values, written as they would be on the command line.

There are seven options for output files: CLUSTALW, FASTA, PHYLIP_INT, NEXUS, GCG, GDE, and PIR
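Since only these seven values are meaningful, a value can be checked locally before a job is submitted. The following is a minimal sketch using base R's `match.arg`; `check_aln_filetype` and `aln_filetypes` are hypothetical names, not part of rPlant, and the case-insensitive handling is a convenience added here.

```r
# The seven output formats listed above.
aln_filetypes <- c("CLUSTALW", "FASTA", "PHYLIP_INT", "NEXUS", "GCG", "GDE", "PIR")

# Hypothetical helper (not part of rPlant): return the matching format
# name, or stop with an error if the value is not one of the seven.
check_aln_filetype <- function(aln.filetype = "CLUSTALW") {
  match.arg(toupper(aln.filetype), aln_filetypes)
}

print(check_aln_filetype("phylip_int"))
```

An unsupported value such as "CSV" makes `match.arg` stop with an error that lists the valid choices.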

The result file is ALWAYS ‘clustalw2.fa’.


A list containing the job id and the job name is returned for each job submitted. If an error occurs, a message stating the error is also reported.

See Also

SubmitJob, Validate, UploadFile


## Not run: data(DNA.fasta)
## Not run: write.fasta(sequences = DNA.fasta, names = names(DNA.fasta), file.out = "DNA.fasta")
## Not run: Validate("username","password")
## Not run: UploadFile("DNA.fasta", filetype="FASTA-0")
## Not run: ClustalW("DNA.fasta","ClustalWPHY", aln.filetype="PHYLIP_INT")


rPlant documentation built on April 14, 2017, 6:03 p.m.
