itsx | R Documentation |
Calls the external program ITSx, which must be installed and on the path. For more information on installation, algorithms, or options, see http://microbiology.se/software/itsx/.
itsx(
in_file,
out_root = tempfile("itsx"),
taxon = "fungi",
e_value = 1e-05,
s_value = 0,
n_value = 2,
selection_priority = c("score", "sum", "domains", "eval"),
search_eval = 0.01,
search_score = NULL,
allow_single_domain = c(1e-09, 0),
allow_reorder = FALSE,
complement = TRUE,
cpu = 1,
multi_thread = cpu > 1,
heuristics = FALSE,
nhmmer = FALSE,
summary = TRUE,
graphical = TRUE,
fasta = TRUE,
preserve = FALSE,
save_regions = c("ITS1", "ITS2"),
anchor = 0,
require_anchor = 0,
only_full = FALSE,
partial = 0,
concat = FALSE,
minlen = 0,
positions = TRUE,
table = FALSE,
detailed_results = FALSE,
not_found = TRUE,
truncate = TRUE,
silent = FALSE,
graph_scale = 0,
save_raw = FALSE,
read_function = NULL
)
in_file |
Name of a fasta file to read sequences from. Alternatively, a member of classes ShortRead, DNAStringSet or SeqFastadna, in which case the sequences will be written to a temporary .fasta file. |
out_root |
Root name for all output files. The default will put these in a pseudorandomly generated file name in the R temp directory, which is probably not what you want if you intend to access them later. |
taxon |
The taxonomic group(s) to attempt to find rDNA from. See the manual for ITSx for available options. In contrast to the default for ITSx ("all") the default for this function is "fungi". |
e_value |
E-value cutoff for inclusion in results. |
s_value |
Score cutoff for inclusion in results. |
n_value |
Number of domains required for inclusion in results. |
selection_priority |
Priority for determining sequence origin. |
search_eval |
Actual E-value cutoff for HMMER search. Only one of search_eval and search_score may be supplied. |
search_score |
Actual score cutoff for HMMER search. Only one of search_eval and search_score may be supplied. |
allow_single_domain |
Allow inclusion of sequences where only one domain match is found. Either FALSE or a (double, integer) pair giving inclusion criteria for e_value and Score. |
allow_reorder |
Allow inclusion of sequences where the domains do not occur in the expected order. |
complement |
Search the reverse complement of each sequence also. |
cpu |
Number of threads to use. |
multi_thread |
Whether to use multiple threads or not. |
heuristics |
Use heuristic filtering to speed up HMMER search. |
nhmmer |
Use nhmmer instead of hmmsearch. |
summary |
Output summary results in |
graphical |
Output graphical results in |
fasta |
Output full ITS (ITS1 + 5.8S + ITS2) results in
|
preserve |
Use original sequence headers instead of writing new ones. |
save_regions |
Additional reagions to save output for; Options are one
or more of |
anchor |
Number of extra bases included at the beginning and end of each
region. Only one of |
require_anchor |
As |
only_full |
Limit output to full length ITS1 and ITS2 regions. |
partial |
Save additional files for partial regions. The argument give
the minimum number of bases required. These will be saved as
|
concat |
Output concatenated ITS1 and ITS2 regions in
|
minlen |
Minimum length for ITS regions to be included in |
positions |
Output a table of positions where HMMER matches were
detected in |
table |
Output a table of ITS results in |
detailed_results |
Output detailed results in
|
not_found |
Output a list of sequences for which no match was found in
|
truncate |
Remove ends of ITS sequences if they extend beyond the ITS region. |
silent |
Supress printing of information to screen. |
graph_scale |
Sets the scale of the graphical output. |
save_raw |
Save raw data from searches in directory
|
read_function |
A function which can be used to read a fasta format
file, such as |
For basic usage, this function only checks the arguments and gernerates the
function call. However, to simplify integration with R-based workflows, it
is also possible to specify the in_file
as one of several R classes
that hold DNA sequences , and by specifying a read_function
, the
output file(s) can be automatically read into R and returned as members of a
list
.
The return value of the ITSx program, or if read_function
is
given, a list containing the contents of all files which were created
(except the raw output).
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.