generateTPsCTS | R Documentation |
Uses Chemical Transformation Simulator (CTS) to predict TPs.
generateTPsCTS(
parents,
transLibrary,
generations = 1,
errorRetries = 3,
skipInvalid = TRUE,
prefCalcChemProps = TRUE,
neutralChemProps = FALSE,
neutralizeTPs = TRUE,
calcLogP = "rcdk",
calcSims = FALSE,
fpType = "extended",
fpSimMethod = "tanimoto",
parallel = TRUE
)
parents |
The parents for which transformation products should be obtained. This can be (1) a suspect list (see
suspect screening for more information), (2) the resulting output of
|
transLibrary |
A |
generations |
An |
errorRetries |
The maximum number of connection retries. Sets the |
skipInvalid |
If set to |
prefCalcChemProps |
If |
neutralChemProps |
If |
neutralizeTPs |
If |
calcLogP |
A |
calcSims |
If set to |
fpType |
The type of structural fingerprint that should be calculated. See the |
fpSimMethod |
The method for calculating similarities (i.e. not dissimilarity!). See the |
parallel |
If set to |
This function uses CTS to obtain transformation products. This function is called when calling generateTPs
with
algorithm="cts"
.
This function uses the httr package to access the Web API of CTS for automatic TP prediction. Hence, an Internet connection is mandatory. Please take care to not 'abuse' the CTS servers, e.g. by running very large batch calculations in parallel, as this may result in rejected connections.
An important advantage of this algorithm is that it provides structural information for generated TPs. However, this also means that if the input is from a parent suspect list or screening then either SMILES or InChI information must be available for the parents.
The TPs are stored in an object derived from the transformationProductsStructure
class.
Chemical properties such as SMILES, InChIKey and formula in the parent suspect list are automatically validated and calculated if missing/invalid.
The internal validation/calculation process performs the following steps:
Validation of SMILES, InChI, InChIKey and formula data (if present). Invalid
entries will be set to NA
.
If neutralChemProps=TRUE
then chemical data (SMILES, formulae etc.) is neutralized by
(de-)protonation (using the --neutralized
option of OpenBabel
). An additional column
molNeutralized
is added to mark those molecules that were neutralized. Note that neutralization requires
either SMILES or InChI data to be available.
The SMILES and InChI data are used to calculate missing or invalid SMILES,
InChI, InChIKey and formula data. If prefCalcChemProps=TRUE
then existing
InChIKey and formula data is overwritten by calculated values whenever possible.
The chemical formulae which were not calculated are verified and normalized. This process may be time
consuming, and is potentially largely avoided by setting prefCalcChemProps=TRUE
.
Neutral masses are calculated for missing values (prefCalcChemProps=FALSE
) or whenever possible
(prefCalcChemProps=TRUE
).
Note that calculation of formulae for molecules that are isotopically labelled is currently only supported for deuterium (2H) elements.
This functionality relies heavily on OpenBabel, please make sure it is installed.
When the parents
argument is a compounds
object, the candidate library identifier
is used in case the candidate has no defined compoundName
.
rcdk1
\insertRefOBoyle2011patRoon
\insertRefWolfe2016patRoon
\insertRefTebesStevens2017patRoon
\insertRefYuan2020patRoon
\insertRefYuan2021patRoon
\insertRefOBoyle2011patRoon
generateTPs
for more details and other algorithms.
The website: https://qed.epa.gov/cts/ and the CTS User guide.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.