Description Usage Arguments Details Value Warning Note Author(s) See Also Examples
Algorithm for grouping isotope pattern centroids of chemical components by querying quantized simulation data
1 2 3 |
peaklist |
Dataframe of HRMS peaks with three numeric columns for (a) m/z, (b) intensity and (c) retention time, such as |
quantiz |
Quantized simulation data of feasible centroid-centroid relations as provided by package |
mztol |
m/z tolerance setting: value by which the m/z of a peak may vary from its expected value. If parameter |
ppm |
Should |
inttol |
Intensity tolerance setting = fraction by which peak intensities may vary; e.g., if set to 0.2, a peak with expected intensity 10000 may range in between 8000 and 12000. |
rttol |
+/- retention time tolerance. Units as given in column 3 of |
use_isotopes |
Restrict query to certain isotopes dominating centroid relations; set to |
use_charges |
Vector of signed integers. Restrict query to certain charges z; set to |
use_marker |
Query for marker peaks, |
quick |
Continue if query finds first hit? Speeds up, but leaves resulting information on underlying isotopes incomplete. |
isotopes |
Dataframe of relevant isotopes as provided by package |
As alternative to rule-based pattern.search
, differences among measured centroids (peaklist
) are queried to match those of compressed (=quantized) simulation data within bounds
of measurement tolerances and the quantization distortion. Hence, in comparion to pattern.search
, this approach accounts for centroid mass shifts induced by peak profile
interferences prevalent at even high m/z resolution.
To derive the quantized data, isotope pattern centroids of several million organic molecular formulas from the PubChem database were calculated for various classes of adducts.
Molecular formulas were filtered to be unique and only to contain C, H, O, N, Cl, Br, K, Na, S, Si, F, P and/or I.
The resulting >250 million centroid pairs from individual patterns were then categorized for their dominant isotopologues, charge and
the possible presence of another centroid of higher intensity than that of the pair (=marker peak).
Within these categories, data on centroid pair (a) m/z, (b) m/z differences, (c) intensity ratios and (d) marker m/z was quantized by a
recursive partitioning procedure.
The resulting compressed data representation was extended by nearest neigbour estimates in the above dimensions (a) to (d) to
account for queries with molecular formulas possibly not present in the PubChem set.
Internally, the quantized simulation data is queried by a tree-like space-partitioning structure for hyperrectangles, while centroids from peaklist
are restructured into kd-trees.
List of type pattern with 12 entries
pattern[[1]] |
|
pattern[[2]] |
|
pattern[[3]] |
|
pattern[[4]] |
|
pattern[[5]] |
|
pattern[[6]] |
|
pattern[[7]] |
|
pattern[[8]] |
|
pattern[[9]] |
|
pattern[[10]] |
|
pattern[[11]] |
|
pattern[[12]] |
|
Acceptable outcomes strongly depend on appropriate parametrization of the algorithm and using the correct quantiz
data set from package nontargetData
.
Using overly large values for rttol
and/or mztol
may lead to slow execution.
Peak IDs refer to the order in which peaks are provided.
If you do not find quantized simulation data for your instrument in package nontargetData
and you can provide resolution=f(m/z) information: contact maintainer.
Martin Loos
rm.sat
peaklist
plotisotopes
plotdefect
combine
plotgroup
pattern.search
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 | ######################################################
# load HRMS centroid list: ###########################
data(peaklist)
# load isotope data ##################################
data(isotopes)
# load quantized simulation data #####################
data(OrbitrapXL_VelosPro_R60000at400_q)
######################################################
# run isotope pattern grouping #######################
# save the list returned as "pattern" ################
pattern<-pattern.search2(
peaklist,
OrbitrapXL_VelosPro_R60000at400_q,
mztol=2,
ppm=TRUE,
inttol=0.5,
rttol=0.3,
use_isotopes=FALSE,
use_charges=FALSE,
use_marker=TRUE,
quick=FALSE,
isotopes
)
names(pattern);
######################################################
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.