summarize_table: Creates a Summary Table of Mass-Spectrometry (MS) Data
In invitroTKstats: In Vitro Toxicokinetic Data Processing and Analysis Pipeline

summarize_table

R Documentation

Creates a Summary Table of Mass-Spectrometry (MS) Data

Description

This function creates and returns a list containing summary counts from the provided data frame containing mass-spectrometry (MS) data, MS calibration, chemical identifiers, and measurement type. The list includes the number of observations, unique chemicals, unique measurements in the input data table, and a vector of chemicals that have repeated observations. If a vector of data types is specified in the argument req.types, the function also checks if each chemical has observations for every measurement type included in the vector for each chemical-calibration pair. If it does, the chemical is said to have a complete data set. Otherwise, it has an incomplete data set. The number of complete and incomplete datasets, for each chemical, are returned in the output list. The input data frame can be level-1 (or level-2) Caco-2 data, ultracentrifugation (UC) data, rapid equilibrium dialysis (RED) data, or hepatocyte clearance (Clint) data. See the Details section for measurement type and annotation tables used in each assay.

Usage

summarize_table(
  input.table,
  dtxsid.col = "DTXSID",
  compound.col = "Compound.Name",
  cal.col = "Calibration",
  type.col = "Sample.Type",
  req.types = NULL,
  verbose = TRUE
)

Arguments

`input.table`	(Data Frame) A data frame (level-1 or level-2) containing mass-spectrometry peak areas, indication of chemical identity, and measurement type. The data frame should contain columns with names specified by the following arguments:
`dtxsid.col`	(Character) Column name of `input.table` containing EPA's DSSTox Structure ID (http://comptox.epa.gov/dashboard). (Defaults to "DTXSID".)
`compound.col`	(Character) Column name of `input.table` containing the test compound. (Defaults to "Compound.Name".)
`cal.col`	(Character) Column name of `input.table` containing the MS calibration. Calibration typically uses indices or dates to represent if the analyses were done on different machines on the same day or on different days with the same MS analyzer. (Defaults to "Calibration".)
`type.col`	(Character) Column name of `input.table` containing the sample type (see tables in Details). (Defaults to "Sample.Type".)
`req.types`	(Character Vector) A vector of character strings containing measurement types. If a vector is specified, each chemical-calibration pair will be checked if it has observations for all of the measurement types in the vector. (Defaults to `NULL`.)
`verbose`	(logical) Indicate whether printed statements should be shown. (Default is TRUE.)

Details

Sample types used in ultracentrifugation (UC) data collected for calculation of chemical fraction unbound in plasma (Fup) should be annotated as follows:

Calibration Curve	CC
Ultracentrifugation Aqueous Fraction	AF
Whole Plasma T1h Sample	T1
Whole Plasma T5h Sample	T5

Samples types used in rapid equilibrium dialysis (RED) data collected for calculation of chemical fraction unbound in plasma (Fup) should be annotated as follows:

No Plasma Blank (no chemical, no plasma)	NoPlasma.Blank
Plamsa Blank (no chemical, just plasma)	Plasma.Blank
Plasma well concentration	Plasma
Phosphate-buffered well concentration	PBS
Time zero plasma concentration	T0
Plasma stability sample	Stability
Acceptor compartment of the equilibrium evaluation	EC_acceptor
Donor compartment of the equilibrium evaluation (chemical spiked side)	EC_donor
Calibration Curve	CC

Sample types in hepatocyte clearance (Clint) data should be annotated as follows:

Blank	Blank
Hepatocyte incubation concentration	Cvst
Inactivated Hepatocytes	Inactive
Calibration Curve	CC

Samples types used in Caco-2 data to calculate membrane permeability should be annotated as follows:

Blank with no chemical added	Blank
Target concentration added to donor compartment at time 0 (C0)	D0
Donor compartment at end of experiment	D2
Receiver compartment at end of experiment	R2

Value

A list containing the summary counts from the input data table. The list includes the number of observations, the number of unique chemicals, the number of unique measurements, the number of chemicals with complete data sets, the number of chemicals with incomplete data sets, and the number of chemicals with repeated observations.

Author(s)

John Wambaugh

Examples


library(invitroTKstats)
# Smeltz et al. (2020) data:
##  Clint ##
summarize_table(
  input.table = invitroTKstats::clint_L2,
  req.types = c("Blank", "Cvst")
  )
## Fup RED ##
summarize_table(
  input.table = invitroTKstats::fup_red_L2,
  req.types=  c("Plasma", "PBS", "Plasma.Blank", "NoPlasma.Blank")
  )
## Fup UC ##
summarize_table(
  input.table = invitroTKstats::fup_uc_L2,
  req.types = c("CC", "T1", "T5", "AF")
  )
# Honda et al. () data:
## Caco2 ##
summarize_table(
  input.table = invitroTKstats::caco2_L2,
  req.types=c("Blank","D0","D2","R2")
  )

invitroTKstats documentation built on Aug. 23, 2025, 9:08 a.m.