inst/info.md

title: "IN-vivo reSPonsE Classification of Tumours (INSPECTumours)" output: html_document: number_section: true

IN-vivo reSPonsE Classification of Tumours (INSPECTumours)

This is a statistical tool used for classifying and analysing pre-clinical tumour responses.

1 Overview

2 User guide

2.1 Load data

2.1.1 Data requirements

New studies

Critical columns:

There should be a control group for each study_id.

Historical control data (highly recommended – this will increase the robustness of the analysis)

Critical columns:

2.1.2 Data quality Control (QC)

Automatic QC

Manually outlier removal

Data visualisation and QC are available in "QC New study" and "QC Control data" subsections.

Specific data points can be excluded from the analysis per study, when providing a reason:

2.2 Analysis

2.2.1 Tumour classification

By using a novel statistical method, which is explained in section "3. Statistical methods", each individual tumours are classified into one of the following categories: Non-responder, Modest responder, Stable responder and Regressing responder. Briefly, a statistical model is constructed for the control data, including the historical control data, to predict the normal growth profile. If a tumour falls within the prediction interval of the normal growth profile, the tumour is classified as "Non-responder", otherwise, it is classified as a responder. All responders are further classified into sub-categories based on the growth rate calculations.

Analysis setting

In order to run the analysis, please tick "QC check" box. When you click "Start", the analysis will start!

Results

2.2.2 Statistical analysis for drug efficacy(optional)

Additional statistical analysis to compare drug efficacy (treated vs. control) is available. The Bayesian ordered logistic regression is used for the analysis.

3 Statistical methods

3.1 Statistical model for the control data

To analyse the tumour volume data, summarising the time series as the growth rate [1, 2] is commonly used. The subsequent assessment of drug efficacy is then based on statistical tests comparing the mean growth rates between different groups. However, this group-level analysis is insufficient when heterogenous tumour growth is observed within a treatment group.

We have developed a novel method by building the statistical model for the “normal” tumour growth profile without treatment.

3.1.1 Two-stage non-linear model

A multilevel (or called “mixed-effect”) non-linear model is constructed for all control data, including both the control data from new studies and historical studies (if available).

$$log_{10}(y_{ijk}) \sim f(t_{ijk}) + \alpha_i + \beta_{ij} + e_{ijk},$$

with $y_{ijk}$ indicate the tumour volume that is measured on the $k$th day for the $j$th animal in the $i$th study and $t_{ijk}$ indicates the day of measurement. The random effect term $\alpha_i \sim N(0, \sigma_{study}^2)$ is used to evaluate the day-to-day variability; $\beta_{ij} \sim N(0, \sigma_{animal}^2)$ is used to evaluate the animal-to-animal variability and the error term is $e_{ijk} \sim N(0, \sigma^2).$

We use the continuous hinged function that was posted by Gelman A. (https://statmodeling.stat.columbia.edu/2017/05/19/continuous-hinge-function-bayesian-modeling/) to allow different growth rates between the early stage of tumour growth ("unstable" phase) and the later stage ("stable" stage). The function is shown as follows:

$$f(t_{ijk}) = a + b_{0} \times (t_{ijk} - t_{change}) + (b_{1} - b_{0}) \times \delta \times log(1 + \frac{exp(t_{ijk} - t_{change})}{\delta}),$$

where where $a$, $b_0$ and $b_1$ are common intercept, the slope for the unstable phase and the slope for the stable phase respectively. A parameter $\delta$ is used to control the smoothness of the curve as shown in Figure 3 and the parameter $t_{change}$ is the inflection time points to separate the unstable and stable phases. Mathematically, the slope changes at $t_{change}$. The design of studies may vary in practice. For example, the windows between the tumour implantation and first dose of compounds can be between 1 day to 7 days in different experiments. We observed that $t_{change}$ for different studies can be different accordingly sometimes. Therefore, a flexible Baysian model is used for the study-varying $t_{change}$ estimation.

3.1.2 Linear model

Alternatively, the simplified linear model can be used for the data. This will consider a unified slope through the study. The model equation is shown as follows.

$$log_{10}(y_{ijk}) \sim a + b \times t_{ijk} + \alpha_i + \beta_{ik} + e_{ijk},$$

with $y_{ijk}$ indicate the tumour volume that is measured on the $k$th day for the $j$th animal in the $i$th study and $t_{ijk}$ indicates the day of measurement. The random effect term $\alpha_i \sim N(0, \sigma^2_{study})$ is used to evaluate the study-to-study variability; $\beta_{ij} \sim N(0, \sigma^2_{animal})$ is used to evaluate the animal-to-animal variability and the error term is $e_{ijk} \sim N(0, \sigma^2)$.

3.2 Tumour classification

The 95% predication interval is derived from the model for the control data. The tumours that are higher than the lower bound of 95% prediction interval are considered as not significantly different from the “normal” tumour growth. For an individual tumour, if there are m consecutive measurements that are smaller than the lower bound of 95% prediction interval, the tumour is considered as “Responder”. Otherwise, the tumour is classified as “Non-responder”. In order to ensure the classification is reliable for the tumour over a period that is long enough, the selection of m depends on the schedule of measurements. If there are 3 or less measurements per week, we consider m = 3. Otherwise, m is equal to the number of measurements per week.

All responders are classified into sub-categories based on the tumour growth rates. It is common to assume tumours grow exponentially, and thus the linear model can be fitted for the log transformed tumour volumes over time:

$$log_{10}(y_k) = a + b \times t_k + e_k $$

where $y_k$ is the measured tumour volume on the day $t_k$, while $a$ is the intercept and $e_k \sim N(0, \sigma^2)$. The estimation of slope $b$ is defined as the tumour growth rate.

To improve the accuracy of the estimation of the growth rates, we will only use data between $t_{change}$ that is derived from the two-stage non-linear model to the “Cut-off day for responder classification”. If there is no input for “Cut-off day for responder classification”, the day for the end of study will be used. Based on the sign of the growth rate, the “Responder” tumours are further classified by the sign of the growth rate and two-sided T test. If the growth rate is not significantly different from 0, the tumour is classified as “Stable responder”. We consider a tumour with growth rate significantly larger than 0 as a “Modest responder”, which shows a response to the treatment although the tumour still growth slowly. A strong drug effect is concluded if the tumour growth rate is significantly smaller than 0 and the tumour is classified as “Regressing responder”.

3.3 Statistical analysis for drug efficacy

In order to compare the tumour responders from different treatment groups, a Bayesian ordered logistic model is used for the output from either a single study or multiple studies. This analysis convert the tumour classification into ordinal data (0 - non-responder, 1 - modest responder, 2 - stable responder, 3 regressing responder). We can use a latent variable $y^*$ to describe the tumour classification:

$$ Tumour.classification = \begin{cases} \textit{0 (non-responder), if} \; y^ \leq \mu_1 \newline \textit{1 (modest responder), if} \; \mu_1 < y^ \leq \mu_2 \newline \textit{2 (stable responder), if} \; \mu_2 < y^* \leq \mu_3 \newline \textit{3 (regressing responder) other}\newline \end{cases} $$

with $$y^{ijk} = b_j + \alpha_i + e{ijk}, $$ where $b_j$ indicate the mean of tumour classification for the j* th group; $\alpha_i \sim N(0, \sigma_{study}^2)$ is the study-to-study variability and $e_{ijk} \sim N(0, \sigma^2).$ From this model, we are able to compare the treatment group through the team $\mu_i$.

3.4 References

(1) Hather G. et al., (2014) Growth Rate Analysis and Efficient Experimental Design for Tumor Xenograft Studies: Supplementary Issue: Array Platform Modeling and Analysis (A). Cancer Informatics, 13, 65-72

(2) Karp N. et al. (2020) A multi-batch design to deliver robust estimates of efficacy and reduce animal use – a syngeneic tumour case study. Scientific Reports, 10, 6178



AstraZeneca/INSPECTumours documentation built on March 30, 2023, 12:30 p.m.