Main Content

This example shows how to perform noncompartmental analysis to calculate NCA parameters and
estimate the tumor growth model [1] parameters from experimental
data using nonlinear regression in the **SimBiology Model Analyzer**
app.

The model used in this example is a SimBiology^{®} implementation of the pharmacokinetic/pharmacodynamic (PK/PD) model by Simeoni
et al. It quantifies the effect of anticancer drugs on tumor growth kinetics from
*in vivo* animal studies. The drug pharmacokinetics are described by
a two-compartment model with IV bolus dosing and linear elimination (*ke*)
from the *Central* compartment. Tumor growth is a biphasic process with an
initial exponential growth followed by linear growth. The growth rate of the proliferating
tumor cells is described by

$$\frac{{L}_{0}*{x}_{1}}{{\left[1+{\left(\frac{{L}_{0}}{{L}_{1}}*w\right)}^{\psi}\right]}^{\raisebox{1ex}{$1$}\!\left/ \!\raisebox{-1ex}{$\psi $}\right.}}$$

*L _{0}*,

*w* =
*x*_{1}

. In the presence of an anticancer
agent, a fraction of the proliferating cells is transformed into nonproliferating cells. The
rate of this transformation is assumed to be a function of the drug concentration in the
plasma and an efficacy factor The SimBiology model makes these adjustments to the pharmacodynamics of tumor growth:

Instead of defining the tumor weight as the sum of

*x1*,*x2*,*x3*, and*x4*, the model defines the tumor weight by the reaction named*Increase*,`null → tumor_weight`

, with the reaction rate $$\left(\raisebox{1ex}{$2*{L}_{1}*{L}_{0}*{x}_{1}{}^{2}$}\!\left/ \!\raisebox{-1ex}{${L}_{1}+2*{L}_{0}*{x}_{1}$}\right.\right)*tumor\_weight$$.*tumor_weight*is the total tumor weight,*x*is the weight of the proliferating tumor cells, and_{1}*L*, and_{0}*L*are tumor growth parameters._{1}Similarly, the model defines the decrease in tumor weight by the reaction named

*Decay*,`tumor_weight → null`

, with the reaction rate

. The constant*k**_{1}*x*_{4}*k*is the forward rate parameter, and_{1}*x*is the last species in the series of transit reductions in tumor weight._{4}*ke*is a function of the clearance and the volume of the central compartment:`ke = Cl_Central/Central`

.

The experimental (synthetic) data contains measurements from eight patients for three responses: measured drug concentrations in the central compartment, in the peripheral compartment, and measured tumor weight. The data also contains the dosing information, and each patient receives an IV dose at day 7.

The data set contains the following columns.

*ID*— Patient IDs*Time*— Times when measurements are taken*CentralConc*— Drug concentration in the central compartment*PeripheralConc*— Drug concentration in the peripheral compartment*Dose*— Dosing information for each patient

`NaN`

values are used whenever there is no measurement or no dose is
given.

Open the

**SimBiology Model Analyzer**app by typing`simBiologyModelAnalyzer`

at the command line or by clicking the app icon on the**Apps**tab.On the

**Home**tab of the app, select**Open**.Navigate to the folder

.*matlabroot*\examples\simbio\data\*matlabroot*is the folder where you have installed MATLAB. Select the project file named`tumor_growth_fitPKPD.sbproj`

. In the**Browser**pane, the**Models**folder contains the`Tumor Growth Model`

and the**Data1**folder contains the experimental data along with the dosing information.Classify the data columns so that such variable classifications can be used by the

**Fit**program later in the example. The app performs automatic classifications as appropriate (such as the*ID*,*Time*,*Dose*columns). But for the measured response data columns such as*CentralConc*, you need to manually classify them as the dependent variables. To do so, first open the data sheet as follows. In the**Browser**pane, expand the**Data1**folder and double-click**Datasheet1**.In the

**Data1**table, double-click**Classification**under*CentralConc*. Select`dependent`

. Repeat the same process for*PeripheralConc*and*TumorWeight*. Now all the data columns have proper classifications and the data is ready for use.**Note**The app has automatically classified:

The

*ID*column as**group**(a grouping variable).The

*Time*column as**independent**(an independent variable).The

*Dose*column as**dose1**(a dosing variable). If there are more than one dose columns, they can be classified as**dose2**,**dose3**, and so on.

After you load the data, you can visualize the measured responses.

In the

**Browser**pane, click**Data1**.On the

**Home**tab, in the**Plot**section, click the**time**plot. The app generates a time plot of all three responses, namely:*CentralConc*,*PeripheralConc*, and*TumorWeight*.In the default time plot,

**Responses**correspond to the measured responses and are plotted using different line styles.**Scenarios**refers to different groups (eight patients) in the data and are plotted using different colors.**Tip**Plots are backed by data that are currently present in the app workspace. Plots are not snapshots. When the data (either experimental data or simulation results) is removed or changed, the plots are also updated according to the changes in the underlying data.

The steps in this section are optional and are not necessary for fitting. You can
customize the plot to make it clearer. For example, you can plot the PD data
(*TumorWeight*) on a different axis than the PK data
(*CentralConc* and *PeripheralConc*). To do so, create two
different groups (*sets*) of responses, where the first set contains only
*TumorWeight* and the second set contains *CentralConc* and
*PeripheralConc*.

Right-click

**TumorWeight (gram)**in the**Responses**table and select`Create New Set`

. The app creates**Set 1**and**Set 2**.**Set 1**contains only*TumorWeight*, which is now plotted on a different axis than**Set 2**, which contains*CentralConc*and*PeripheralConc*.The

**Visual Channels**table now contains**Sets**. This table is a summary table of all slicing variables that are currently present in the plot and their corresponding plot styles. In this current plot, the slicing variables are**Sets**,**Responses**, and**Scenarios**.**Tip**You can slice data using different slicing variables. Each slicing variable appears in the plot with a different visual

*style*(or channel) such as color, line style, and axes position. Slicing variables can represent attributes of data, such as responses or scenarios (that is, groups or simulation runs). Slicing variables can also be covariates or parameter values associated with a scenario or group. By default, the app provides slicing variables for different response variables and different scenarios in the plotted data. You can add other visual styles (or channels) for sets of responses and associated parameter or covariate variables.

You can also group the responses based on different dose amounts that the patients receive. There are three different dose groups: 30, 75, and 150 mg.

In the

**Visual Channels**table, at the**Dose**row, double-click the empty cell and select`Color`

. A red indicator appears because another slicing variable (**Scenarios**) has the same plot style. Clear the style (visual channel) for**Scenarios**by selecting empty.In the

**Dose**table, the app has automatically binned the dose amounts. Set**Number of Bins**to`3`

. You can now see that the dose amount has an impact on the tumor size. The higher the dose is, the smaller the tumor becomes.You can also query the corresponding dose group from each line by showing its data tip. Press

**Ctrl**and click a blue line to display its data tip. To remove it,**Ctrl****+****Click**again anywhere on the same line.

Using the drug pharmacokinetic data, you can estimate NCA parameters. NCA is model agnostic and can give insights into the drug pharmacokinetics without any underlying assumptions. You can use some of the NCA results as initial estimates when calibrating the model to the data, as discussed later in this example. For details on the list of available NCA parameters and their formulas, see Noncompartmental Analysis.

On the

**Home**tab, select**Program****>****Non-Compartmental Analysis**. A new program (**Program1**) appears.The

**Data***setup*step of the program defines the data set to use for the NCA analysis. In this example, the program automatically selects**Data1**.The

**NCA***execution*step defines the data column associations and algorithm details. In the**Definition**table, set**Concentration**to`CentralConc`

. Leave the other settings unchanged.On the

**Home**tab, click**Run**.

Once the NCA analysis is complete, the app opens a new datasheet containing the results.

The program also saves the results in the **LastRun** folder of the
program by default. To access the results, in the **Browser** pane, expand the
**Program1** folder. Then expand **LastRun** folder. The NCA
results are stored in the table named **results**. For details about
calculated NCA parameters, see Noncompartmental Analysis.

You can export the NCA results to the MATLAB^{®} workspace and perform further data analyses at the command line.

Right-click

**results**. Select**Export Data to MATLAB Workspace**.The

**SimBiology Data Export**dialog box opens. Change the name of the variable to**ncatable**. Click**OK**.

After you export the data to the MATLAB workspace, you can analyze the data at the command-line. For instance, you can calculate the average drug clearance from the NCA data and use it as the model parameter value.

SimBiology provides different regression techniques to estimate model parameters based on
experimental data. This example details the steps for using the nonlinear regression method
`lsqnonlin`

(requires Optimization Toolbox™) to fit the model to the data. If you do not have Optimization Toolbox, the app uses `fminsearch`

instead. For the purposes of the
example, only some of the PK/PD model parameters are estimated, namely: *k1*,
*L0*, *L1*, *Cl_Central*,
*k12*, and *k21*.

From the

**Home**tab, select**Program****>****Fit Data**. A new program (**Program2**) appears on a new tab. The**Data**and**Model**steps have been prepopulated with**Data1**and**Tumor Growth Model**, respectively.By default, the

**Fit**step autogenerates plots after the fitting is complete. Disable the plot generation by clicking the plot icon at the top of the**Fit**program step for now. The plots will be explored later in the example.In the

**Data Map**table, define the mapping between the model components and the data columns from the input data.The

**group**row identifies which column in the data is a grouping variable, such as patient IDs.The

**independent**row identifies which column in the data is an independent variable, such as time.The

**response**row identifies which response or measurement data column corresponds to which model component. If there are multiple response data, you can add more response rows by clicking the**Response**button at the bottom of the**Data Map**table. To delete a response from the table, right-click and select**Delete**.The

**dose from data**row defines which column in the data maps to which model component as a dose target. If there are multiple dose columns, you can add more rows by clicking the**Dose**button.The

**variant from data**row defines which column in the data contains alternative parameter values for which model component. Click the**Variant**button to add more.

**Note**In this example, the app uses the classification information from the data sheet of the input data and maps the

*ID*column as the grouping variable (defined by the**group**row in the table), and*Time*column as an independent variable (defined by the**independent**row in the table). It has also identified*CentralConc*,*PeripheralConc*, and*TumorWeight*as response columns.In the first response row, next to

*CentralConc*, double-click the cell**Component**, and enter*Central.Drug*as the corresponding model component for that measurement data column.Similarly, map the

*PeripheralConc*column to*Peripheral.Drug*.Map

*TumorWeight*to*[Tumor Growth Model].tumor_weight*.Map the

*Dose*column to*Central.Drug*to indicate that the*Drug*species in the*Central*compartment is being dosed.

Define the model parameters to estimate in the

**Estimated Parameters**table. Double-click the empty cell in the**Estimated Parameters**column and type*k1*. The app shows model components with matching names. Select`k1`

from the list.By default, the parameter is log-transformed as indicated by the transformation

**log**. You can change the transformation to no transformation`none`

,`probit`

, or`logit`

transformation. For details, see Parameter Transformations. For this example, keep the default log transform because it often improves the convergence. The**Initial Untransformed Value**is automatically set to the model value which is 0.5.Enforce the biological parameters to stay positive by specifying the

**Untransformed Lower Bound**and**Untransformed Upper Bound**as`1e-5`

and`10`

, respectively.Similarly, add the following parameters:

*Cl_Central*,*L0*,*L1*,*k12*, and*k21*.Select

**Pooled fit**to estimate one set of parameters for all patients (population fit). If you do not select**Pooled fit**, the app estimates one set of parameters for each patient (individual fit).The default error model is the constant error model. SimBiology supports constant, proportional, exponential, and combined error models. For details, see Error Models. For now, use the constant error model.

Keep the rest of the fitting settings unchanged. These settings are

Estimation Method — The default method is

`lsqnonlin`

if you have Optimization Toolbox. If you do not, the app uses`fminsearch`

.For more information, see Supported Methods for Parameter Estimation in SimBiology.

Algorithm Settings — Most common options for the estimation method. Click to expand the section and see the options. To see the description of each option, click the info icon to the right of the header.

Advanced Algorithm Settings — Advanced settings for the estimation method. The table is empty by default.

After you set the fitting options, you can run the **Fit** step.

At the top of the

**Fit**step, click the**Run this program step**button.By default, the

**Fit**step shows the progress of parameter estimation in a separate figure. The progress plot shows the live status of parameter estimation and fitting quality measures such as log likelihood. For details, see Progress Plot.The progress plot shows that the fit converged. You can close the progress plot.

If you are using

`fminsearch`

, the fit can fail to converge due to reaching the maximum number of iterations. You can increase**MaxIter**in the**Algorithm Settings**, but for the purposes of this example, you can continue completing the steps without doing so.

Once the parameter estimation is complete, the fit results are shown in a new data sheet. The datasheet contains parameter estimates and other information related to fit quality measures, such as AIC and BIC, which can be useful to compare the performance of different error models.

In addition to quality statistics, you can also view various fit plots, such as actual versus predicted plots and residual distribution plots.

In the

**Browser**pane, expand**Program2**>**LastRun**, which contains**results**and**simdataI**.**results**contains estimated parameter values and fit statistics.**simdataI**contains the simulated model responses for each individual (patient or group) using the estimated parameter values.Click

**results**. Available fit plots are automatically listed in the**Plot**section on the**Home**tab. Then select**Act vs Pred**from the list.The actual versus predicted plot appears on a separate tab. The predicted responses are plotted on the

*x*-axis and the observed (experimental) responses are plotted on the*y*-axis.You can change the plot to other supported plots by selecting one of the plots from the

**Style**section in the**Property Editor**. If you want the new plot on its own separate tab and you do not want to reuse the existing plot tab, select the plot from the**Plot**section on the**Home**tab.Change the plot to a residual distribution plot by selecting

**Res Dist**in the**Style**section.The plot shows whether the residuals for each response are normally distributed. In an ideal normal probability plot of residuals, the residuals line up along the diagonal line across the plot and the histograms indicate a normal fit. However, from the plot, the residuals for all three responses, especially

*CentralConc*and*PeripheralConc*, do not seem to be normally distributed. It could be an indication that the constant error model assumption is incorrect.

The following steps show how to change the error model to an exponential error model to fit the data again and compare the fit statistics of two different error models.

**Save Fit Results. **Before fitting the data again using the exponential error model, save the constant error
model result in a separate folder. Otherwise, the program, by default, overwrites results from
the **LastRun** folder every time you run the fit.

Right-click the

**LastRun**folder of the fit program in the**Browser**pane.Select

`Save Data`

.In the

**Save Data**dialog, enter`fit_constant`

as the data name.

**Rerun Fit With Exponential Error Model. **After you save the data, you can rerun the fit program with a different error
model.

Return to the fit program by clicking the

**Program2**tab. In the**Error Model**section, select**exponential**.At the top of the

**Fit**step, click the**Run this program step**button.Close the progress plot after the fit completes.

If you closed the previous datasheet (

**Datasheet3**) that contains the fit statistics from the previous fit, reopen the datasheet. To do so, in the**Browser**pane, expand**Program2**>**fit_constant**. Then double-click**Datasheet3**.From the

**LastRun**folder, drag**results**onto**Datasheet3**. New columns (**Program2_LastRun**) containing the latest fit results are added next to the previous fit results (**Program2_fit_constant**).The

**Statistics**table compares the fit quality measures. From the comparison, both the AIC and BIC of the fit using the exponential error model are smaller than those of the previous fit. This indicates that the exponential error model fits the data better than the constant error model. The larger log likelihood of the exponential error model also indicates it is a better fit.Next, look at the residual distribution plot. Click

**results**from the**LastRun**folder. Then click**Residual Dist**from the**Plot**section on the**Home**tab.Compared to the residual distributions of the constant error model, the residual distributions from the exponential error model look more normal, indicating that the exponential error model fits the data better.

Another way to assess the quality of fit results is to compute 95% confidence intervals for the estimated parameters and model predictions — that is, model simulation results using the estimated parameters. This step requires Statistics and Machine Learning Toolbox™.

Return to the fit program. Click the (+) icon at the upper left and select

`Confidence Interval`

. A**Confidence Interval**step appears following the**Fit**step.At the top of the

**Confidence Interval**step, disable the autogeneration of plots by clicking the plot icon. For both**Parameter Confidence Intervals**and**Prediction Confidence Intervals**, use the default method`gaussian`

and`95%`

confidence level. Click the**Run this program step**button to compute confidence intervals.For parameter confidence intervals, the supported methods are gaussian, profileLikelihood, and bootstrap.

For prediction confidence intervals, the supported methods are gaussian and bootstrap.

Once completed, the results are stored as

*parameterCI*and*predictionCI*in the**LastRun**folder of the program.*parameterCI*contains 95% confidence intervals for the estimated parameters.*predictionCI*contains 95% confidence intervals for the model predictions.Plot 95% confidence intervals for the estimated parameters. Click

*parameterCI*in the**Browser**pane and select**Confidence**in the**Plot**section.The confidence interval for each estimated parameter is shown in a new plot. The plot indicates the successful computation of the confidence intervals for all estimated parameters.

Depending on the outcome (status) of the confidence interval estimation, the app plots the results differently.

If the status of confidence interval estimation is a

*success*(as in the above plot), the app uses the first default color (blue) to plot a line and a centered dot for every parameter estimate. The app also plots a box to indicate the confidence intervals.If the status is

*constrained*or*estimable*, the app uses the second default color (red) and plots a line, centered dot, and box to indicate the confidence intervals.If the status is

*not estimable*, the app plots only a line and a centered cross in red.If there are any transformed parameters with estimated values that are 0 (for the

`log`

transform) and 0 or 1 (for the`probit`

or`logit`

transform), no confidence intervals are plotted for those parameter estimates.

For more details on the definitions of different statuses, see Parameter Confidence Interval Estimation Status.

You can also change the

**Layout**of the plot in**Plot Settings**.The

`'split'`

layout displays the confidence interval for each parameter estimate on a separate axis.The

`'grouped'`

layout displays all confidence intervals on one axis, grouped by parameter estimates. Each estimated parameter is separated by a vertical black line.

In both cases, the parameter bounds defined in the original fit are marked by square brackets. The app uses vertical dotted lines to group confidence intervals of parameter estimates that have been computed in a common fit.

Similarly, plot 95% confidence intervals for the model predictions. Click

*predictionCI*in the**Browser**pane and select**Confidence**in the**Plot**section.In this plot, the confidence interval for each group is plotted in a separate column, and each response is plotted in a separate row. The plot indicates the successful computation of the confidence intervals.

The plotting behavior differs depending on the outcome (status) of the confidence interval calculation.

If the status is

*constrained*or*not estimable*, the app uses the second default color (red) to plot the confidence intervals.Otherwise, the app uses the first default color (blue) and plots the confidence intervals as shaded areas (as in the above plot).

For details, see Gaussian Confidence Interval Calculation for Model Predictions and Bootstrap Confidence Interval Calculation.

The percentile plot lets you visualize simulation results and statistics that can be overlaid with experimental data. For example, you can plot 5th and 95th percentile curves of simulation data over time instead of seeing the individual time plots. You can also visualize the mean, standard deviation, minimum, and maximum of simulation and experimental data. For details, see Percentile Plot.

**Note**

If you have not completed the prior steps that generated the required results to continue, you can load the completed project instead.

Open the SimBiology Model Analyzer app.

Click

**Open**and navigate to the folder

.*matlabroot*\examples\simbio\data\*matlabroot*is the folder where you have installed MATLAB. Select the project file named`tumor_growth_fitPKPD_completed.sbproj`

.

In the

**LastRun**folder of the fit program, select**simDataI**>**[Tumor Growth Model].tumor_weight**.On the

**Home**tab, click**percentile**.The percentile plot shows the 5th and 95th percentile curves by default.

Select

**Data1**>**TumorWeight**. Drag and drop it on to the plot.By default, the

**Display**type of the experimental data is automatically set to**Mean**, which shows the mean measurement at each time with ±1 standard deviation. If you want to visualize the original data points, double-click**Mean**and select**Raw Data**. For this example, keep the**Mean**display.In the

**Percentiles**section, for the**Show percentiles (%)**option, enter`10,80`

to show the 10th and 80th percentiles.Change

**Display style**to`Both lines and shading`

.In the

**Mean**section, show the minimum and maximum response data points at each time by setting**Show min/max**to`true`

. Also show all the underlying raw data points by changing**Show raw data fraction (%)**to`100`

. You can also enter a custom percentage number.The percentile plot is now shown as follows. The asterisks (

`*`

) represent the calculated minimum and maximum values after interpolation, and the dots (`.`

) represent original data points. Because these minimum and maximum values are computed based on an interpolated common time vector, the values do not match exactly with those of raw data. For details, see Mean Options.

After parameter estimation, you can set the model values to the parameter estimates and perform other analyses. For instance, you can find out important model parameters using sensitivity analysis and vary the sensitive parameters to investigate model variability by using virtual patients.

[1] Simeoni, Monica, Paolo Magni, Cristiano Cammia, Giuseppe De Nicolao, Valter Croci, Enrico Pesenti, Massimiliano Germani, Italo Poggesi, and Maurizio Rocchetti. “Predictive Pharmacokinetic-Pharmacodynamic Modeling of Tumor Growth Kinetics in Xenograft Models after Administration of Anticancer Agents.” *Cancer Research* 64, no. 3 (February 1, 2004): 1094–1101.