sequential_experiment: Design and Run Sequential Computer Experiment


View source: R/sequential_experiment.R

Description

This function implements a sequential version of the method by Weaver et al. (2016) that uses Gaussian process (GP) optimization to estimate an optimal design for a stochastic design criterion. The function design_experiment is used repeatedly to design each stage of the experiment. After the design for each stage is estimated, an inexpensive simulator is run to collect data on the design. New posterior samples are drawn, and the process is repeated. Validation of the fitted GP models is provided by the statistics described in Bastos and O'Hagan (2009).

Usage

sequential_experiment(
  criterion,
  sampler,
  lower_bound,
  upper_bound,
  stochastic,
  simulator,
  init_design = NULL,
  init_response = NULL,
  design_budget = 10,
  batch = 1,
  init_budget = 10,
  optim_budget = 10,
  gp_options = list(formula = ~1, kernel = "matern5_2", optimizer = "gen"),
  genoud_options = list(pop.size = 1000, max.generations = 100, wait.generations = 10),
  diagnostics = 1,
  verbose = 1,
  max_augment = 10,
  cluster = NULL
)

Arguments

criterion

A function with vector input of length d (see details).

sampler

A function that samples from the posterior distribution given the current observations (see details).

lower_bound

A vector of length d.

upper_bound

A vector of length d.

stochastic

Is the design criterion stochastic or deterministic (see details)?

simulator

Computer simulator being explored.

init_design

A matrix of design points at which data have been collected.

init_response

A matrix of responses that have been collected.

design_budget

Number of sequential experiments to perform.

batch

Number of design points per experiment stage; batch = 1 is fully sequential (see details).

init_budget

An integer defining the size of the initial training dataset and the size of the validation dataset for the GP model.

optim_budget

An integer defining the number of GP optimization iterations.

gp_options

A list specifying the type of GP model to fit (see km).

genoud_options

A list specifying the control options for the optimizer (see genoud).

diagnostics

Type of GP diagnostics to perform before optimization occurs. There are currently three options: 0 (none); 1 (automatic), a simple Mahalanobis distance significance test; 2 (user inspected), execution is paused for visual inspection of pivoted-Cholesky residuals and QQ-plots.

verbose

Print extra output during execution?

max_augment

An integer defining the maximum number of design augmentations before terminating GP fitting.

cluster

A parallel cluster object.

Details

The design criterion (DC) is a stochastic or deterministic univariate function that measures the quality of a proposed design. GADGET assumes the design criterion must be minimized. For example, instead of maximizing the determinant of the Fisher-information matrix, GADGET would minimize the negative determinant of the Fisher-information matrix. If the DC is stochastic, then the GP model is fit with a nugget effect and expected quantile improvement (EQI) is used to perform the GP optimization. The optimal design is taken to be the design that maximizes EQI on the final optimization iteration. If the DC is deterministic, then the GP model is fit without a nugget effect and expected improvement (EI) is used to perform the optimization. The optimal design is taken to be the design with the smallest observed DC over all evaluations of the DC.
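The sign convention above can be illustrated with a minimal base-R sketch. The function name neg_det_information and the straight-line model are hypothetical choices made for illustration only; they are not part of GADGET. The unused theta argument is included only because the criterion is described as taking a posterior sample as its second argument.

```r
# Hypothetical criterion for a straight-line model y = b0 + b1 * x.
# x is the vectorized design: one input location per element.
neg_det_information <- function(x, theta = NULL) {
  X <- cbind(1, x)        # model matrix: intercept and slope columns
  info <- crossprod(X)    # X'X, proportional to the Fisher information
  -det(info)              # negate so that a better design scores lower
}

spread    <- neg_det_information(c(-1, 1))   # points at the extremes
clustered <- neg_det_information(c(0, 0.1))  # points close together
spread < clustered                           # the spread design is preferred
```

Because the determinant is negated, the more informative (spread-out) design receives the smaller criterion value, which is the direction GADGET is described as optimizing.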

GADGET represents each design as a vector of length d. The user-supplied DC function must translate this vector into the appropriate form for computing the DC. The upper_bound and lower_bound arguments define the bounds of each element in the vectorized design.
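As a rough sketch of the translation step, suppose a design consists of 3 runs on 2 factors, so that d = 6. The helper as_design_matrix, the row-wise layout, and the first-order model are assumptions chosen for illustration; GADGET itself only sees the length-6 vector and the matching bounds.

```r
n_runs    <- 3   # assumed number of runs in the design
n_factors <- 2   # assumed number of factors per run

# Hypothetical helper: reshape the length-d vector into a runs-by-factors matrix.
as_design_matrix <- function(x) {
  matrix(x, nrow = n_runs, ncol = n_factors, byrow = TRUE)
}

# Hypothetical criterion that works on the reshaped design.
dc_vectorized <- function(x, theta = NULL) {
  D <- as_design_matrix(x)
  X <- cbind(1, D)          # first-order model matrix with intercept
  -det(crossprod(X))        # smaller is better
}

# One bound per element of the vectorized design.
lower_bound <- rep(-1, n_runs * n_factors)
upper_bound <- rep( 1, n_runs * n_factors)

dc_vectorized(c(-1, -1,  1, -1,  -1, 1))
```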

The batch argument allows more than one design point to be optimized in a single step of GADGET. To use this feature, the design criterion must be able to accept multiple design points stacked in a matrix, with each row being a single design point.
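A criterion that tolerates both a single design point and a stacked batch might look like the following sketch. The function dc_batch and its scoring rule (squared distance from the posterior mean) are hypothetical; exactly how GADGET passes the batch to the criterion is assumed here from the description above, not verified against the package source.

```r
# Hypothetical batch-aware criterion.
# x: either a single design point (vector) or a matrix with one point per row.
# theta: posterior draws of a scalar parameter.
dc_batch <- function(x, theta) {
  pts <- if (is.matrix(x)) x else matrix(x, nrow = 1)  # coerce a vector to one row
  per_point <- rowSums((pts - mean(theta))^2)          # score each row separately
  sum(per_point)                                       # univariate value for the batch
}

theta_draws <- c(0, 0)
dc_batch(c(1, 2), theta_draws)                      # single design point
dc_batch(rbind(c(1, 2), c(3, 4)), theta_draws)      # batch of two design points
```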

The function sampler must accept the currently observed design and response (including the initial design and response) and return posterior samples of the simulator's parameters. The design criterion also requires a second argument accepting the posterior sample so that the utility function is computed with respect to the current posterior distribution.
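The pairing of sampler and criterion can be sketched with a conjugate normal model. Everything here is an assumption for illustration: the names post_sampler and dc_post, the prior and noise variances, and the extra n_draws argument are hypothetical, and only the (design, response) and (x, theta) argument orders come from the description above.

```r
# Hypothetical sampler: normal prior on a scalar mean, known noise variance.
post_sampler <- function(design, response, n_draws = 1000) {
  y <- as.numeric(response)
  n <- length(y)
  prior_mean <- 0; prior_var <- 10; noise_var <- 1     # assumed values
  post_var  <- 1 / (1 / prior_var + n / noise_var)     # conjugate update
  post_mean <- post_var * (prior_mean / prior_var + sum(y) / noise_var)
  rnorm(n_draws, mean = post_mean, sd = sqrt(post_var))
}

# Hypothetical criterion: average squared distance from the design to the
# parameter, taken over the current posterior draws.
dc_post <- function(x, theta) {
  mean(sapply(theta, function(t) sum((x - t)^2)))
}

set.seed(1)
draws <- post_sampler(design = matrix(0, 50, 1), response = rep(2, 50))
dc_post(2, draws) < dc_post(0, draws)   # designs near the posterior mass score lower
```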

Examples

#--- Synthetic Design Problem ---#

# demonstration design criterion: takes the design x and posterior draws theta
dc <- function(x, theta) {sum(x^2) + rnorm(1, 0.1)}
# demonstration posterior sampler: takes the observed design and response
post <- function(design, response) {rnorm(1000)}
# demonstration simulator
sim <- function(x) {x}
my_result <- sequential_experiment(criterion     = dc,
                                   stochastic    = TRUE,
                                   sampler       = post,
                                   lower_bound   = -3,
                                   upper_bound   =  3,
                                   simulator     = sim,
                                   design_budget = 2,
                                   optim_budget  = 1,
                                   batch         = 2)

GADGET documentation built on Jan. 25, 2020, 1:06 a.m.