queso

Markov Chain Monte Carlo algorithms from the QUESO package

Topics

bayesian_calibration, package_queso

Specification

  • Alias: None

  • Arguments: None

Child Keywords:

Required/Optional

Description of Group

Dakota Keyword

Dakota Keyword Description

Required

chain_samples

Number of Markov Chain Monte Carlo posterior samples

Optional

seed

Seed of the random number generator

Optional

rng

Selection of a random number generator

Optional

emulator

Use an emulator or surrogate model to evaluate the likelihood function

Optional

standardized_space

Perform Bayesian inference in standardized probability space

Optional

logit_transform

Utilize the logit transformation to reduce sample rejection for bounded domains

Optional

export_chain_points_file

Export the MCMC chain to the specified filename

Optional (Choose One)

MCMC Algorithm

dram

Use the DRAM MCMC algorithm

delayed_rejection

Use the Delayed Rejection MCMC algorithm

adaptive_metropolis

Use the Adaptive Metropolis MCMC algorithm

metropolis_hastings

Use the Metropolis-Hastings MCMC algorithm

multilevel

Use the multilevel MCMC algorithm.

Optional

pre_solve

Perform deterministic optimization for MAP before Bayesian calibration

Optional

proposal_covariance

Defines the technique used to generate the MCMC proposal covariance.

Optional

options_file

File containing advanced QUESO options

Description

The queso method supports the following MCMC algorithms: DRAM (Delayed Rejection Adaptive Metropolis), delayed rejection (DR) only, adaptive metropolis (AM) only, pure Metropolis Hasting (MH)s, and multilevel (ML).

When calibrating fast-running simulation models, use of an emulator is not typically warranted. For slower models, using an emulator model in the MCMC sampling will greatly improve the speed, since the Monte Carlo Markov Chain will generate thousands of samples on the emulator instead of the real simulation code. An emulator may be specified with the keyword emulator, followed by a gaussian_process emulator, a pce emulator (polynomial chaos expansion), or a sc emulator (stochastic collocation). For the gaussian_process emulator, the user must specify whether to use the surfpack or dakota version of the Gaussian process. The user can define the number of samples build_samples from which the emulator should be built. It is also possible to build the Gaussian process from points read in from the import_points_file and to export approximation-based sample evaluations using export_points_file. For pce or sc, the user can define a sparse_grid_level.

There are a variety of ways the user can specify the proposal covariance matrix which is very important in governing the samples generated in the chain. The proposal covariance specifies the covariance structure of a multivariate normal distribution. The user can specify proposal_covariance, followed by derivatives, prior, values, or filename. The derivative specification involves forming the Hessian of the misfit function (the negative log likelihood). When derivative information is available inexpensively (e.g. from an emulator), the derived-based proposal covariance forms a more accurate proposal distribution, resulting in lower rejection rates and faster chain mixing. The prior setting involves constructing the proposal from the variance of the prior distributions of the parameters being calibrated. When specifying the proposal covariance with values or from a file, the user can choose to specify only the diagonals of the covariance matrix with diagonal or to specify the full covariance matrix with matrix.

There are two other controls for QUESO. The pre_solve option enables the user to start the chain at an optimal point, the Maximum A Posteriori (MAP) point. This is the point in parameter space that maximizes the log posterior, (defined as the log-likelihood minus the log_prior). A deterministic optimization method is used to obtain the MAP point, and the MCMC chain is then started at the best point found in the optimization. The second factor is a logit_transform, which performs an internal variable transformation from bounded domains to unbounded domains in order to reduce sample rejection due to an out-of-bounds condition.

Note that as of Dakota 6.2, the field data capability may be used with QUESO. That is, the user can specify field simulation data and field experiment data, and Dakota will interpolate and provide the proper residuals to the Bayesian calibration.