.. _method-stoch_collocation:

"""""""""""""""""
stoch_collocation
"""""""""""""""""


Uncertainty quantification with stochastic collocation


.. toctree::
   :hidden:
   :maxdepth: 1

   method-stoch_collocation-p_refinement
   method-stoch_collocation-h_refinement
   method-stoch_collocation-max_refinement_iterations
   method-stoch_collocation-convergence_tolerance
   method-stoch_collocation-metric_scale
   method-stoch_collocation-quadrature_order
   method-stoch_collocation-sparse_grid_level
   method-stoch_collocation-piecewise
   method-stoch_collocation-askey
   method-stoch_collocation-wiener
   method-stoch_collocation-use_derivatives
   method-stoch_collocation-samples_on_emulator
   method-stoch_collocation-sample_type
   method-stoch_collocation-rng
   method-stoch_collocation-probability_refinement
   method-stoch_collocation-final_moments
   method-stoch_collocation-response_levels
   method-stoch_collocation-probability_levels
   method-stoch_collocation-reliability_levels
   method-stoch_collocation-gen_reliability_levels
   method-stoch_collocation-distribution
   method-stoch_collocation-variance_based_decomp
   method-stoch_collocation-diagonal_covariance
   method-stoch_collocation-full_covariance
   method-stoch_collocation-import_approx_points_file
   method-stoch_collocation-export_approx_points_file
   method-stoch_collocation-seed
   method-stoch_collocation-fixed_seed
   method-stoch_collocation-model_pointer


**Specification**

- *Alias:* nond_stoch_collocation 

- *Arguments:* None


**Child Keywords:**

+-------------------------+--------------------+-------------------------------+-----------------------------------------------+
| Required/Optional       | Description of     | Dakota Keyword                | Dakota Keyword Description                    |
|                         | Group              |                               |                                               |
+=========================+====================+===============================+===============================================+
| Optional (Choose One)   | Automated          | `p_refinement`__              | Automatic polynomial order refinement         |
|                         | Refinement Type    +-------------------------------+-----------------------------------------------+
|                         |                    | `h_refinement`__              | Employ h-refinement to refine the grid        |
+-------------------------+--------------------+-------------------------------+-----------------------------------------------+
| Optional                                     | `max_refinement_iterations`__ | Maximum number of expansion refinement        |
|                                              |                               | iterations                                    |
+----------------------------------------------+-------------------------------+-----------------------------------------------+
| Optional                                     | `convergence_tolerance`__     | Stopping criterion based on objective         |
|                                              |                               | function or statistics convergence            |
+----------------------------------------------+-------------------------------+-----------------------------------------------+
| Optional                                     | `metric_scale`__              | define scaling of statistical metrics when    |
|                                              |                               | adapting UQ surrogates                        |
+-------------------------+--------------------+-------------------------------+-----------------------------------------------+
| Required (Choose One)   | Interpolation Grid | `quadrature_order`__          | Order for tensor-products of Gaussian         |
|                         | Type               |                               | quadrature rules                              |
|                         |                    +-------------------------------+-----------------------------------------------+
|                         |                    | `sparse_grid_level`__         | Level to use in sparse grid integration or    |
|                         |                    |                               | interpolation                                 |
+-------------------------+--------------------+-------------------------------+-----------------------------------------------+
| Optional (Choose One)   | Basis Polynomial   | `piecewise`__                 | Use piecewise local basis functions           |
|                         | Family             +-------------------------------+-----------------------------------------------+
|                         |                    | `askey`__                     | Select the standardized random variables (and |
|                         |                    |                               | associated basis polynomials) from the Askey  |
|                         |                    |                               | family that best match the user-specified     |
|                         |                    |                               | random variables.                             |
|                         |                    +-------------------------------+-----------------------------------------------+
|                         |                    | `wiener`__                    | Use standard normal random variables (along   |
|                         |                    |                               | with Hermite orthogonal basis polynomials)    |
|                         |                    |                               | when transforming to a standardized           |
|                         |                    |                               | probability space.                            |
+-------------------------+--------------------+-------------------------------+-----------------------------------------------+
| Optional                                     | `use_derivatives`__           | Use derivative data to construct surrogate    |
|                                              |                               | models                                        |
+----------------------------------------------+-------------------------------+-----------------------------------------------+
| Optional                                     | `samples_on_emulator`__       | Number of samples at which to evaluate an     |
|                                              |                               | emulator (surrogate)                          |
+----------------------------------------------+-------------------------------+-----------------------------------------------+
| Optional                                     | `sample_type`__               | Selection of sampling strategy                |
+----------------------------------------------+-------------------------------+-----------------------------------------------+
| Optional                                     | `rng`__                       | Selection of a random number generator        |
+----------------------------------------------+-------------------------------+-----------------------------------------------+
| Optional                                     | `probability_refinement`__    | Allow refinement of probability and           |
|                                              |                               | generalized reliability results using         |
|                                              |                               | importance sampling                           |
+----------------------------------------------+-------------------------------+-----------------------------------------------+
| Optional                                     | `final_moments`__             | Output moments of the specified type and      |
|                                              |                               | include them within the set of final          |
|                                              |                               | statistics.                                   |
+----------------------------------------------+-------------------------------+-----------------------------------------------+
| Optional                                     | `response_levels`__           | Values at which to estimate desired           |
|                                              |                               | statistics for each response                  |
+----------------------------------------------+-------------------------------+-----------------------------------------------+
| Optional                                     | `probability_levels`__        | Specify probability levels at which to        |
|                                              |                               | estimate the corresponding response value     |
+----------------------------------------------+-------------------------------+-----------------------------------------------+
| Optional                                     | `reliability_levels`__        | Specify reliability levels at which the       |
|                                              |                               | response values will be estimated             |
+----------------------------------------------+-------------------------------+-----------------------------------------------+
| Optional                                     | `gen_reliability_levels`__    | Specify generalized relability levels at      |
|                                              |                               | which to estimate the corresponding response  |
|                                              |                               | value                                         |
+----------------------------------------------+-------------------------------+-----------------------------------------------+
| Optional                                     | `distribution`__              | Selection of cumulative or complementary      |
|                                              |                               | cumulative functions                          |
+----------------------------------------------+-------------------------------+-----------------------------------------------+
| Optional                                     | `variance_based_decomp`__     | Activates global sensitivity analysis based   |
|                                              |                               | on decomposition of response variance into    |
|                                              |                               | main, interaction, and total effects          |
+-------------------------+--------------------+-------------------------------+-----------------------------------------------+
| Optional (Choose One)   | Covariance Type    | `diagonal_covariance`__       | Display only the diagonal terms of the        |
|                         |                    |                               | covariance matrix                             |
|                         |                    +-------------------------------+-----------------------------------------------+
|                         |                    | `full_covariance`__           | Display the full covariance matrix            |
+-------------------------+--------------------+-------------------------------+-----------------------------------------------+
| Optional                                     | `import_approx_points_file`__ | Filename for points at which to evaluate the  |
|                                              |                               | PCE/SC surrogate                              |
+----------------------------------------------+-------------------------------+-----------------------------------------------+
| Optional                                     | `export_approx_points_file`__ | Output file for surrogate model value         |
|                                              |                               | evaluations                                   |
+----------------------------------------------+-------------------------------+-----------------------------------------------+
| Optional                                     | `seed`__                      | Seed of the random number generator           |
+----------------------------------------------+-------------------------------+-----------------------------------------------+
| Optional                                     | `fixed_seed`__                | Reuses the same seed value for multiple       |
|                                              |                               | random sampling sets                          |
+----------------------------------------------+-------------------------------+-----------------------------------------------+
| Optional                                     | `model_pointer`__             | Identifier for model block to be used by a    |
|                                              |                               | method                                        |
+----------------------------------------------+-------------------------------+-----------------------------------------------+

.. __: method-stoch_collocation-p_refinement.html
__ method-stoch_collocation-h_refinement.html
__ method-stoch_collocation-max_refinement_iterations.html
__ method-stoch_collocation-convergence_tolerance.html
__ method-stoch_collocation-metric_scale.html
__ method-stoch_collocation-quadrature_order.html
__ method-stoch_collocation-sparse_grid_level.html
__ method-stoch_collocation-piecewise.html
__ method-stoch_collocation-askey.html
__ method-stoch_collocation-wiener.html
__ method-stoch_collocation-use_derivatives.html
__ method-stoch_collocation-samples_on_emulator.html
__ method-stoch_collocation-sample_type.html
__ method-stoch_collocation-rng.html
__ method-stoch_collocation-probability_refinement.html
__ method-stoch_collocation-final_moments.html
__ method-stoch_collocation-response_levels.html
__ method-stoch_collocation-probability_levels.html
__ method-stoch_collocation-reliability_levels.html
__ method-stoch_collocation-gen_reliability_levels.html
__ method-stoch_collocation-distribution.html
__ method-stoch_collocation-variance_based_decomp.html
__ method-stoch_collocation-diagonal_covariance.html
__ method-stoch_collocation-full_covariance.html
__ method-stoch_collocation-import_approx_points_file.html
__ method-stoch_collocation-export_approx_points_file.html
__ method-stoch_collocation-seed.html
__ method-stoch_collocation-fixed_seed.html
__ method-stoch_collocation-model_pointer.html



**Description**


Stochastic collocation is a general framework for approximate
representation of random response functions in terms of
finite-dimensional interpolation bases.

The stochastic collocation (SC) method is very similar to
:dakkw:`method-polynomial_chaos`, with the key difference that the orthogonal
polynomial basis functions are replaced with interpolation polynomial
bases. The interpolation polynomials may be either local or global
and either value-based or gradient-enhanced. In the local case,
valued-based are piecewise linear splines and gradient-enhanced are
piecewise cubic splines, and in the global case, valued-based are
Lagrange interpolants and gradient-enhanced are Hermite interpolants.
A value-based expansion takes the form


.. math:: R = \sum_{i=1}^{N_p} r_i L_i(\xi) 

where :math:`N_p`  is the total number of collocation points, :math:`r_i` 
is a response value at the :math:`i^{th}`  collocation point, :math:`L_i` 
is the :math:`i^{th}`  multidimensional interpolation polynomial, and
:math:`\xi`  is a vector of standardized random variables.

Thus, in PCE, one forms coefficients for known orthogonal polynomial
basis functions, whereas SC forms multidimensional interpolation
functions for known coefficients.



*Basis polynomial family (Group 2)*

In addition to the :dakkw:`method-stoch_collocation-askey` and
:dakkw:`method-stoch_collocation-wiener` basis types also supported by
:dakkw:`method-polynomial_chaos`, SC supports the option of ``piecewise`` local
basis functions. These are piecewise linear splines, or in the case of
gradient-enhanced interpolation via the ``use_derivatives``
specification, piecewise cubic Hermite splines. Both of these basis
options provide local support only over the range from the
interpolated point to its nearest 1D neighbors (within a tensor grid
or within each of the tensor grids underlying a sparse grid), which
exchanges the fast convergence of global bases for smooth functions
for robustness in the representation of nonsmooth response functions
(that can induce Gibbs oscillations when using high-order global basis
functions). When local basis functions are used, the usage of
nonequidistant collocation points (e.g., the Gauss point selections
described above) is not well motivated, so equidistant Newton-Cotes
points are employed in this case, and all random variable types are
transformed to standard uniform probability space. The
global gradient-enhanced interpolants (Hermite interpolation
polynomials) are also restricted to uniform or transformed uniform
random variables (due to the need to compute collocation weights by
integration of the basis polynomials) and share the variable support
shown in :ref:`topic-variable_support` for Piecewise SE. Due to numerical
instability in these high-order basis polynomials, they are deactivated
by default but can be activated by developers using a compile-time switch.

*Interpolation grid type (Group 3)*

To form the multidimensional interpolants :math:`L_i`  of the expansion,
two options are provided.



1. interpolation on a tensor-product of Gaussian quadrature points (specified with ``quadrature_order`` and, optionally, ``dimension_preference`` for anisotropic tensor grids). As for PCE, non-nested Gauss rules are employed by default, although the presence of ``p_refinement`` or ``h_refinement`` will result in default usage of nested rules for normal or uniform variables after any variable transformations have been applied (both defaults can be overridden using explicit ``nested`` or ``non_nested`` specifications).

2. interpolation on a Smolyak sparse grid (specified with ``sparse_grid_level`` and, optionally, ``dimension_preference`` for anisotropic sparse grids) defined from Gaussian rules. As for sparse PCE, nested rules are employed unless overridden with the ``non_nested`` option, and the growth rules are restricted unless overridden by the ``unrestricted`` keyword.


Another distinguishing characteristic of stochastic collocation
relative to :dakkw:`method-polynomial_chaos` is the ability to reformulate the
interpolation problem from a ``nodal`` interpolation approach into a
``hierarchical`` formulation in which each new level of interpolation
defines a set of incremental refinements (known as hierarchical
surpluses) layered on top of the interpolants from previous levels.
This formulation lends itself naturally to uniform or adaptive
refinement strategies, since the hierarchical surpluses can be
interpreted as error estimates for the interpolant. Either global or
local/piecewise interpolants in either value-based or
gradient-enhanced approaches can be formulated using ``hierarchical``
interpolation. The primary restriction for the hierarchical case is
that it currently requires a sparse grid approach using nested
quadrature rules (Genz-Keister, Gauss-Patterson, or Newton-Cotes for
standard normals and standard uniforms in a transformed space: Askey,
Wiener, or Piecewise settings may be required), although this
restriction can be relaxed in the future. A selection of
``hierarchical`` interpolation will provide greater precision in the
increments to mean, standard deviation, covariance, and
reliability-based level mappings induced by a grid change within
uniform or goal-oriented adaptive refinement approaches (see following
section).

It is important to note that, while ``quadrature_order`` and
``sparse_grid_level`` are array inputs, only one scalar from these arrays
is active at a time for a particular expansion estimation.  These
scalars can be augmented with a ``dimension_preference`` to support
anisotropy across the random dimension set.  The array inputs are
present to support advanced use cases such as multifidelity UQ, where
multiple grid resolutions can be employed.

*Automated refinement type (Group 1)*

Automated expansion refinement can be selected as either
``p_refinement`` or ``h_refinement``, and either refinement specification
can be either ``uniform`` or ``dimension_adaptive``. The
``dimension_adaptive`` case can be further specified as either ``sobol`` or
``generalized`` ( ``decay`` not supported). Each of these automated
refinement approaches makes use of the ``max_iterations`` and
``convergence_tolerance`` iteration controls.
The ``h_refinement`` specification involves use of the same piecewise
interpolants (linear or cubic Hermite splines) described above for the
``piecewise`` specification option (it is not necessary to redundantly
specify ``piecewise`` in the case of ``h_refinement``). In future
releases, the ``hierarchical`` interpolation approach will enable local
refinement in addition to the current ``uniform`` and
``dimension_adaptive`` options.

*Covariance type (Group 5)*

These two keywords are used to specify how this method computes, stores,
and outputs the covariance of the responses.  In particular, the diagonal
covariance option is provided for reducing post-processing overhead and
output volume in high dimensional applications.

*Active Variables*

The default behavior is to form expansions over aleatory
uncertain continuous variables. To form expansions
over a broader set of variables, one needs to specify
``active`` followed by ``state``, ``epistemic``, ``design``, or ``all``
in the variables specification block.

For continuous design, continuous state, and continuous epistemic
uncertain variables included in the expansion, interpolation points
for these dimensions are based on Gauss-Legendre rules if non-nested,
Gauss-Patterson rules if nested, and Newton-Cotes points in the case
of piecewise bases. Again, when probability integrals are evaluated,
only the aleatory random variable domain is integrated, leaving behind
a polynomial relationship between the statistics and the remaining
design/state/epistemic variables.

*Optional Keywords regarding method outputs*

Each of these sampling specifications refer to sampling on the SC
approximation for the purposes of generating approximate statistics.

- ``sample_type``
- ``samples``
- ``seed``
- ``fixed_seed``
- ``rng``
- ``probability_refinement``
- ``distribution``
- ``reliability_levels``
- ``response_levels``
- ``probability_levels``
- ``gen_reliability_levels``

Since SC approximations are formed on structured grids, there should
be no ambiguity with simulation sampling for generating the SC expansion.

When using the ``probability_refinement`` control, the number of
refinement samples is not under the user's control (these evaluations
are approximation-based, so management of this expense is less
critical). This option allows for refinement of probability and
generalized reliability results using importance sampling.

*Multi-fidelity UQ*

When using multifidelity UQ, the high fidelity expansion generated
from combining the low fidelity and discrepancy expansions retains the
polynomial form of the low fidelity expansion (only the coefficients
are updated).  Refer to :dakkw:`method-polynomial_chaos` for information
on the multifidelity interpretation of array inputs for
``quadrature_order`` and ``sparse_grid_level``.

*Usage Tips*

If *n* is small, then tensor-product Gaussian quadrature is again the
preferred choice. For larger *n*, tensor-product quadrature quickly
becomes too expensive and the sparse grid approach is preferred. For
self-consistency in growth rates, nested rules employ restricted
exponential growth (with the exception of the ``dimension_adaptive``
``p_refinement`` ``generalized`` case) for consistency with the linear
growth used for non-nested Gauss rules (integrand precision
:math:`i=4l+1`  for sparse grid level *l* and :math:`i=2m-1`  for tensor
grid order *m*).

*Additional Resources*

Dakota provides access to SC methods through the NonDStochCollocation
class. Refer to :ref:`uq` and :ref:`theory:uq:expansion` for additional information
on the SC algorithm.

*Expected HDF5 Output*

If Dakota was built with HDF5 support and run with the
:dakkw:`environment-results_output-hdf5` keyword, this method
writes the following results to HDF5:


- :ref:`hdf5_results-se_moments`
- :ref:`hdf5_results-pdf`
- :ref:`hdf5_results-level_mappings`
- :ref:`hdf5_results-vbd`



**Examples**



.. code-block::

    method,
     stoch_collocation
       sparse_grid_level = 2
       samples = 10000 seed = 12347 rng rnum2
       response_levels = .1 1. 50. 100. 500. 1000.
       variance_based_decomp





**Theory**


As mentioned above, a value-based expansion takes the form


.. math:: R = \sum_{i=1}^{N_p} r_i L_i(\xi) 

The :math:`i^{th}`  interpolation polynomial assumes the value of 1 at
the :math:`i^{th}`  collocation point and 0 at all other collocation
points, involving either a global Lagrange polynomial basis or local
piecewise splines. It is easy to see that the approximation reproduces
the response values at the collocation points and interpolates between
these values at other points. A gradient-enhanced expansion (selected
via the ``use_derivatives`` keyword) involves both type 1 and type 2
basis functions as follows:


.. math:: R = \sum_{i=1}^{N_p} [ r_i H^{(1)}_i(\xi)
   + \sum_{j=1}^n \frac{dr_i}{d\xi_j} H^{(2)}_{ij}(\xi) ] 

where the :math:`i^{th}`  type 1 interpolant produces 1 for the value at
the :math:`i^{th}`  collocation point, 0 for values at all other
collocation points, and 0 for derivatives (when differentiated) at all
collocation points, and the :math:`ij^{th}`  type 2 interpolant produces
0 for values at all collocation points, 1 for the :math:`j^{th}` 
derivative component at the :math:`i^{th}`  collocation point, and 0 for
the :math:`j^{th}`  derivative component at all other collocation points.
Again, this expansion reproduces the response values at each of the
collocation points, and when differentiated, also reproduces each
component of the gradient at each of the collocation points. Since
this technique includes the derivative interpolation explicitly, it
eliminates issues with matrix ill-conditioning that can occur in the
gradient-enhanced PCE approach based on regression. However, the
calculation of high-order global polynomials with the desired
interpolation properties can be similarly numerically challenging such
that the use of local cubic splines is recommended due to numerical
stability.

..
   HIDDEN: The orthogonal polynomials used in defining the Gauss
   points that make up the interpolation grid are governed by the
   Wiener, Askey, or Extended options. The Wiener option uses
   interpolation points from Gauss-Hermite (non-nested) or
   Genz-Keister (nested) integration rules for all random variables
   and employs the same nonlinear variable transformation as the local
   and global reliability methods (and therefore has the same variable
   support).  The Askey option, however, employs interpolation points
   from Gauss-Hermite (Genz-Keister if nested), Gauss-Legendre
   (Gauss-Patterson if nested), Gauss-Laguerre, Gauss-Jacobi, and
   generalized Gauss-Laguerre quadrature. The Extended option avoids
   the use of any nonlinear variable transformations by augmenting the
   Askey approach with Gauss points from numerically-generated
   orthogonal polynomials for non-Askey probability density
   functions. As for PCE, the Wiener/Askey/Extended selection defaults
   to Extended, can be overridden by the user using the keywords \c
   askey or \c wiener, and automatically falls back from
   Extended/Askey to Wiener on a per variable basis as needed to
   support prescribed correlations.