.. _method-surrogate_based_global: """""""""""""""""""""" surrogate_based_global """""""""""""""""""""" Adaptive Global Surrogate-Based Optimization **Topics** surrogate_based_optimization_methods .. toctree:: :hidden: :maxdepth: 1 method-surrogate_based_global-method_pointer method-surrogate_based_global-method_name method-surrogate_based_global-model_pointer method-surrogate_based_global-replace_points method-surrogate_based_global-max_iterations **Specification** - *Alias:* None - *Arguments:* None **Child Keywords:** +-------------------------+--------------------+--------------------+---------------------------------------------+ | Required/Optional | Description of | Dakota Keyword | Dakota Keyword Description | | | Group | | | +=========================+====================+====================+=============================================+ | Required (Choose One) | Sub-method | `method_pointer`__ | Pointer to sub-method to apply to a | | | Selection | | surrogate or branch-and-bound sub-problem | | | +--------------------+---------------------------------------------+ | | | `method_name`__ | Specify sub-method by name | +-------------------------+--------------------+--------------------+---------------------------------------------+ | Required | `model_pointer`__ | Identifier for model block to be used by a | | | | method | +----------------------------------------------+--------------------+---------------------------------------------+ | Optional | `replace_points`__ | (Recommended) Replace points in the | | | | surrogate training set, instead of | | | | appending | +----------------------------------------------+--------------------+---------------------------------------------+ | Optional | `max_iterations`__ | Number of iterations allowed for optimizers | | | | and adaptive UQ methods | +----------------------------------------------+--------------------+---------------------------------------------+ .. __: method-surrogate_based_global-method_pointer.html __ method-surrogate_based_global-method_name.html __ method-surrogate_based_global-model_pointer.html __ method-surrogate_based_global-replace_points.html __ method-surrogate_based_global-max_iterations.html **Description** The ``surrogate_based_global`` method iteratively performs optimization on a global surrogate using the same bounds during each iteration. - In one iteration, optimal solutions are found on the surrogate model, and a subset of these are passed to the next iteration. - At the next iteration, these surrogate-optimal variable sets are evaluated with the "truth" model, and added to the set of points over which the next surrogate is constructed. In this way, the optimization operates on a more accurate surrogate during each iteration, presumably driving to optimality quickly. In contrast to :dakkw:`method-surrogate_based_local`, this approach has no guarantee of convergence. *Usage Tips* *Attention:* This adaptive method is not recommended for "build-once" surrogates trained from (static) imported data or trained online using a single Dakota design of experiments. Instead, any Dakota optimization method can be used with a (build-once) global surrogate by specifying the ``id_model`` of a global surrogate model with the optimizer's ``model_pointer`` keyword. Configuring the method: - The sub-method, specified with either :dakkw:`method-surrogate_based_global-method_pointer` or :dakkw:`method-surrogate_based_global-method_name` should typically be an optimizer that returns multiple final solutions, such as :dakkw:`method-moga` or :dakkw:`method-soga`. - The :dakkw:`method-surrogate_based_global-model_pointer` keyword must identify a surrogate model which includes an underlying truth model, typically via :dakkw:`model-surrogate-global-truth_model_pointer` or :dakkw:`model-surrogate-global-dace_method_pointer` Workflow: - One might first try a single minimization method coupled with a surrogate model prior to using this surrogate-based global method. This is essentially equivalent to setting ``max_iterations`` to 1 and will allow one to get a sense of what surrogate types are the most accurate to use for the problem. - Consider starting with a small number of maximum iterations, such as 3--5, to get a sense of how the optimization evolves as the surrogate gets updated. If it is still changing significantly, then a larger number (used in combination with restart) may be needed. - Surrogates can be built for all primary functions and constraints or for only a subset of these functions and constraints. This allows one to use a "truth" model directly for some of the response functions, perhaps due to them being much less expensive than other functions. *Known Issue: When using discrete variables, there have been sometimes significant differences in surrogate behavior observed across computing platforms in some cases. The cause has not yet been fully diagnosed and is currently under investigation. In addition, guidance on appropriate construction and use of surrogates with discrete variables is under development. In the meantime, users should therefore be aware that there is a risk of inaccurate results when using surrogates with discrete variables.* *Expected HDF5 Output* If Dakota was built with HDF5 support and run with the :dakkw:`environment-results_output-hdf5` keyword, this method writes the following results to HDF5: - :ref:`hdf5_results-best_params` - :ref:`hdf5_results-best_obj_fncs` (when :dakkw:`responses-objective_functions`) are specified) - :ref:`hdf5_results-best_constraints` - :ref:`hdf5_results-calibration` (when :dakkw:`responses-calibration_terms` are specified) **Theory** In surrogate-based optimization (SBO) and surrogate-based nonlinear least squares (SBNLS), minimization occurs using a set of one or more approximations, defined from a surrogate model, that are built and periodically updated using data from a "truth" model. The surrogate model can be a global data fit (e.g., regression or interpolation of data generated from a design of computer experiments), a multipoint approximation, a local Taylor Series expansion, or a model hierarchy approximation (e.g., a low-fidelity simulation model), whereas the truth model involves a high-fidelity simulation model. The goals of surrogate-based methods are to reduce the total number of truth model simulations and, in the case of global data fit surrogates, to smooth noisy data with an easily navigated analytic function. The ``surrogate_based_global`` method was originally designed for MOGA (a multi-objective genetic algorithm). Since genetic algorithms often need thousands or tens of thousands of points to produce optimal or near-optimal solutions, the use of surrogates can be helpful for reducing the truth model evaluations. Instead of creating one set of surrogates for the individual objectives and running the optimization algorithm on the surrogate once, the idea is to select points along the (surrogate) Pareto frontier, which can be used to supplement the existing points. In this way, one does not need to use many points initially to get a very accurate surrogate. The surrogate becomes more accurate as the iterations progress.