.. _variables-histogram_point_uncertain:

"""""""""""""""""""""""""
histogram_point_uncertain
"""""""""""""""""""""""""


Aleatory uncertain variable - discrete histogram


**Topics**


discrete_variables, aleatory_uncertain_variables


.. toctree::
   :hidden:
   :maxdepth: 1

   variables-histogram_point_uncertain-integer
   variables-histogram_point_uncertain-string
   variables-histogram_point_uncertain-real


**Specification**

- *Alias:* None

- *Arguments:* None

- *Default:* no histogram point uncertain variables


**Child Keywords:**

+-------------------------+--------------------+--------------------+---------------------------------------------+
| Required/Optional       | Description of     | Dakota Keyword     | Dakota Keyword Description                  |
|                         | Group              |                    |                                             |
+=========================+====================+====================+=============================================+
| Optional                                     | `integer`__        | Integer valued point histogram variable     |
+----------------------------------------------+--------------------+---------------------------------------------+
| Optional                                     | `string`__         | String (categorical) valued point histogram |
|                                              |                    | variable                                    |
+----------------------------------------------+--------------------+---------------------------------------------+
| Optional                                     | `real`__           | Real valued point histogram variable        |
+----------------------------------------------+--------------------+---------------------------------------------+

.. __: variables-histogram_point_uncertain-integer.html
__ variables-histogram_point_uncertain-string.html
__ variables-histogram_point_uncertain-real.html


**Description**


Histogram uncertain variables are typically used to model a set of
empirical data.  When the variables take on only discrete values or
categories, a discrete, or point histogram is used to describe their
probability mass function (one could think of this as a
:dakkw:`variables-histogram_bin_uncertain` variable with "bins" of zero width).
Dakota supports integer-, string-, and real-valued point histograms.

Point histograms are similar to :dakkw:`variables-discrete_design_set` and
:dakkw:`variables-discrete_state_set`, but as they are uncertain
variables, include the relative probabilities of observing the
different values within the set.

The ``histogram_point_uncertain`` keyword is followed by one or more of
``integer``, ``string``, or ``real``, each of which specify the number of
variables to be characterized as discrete histograms of that sub-type.

Each discrete histogram variable is specified by one or more
abscissa/count pairs.  The ``abscissas``, are the possible values the
variable can take on (:math:`x` coordinates of type integer, string, or
real), and must be specified in increasing order.  These are paired
with ``counts`` :math:`c` which provide the frequency of the given value or
string, relative to other possible values/strings.

Thus, to fully specify a point-based histogram with :math:`n` points, :math:`n` :math:`(x,c)` pairs must be specified with the following features:


- :math:`x` is the point value (integer, string, or real) and :math:`c` is the corresponding count for that value.
- the :math:`x` values must be strictly increasing (lexicographically for strings).
- all :math:`c` values must be positive.
- a minimum of one pair must be specified for each point-based histogram.


**Examples**


The ``pairs_per_variable`` specification provides for the proper
association of multiple sets of :math:`(x,c)` or :math:`(x,y)` pairs with
individual histogram variables. For example, in the following
specification,


.. code-block::

    histogram_point_uncertain
      integer            = 2
      pairs_per_variable = 2     3
      abscissas          = 3 4   100 200 300
      counts             = 1 1   1   2   1


``pairs_per_variable`` associates the :math:`(x,c)` pairs {(3,1),(4,1)} with
one point-based histogram variable (where the values 3 and 4 are
equally probable) and associates the :math:`(x,c)` pairs
{(100,1),(200,2),(300,1)} with a second point-based histogram variable
(where the value 200 is twice as probable as either 100 or 300).


**FAQ**


*Difference between bin and point histograms:* A (continuous) bin
histogram specifies bins of non-zero width, whereas a (discrete) point
histogram specifies individual point values, which can be thought of
as bins with zero width. In the terminology of LHS :cite:p:`Wyss1998`,
the bin pairs specification defines a "continuous linear" distribution and the point pairs specification
defines a "discrete histogram" distribution (although the points are
real-valued, the number of possible values is finite).