ТОП просматриваемых книг сайта:















Computational Statistics in Data Science. Группа авторов
Читать онлайн.Название Computational Statistics in Data Science
Год выпуска 0
isbn 9781119561088
Автор произведения Группа авторов
Жанр Математика
Издательство John Wiley & Sons Limited
The posterior distribution of (given
) is
If the true value of is unknown, it is often estimated from the marginal distribution of
,
via maximum‐likelihood estimation as
Robert and Casella [4] consider estimating using the posterior mean
. Under a quadratic loss, the Bayes estimator is
The risk for
is difficult to obtain analytically (although not impossible, see Robert and Casella [4]). Instead, we can estimate the risk over a grid of values using Monte Carlo. To do this, we fix
choices
over a grid, and for each
, generate
Monte Carlo samples from
yielding estimates
The resulting estimate of the risk is an ‐dimensional vector of means, for which we can utilize the sampling distribution in Theorem 1 to construct large‐sample confidence regions. An appropriate choice of a sequential stopping rule here is the relative‐magnitude sequential stopping rule, which stops simulation when the Monte Carlo variance is small relative to the average risk over all values of
considered. It is important to note that the risk at a particular
could be zero, but it is unlikely.
For illustration, we set and simulate a data point from the true model with
. To evaluate risk we choose a grid of
values with
. In order to assess the appropriate Monte Carlo sample size
, we set
so that at least
Monte Carlo samples are obtained. With
, and
estimated using the sample covariance matrix, the sequential stopping rule terminates simulation at 21 100 steps. Figure 2 demonstrates the estimated risk at
iterations and the estimated risk at termination. Pointwise Bonferroni corrected confidence intervals are presented as an indication of variability for each component1.
Figure 2 Estimated risk at
7.3 Bayesian Nonlinear Regression
Consider the biomedical oxygen demand (BOD) data collected by Marske [39] where BOD levels were measured periodically from cultured bottles of stream water. Bates and Watts [40] and Newton and Raftery [41] study a Bayesian nonlinear model with a fixed