chapter - Atmospheric and Oceanic Science
chapter - Atmospheric and Oceanic Science
chapter - Atmospheric and Oceanic Science
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
Statistical analysis of extreme events in a non-stationary context<br />
shape parameter ξ of the GEV distribution. In the case of the dispersion parameter,<br />
a trend would be explored using σ = exp(α + βt), the exponential being used to<br />
ensure that the dispersion parameter is always non-negative. Also, instead of looking<br />
at trends where the parameters vary in time, it is also possible to use the same<br />
approach to determine whether other kinds of trend exist; for example, it might be<br />
of interest to test whether annual maximum one-hour rainfall is related in some way<br />
to the Southern Oscillation Index (SOI). In this case, SOI would take the place of<br />
the time variable t in the above example.<br />
The method of Maximum Likelihood, used in the above example, underpins<br />
a very large part of estimation <strong>and</strong> hypothesis-testing procedures in parametric statistics,<br />
<strong>and</strong> it is not proposed here to re-present the theory that is already fully presented<br />
in many text-books. Briefly, given independent observations x1, x2… xN of a<br />
r<strong>and</strong>om variable X with probability distribution fx(x,θ) where θ is a parameter to be<br />
estimated, the likelihood function L(θ, x1, x2… xN) is the product fx(x1,θ) fx(x2,θ)…<br />
fx(xN,θ), <strong>and</strong> the ML estimate of θ is the value that maximizes this function. The<br />
variance of the estimate of θ is approximately - 1/ (second derivative of logeL with<br />
respect to θ). ML estimates have desirable statistical properties which makes ML<br />
estimation the procedure of choice, when the distribution fx(x,θ) can be specified.<br />
The brief account given here assumes just one parameter θ; extension to the case of<br />
several parameters θ = {θ1, θ2,…} is straightforward.<br />
To determine whether the selected distribution (in this case, the GEV) is<br />
appropriate for the data being analyzed, diagnostic techniques are used which are<br />
also provided by the available software packages. Figure 15.1, known as a Q-Q<br />
plot, shows a plot of the order statistics of the Porto Alegre data (vertical axis)<br />
against the corresponding quantiles (“scores”) for a GEV distribution; if the GEV<br />
provides a good fit to data, the plotted points should lie close to a straight line. The<br />
degree of agreement between the points <strong>and</strong> a straight line is ascertained by looking<br />
at whether the plotted points lie within the 95% confidence b<strong>and</strong>, which is also<br />
shown in Fig. 15.1. This shows that there is no reason to doubt that the GEV is<br />
appropriate for the data. Since, in the present example, there is no evidence of timetrend<br />
in the data, return periods can be calculated as shown in Fig. 15.2; the return<br />
period, in years, is shown on the horizontal axis, <strong>and</strong> the hourly rainfall for this<br />
return period is shown on the vertical axis. The 95% confidence limits, also plotted,<br />
show that there is considerable uncertainty in the calculated rainfalls.<br />
15.4. Example: POT procedure applied to fitting daily rainfalls at Ceres,<br />
Argentina<br />
This example uses the 44-year record 1959-2002 of daily rainfall for Ceres,<br />
Argentina, supplied by the National Meteorological Service. The example uses a<br />
threshold of 100mm. Theory (e.g., Coles 2001) shows that the distribution of daily<br />
204