12.12.2012 Views

chapter - Atmospheric and Oceanic Science

chapter - Atmospheric and Oceanic Science

chapter - Atmospheric and Oceanic Science

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Statistical analysis of extreme events in a non-stationary context<br />

shape parameter ξ of the GEV distribution. In the case of the dispersion parameter,<br />

a trend would be explored using σ = exp(α + βt), the exponential being used to<br />

ensure that the dispersion parameter is always non-negative. Also, instead of looking<br />

at trends where the parameters vary in time, it is also possible to use the same<br />

approach to determine whether other kinds of trend exist; for example, it might be<br />

of interest to test whether annual maximum one-hour rainfall is related in some way<br />

to the Southern Oscillation Index (SOI). In this case, SOI would take the place of<br />

the time variable t in the above example.<br />

The method of Maximum Likelihood, used in the above example, underpins<br />

a very large part of estimation <strong>and</strong> hypothesis-testing procedures in parametric statistics,<br />

<strong>and</strong> it is not proposed here to re-present the theory that is already fully presented<br />

in many text-books. Briefly, given independent observations x1, x2… xN of a<br />

r<strong>and</strong>om variable X with probability distribution fx(x,θ) where θ is a parameter to be<br />

estimated, the likelihood function L(θ, x1, x2… xN) is the product fx(x1,θ) fx(x2,θ)…<br />

fx(xN,θ), <strong>and</strong> the ML estimate of θ is the value that maximizes this function. The<br />

variance of the estimate of θ is approximately - 1/ (second derivative of logeL with<br />

respect to θ). ML estimates have desirable statistical properties which makes ML<br />

estimation the procedure of choice, when the distribution fx(x,θ) can be specified.<br />

The brief account given here assumes just one parameter θ; extension to the case of<br />

several parameters θ = {θ1, θ2,…} is straightforward.<br />

To determine whether the selected distribution (in this case, the GEV) is<br />

appropriate for the data being analyzed, diagnostic techniques are used which are<br />

also provided by the available software packages. Figure 15.1, known as a Q-Q<br />

plot, shows a plot of the order statistics of the Porto Alegre data (vertical axis)<br />

against the corresponding quantiles (“scores”) for a GEV distribution; if the GEV<br />

provides a good fit to data, the plotted points should lie close to a straight line. The<br />

degree of agreement between the points <strong>and</strong> a straight line is ascertained by looking<br />

at whether the plotted points lie within the 95% confidence b<strong>and</strong>, which is also<br />

shown in Fig. 15.1. This shows that there is no reason to doubt that the GEV is<br />

appropriate for the data. Since, in the present example, there is no evidence of timetrend<br />

in the data, return periods can be calculated as shown in Fig. 15.2; the return<br />

period, in years, is shown on the horizontal axis, <strong>and</strong> the hourly rainfall for this<br />

return period is shown on the vertical axis. The 95% confidence limits, also plotted,<br />

show that there is considerable uncertainty in the calculated rainfalls.<br />

15.4. Example: POT procedure applied to fitting daily rainfalls at Ceres,<br />

Argentina<br />

This example uses the 44-year record 1959-2002 of daily rainfall for Ceres,<br />

Argentina, supplied by the National Meteorological Service. The example uses a<br />

threshold of 100mm. Theory (e.g., Coles 2001) shows that the distribution of daily<br />

204

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!