20.04.2013 Views

Documentation of the Evaluation of CALPUFF and Other Long ...

Documentation of the Evaluation of CALPUFF and Other Long ...

Documentation of the Evaluation of CALPUFF and Other Long ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

observations) divided by <strong>the</strong> number <strong>of</strong> monitor‐time events that ei<strong>the</strong>r a prediction or<br />

observed tracer occurred at a monitor (i.e., ei<strong>the</strong>r a predicted or observed hits), with a perfect<br />

score <strong>of</strong> 100% (which means <strong>the</strong>re were no occurrences when <strong>the</strong>re was a predicted hit but an<br />

observed miss <strong>and</strong> vice versa).<br />

45%<br />

40%<br />

35%<br />

30%<br />

25%<br />

20%<br />

15%<br />

10%<br />

5%<br />

0%<br />

Threat Score (TS)<br />

(Perfect = 100%)<br />

<strong>CALPUFF</strong> SCIPUFF HYSPLIT FLEXPART CAMx<br />

Figure 6‐8. Threat Score (TS) statistical performance metric for <strong>the</strong> five LRT models <strong>and</strong> <strong>the</strong><br />

ETEX tracer field experiment.<br />

6.4.1.2 Global Analysis <strong>of</strong> Model Performance<br />

Eight global statistical analysis metrics are used to evaluate <strong>the</strong> five LRT model performance<br />

using <strong>the</strong> ETEX data base that are described in Section 2.4 <strong>and</strong> consist <strong>of</strong> <strong>the</strong> FOEX, FA2, FA5,<br />

NMSE, PCC, FB, KS <strong>and</strong> RANK statistical metrics.<br />

The Factor <strong>of</strong> Exceedance (FOEX) gives a measure <strong>of</strong> <strong>the</strong> scatter <strong>of</strong> <strong>the</strong> modeled predicted <strong>and</strong><br />

observed <strong>and</strong> a level <strong>of</strong> underestimation versus overestimation <strong>of</strong> <strong>the</strong> model. FOEX is bounded<br />

by ‐50% to +50%. The within a Factor <strong>of</strong> α (FAα), where we used within a Factor <strong>of</strong> 2 (FA2) <strong>and</strong><br />

5 (FA5), also gives an indication <strong>of</strong> <strong>the</strong> amount <strong>of</strong> scatter in <strong>the</strong> predicted <strong>and</strong> observed tracer<br />

pairs, but no information on whe<strong>the</strong>r <strong>the</strong> model is over‐ or under‐predicting. A perfect model<br />

would have an FAα score <strong>of</strong> 100%. A good performing model would have a FOEX score near<br />

zero <strong>and</strong> high FAα values. A model with a large negative FOEX <strong>and</strong> low FAα values would<br />

indicate an under‐prediction tendency. Whereas a model with a large positive FOEX <strong>and</strong> low<br />

FAα would suggest a model that over‐predicts.<br />

Figure 6‐9 displays <strong>the</strong> FOEX performance metrics for <strong>the</strong> five LRT models <strong>and</strong> <strong>the</strong> ETEX<br />

modeling period.<br />

113

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!