27.01.2014 Views

Entropy Inference and the James-Stein Estimator, with Application to ...

Entropy Inference and the James-Stein Estimator, with Application to ...

Entropy Inference and the James-Stein Estimator, with Application to ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

ENTROPY INFERENCE AND THE JAMES-STEIN ESTIMATOR<br />

is a weighted average of <strong>the</strong>se two estima<strong>to</strong>rs, where <strong>the</strong> weight is chosen in a data-driven fashion<br />

such that ˆθ Shrink is improved in terms of mean squared error relative <strong>to</strong> both ˆθ <strong>and</strong> ˆθ Target .<br />

A key advantage of <strong>James</strong>-<strong>Stein</strong>-type shrinkage is that <strong>the</strong> optimal shrinkage intensity λ ⋆ can be<br />

calculated analytically <strong>and</strong> <strong>with</strong>out knowing <strong>the</strong> true value θ, via<br />

λ ⋆ = ∑p k=1 Var(ˆθ k ) − Cov(ˆθ k , ˆθ Target<br />

k<br />

)+Bias(ˆθ k )E(ˆθ k − ˆθ Target<br />

∑ p k=1 E[(ˆθ k − ˆθ Target<br />

k<br />

) 2 ]<br />

k<br />

)<br />

. (10)<br />

A simple estimate of λ ⋆ is obtained by replacing all variances <strong>and</strong> covariances in Equation 10 <strong>with</strong><br />

<strong>the</strong>ir empirical counterparts, followed by truncation of ˆλ ⋆ at 1 (so that ˆλ ⋆ ≤ 1 always holds).<br />

Equation 10 is discussed in detail in Schäfer <strong>and</strong> Strimmer (2005b) <strong>and</strong> Opgen-Rhein <strong>and</strong> Strimmer<br />

(2007a). More specialized versions of it are treated, for example, in Ledoit <strong>and</strong> Wolf (2003) for<br />

unbiased ˆθ <strong>and</strong> in Thompson (1968) (unbiased, univariate case <strong>with</strong> deterministic target). A very<br />

early version (univariate <strong>with</strong> zero target) even predates <strong>the</strong> estima<strong>to</strong>r of <strong>James</strong> <strong>and</strong> <strong>Stein</strong>, see Goodman<br />

(1953). For <strong>the</strong> multinormal setting of <strong>James</strong> <strong>and</strong> <strong>Stein</strong> (1961), Equation 9 <strong>and</strong> Equation 10<br />

reduce <strong>to</strong> <strong>the</strong> shrinkage estima<strong>to</strong>r described in Stigler (1990).<br />

<strong>James</strong>-<strong>Stein</strong> shrinkage has an empirical Bayes interpretation (Efron <strong>and</strong> Morris, 1973). Note,<br />

however, that only <strong>the</strong> first two moments of <strong>the</strong> distributions of ˆθ Target <strong>and</strong> ˆθ need <strong>to</strong> be specified in<br />

Equation 10. Hence, <strong>James</strong>-<strong>Stein</strong> estimation may be viewed as a quasi-empirical Bayes approach<br />

(in <strong>the</strong> same sense as in quasi-likelihood, which also requires only <strong>the</strong> first two moments).<br />

Appendix B. Computer Implementation<br />

The proposed shrinkage estima<strong>to</strong>rs of entropy <strong>and</strong> mutual information, as well as all o<strong>the</strong>r investigated<br />

entropy estima<strong>to</strong>rs, have been implemented in R (R Development Core Team, 2008). A<br />

corresponding R package “entropy” was deposited in <strong>the</strong> R archive CRAN <strong>and</strong> is accessible at <strong>the</strong><br />

URL http://cran.r-project.org/web/packages/entropy/ under <strong>the</strong> GNU General Public<br />

License.<br />

References<br />

A. Agresti <strong>and</strong> D. B. Hitchcock. Bayesian inference for categorical data analysis. Statist. Meth.<br />

Appl., 14:297–330, 2005.<br />

A. J. Butte, P. Tamayo, D. Slonim, T. R. Golub, <strong>and</strong> I. S. Kohane. Discovering functional relationships<br />

between RNA expression <strong>and</strong> chemo<strong>the</strong>rapeutic susceptibility using relevance networks.<br />

Proc. Natl. Acad. Sci. USA, 97:12182–12186, 2000.<br />

A. Chao <strong>and</strong> T.-J. Shen. Nonparametric estimation of Shannon’s index of diversity when <strong>the</strong>re are<br />

unseen species. Environ. Ecol. Stat., 10:429–443, 2003.<br />

A. Dobra, C. Hans, B. Jones, J. R. Nevins, G. Yao, <strong>and</strong> M. West. Sparse graphical models for<br />

exploring gene expression data. J. Multiv. Anal., 90:196–212, 2004.<br />

B. Efron <strong>and</strong> C. N. Morris. <strong>Stein</strong>’s estimation rule <strong>and</strong> its competi<strong>to</strong>rs–an empirical Bayes approach.<br />

J. Amer. Statist. Assoc., 68:117–130, 1973.<br />

1481

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!