Entropy Inference and the James-Stein Estimator, with Application to ...
Entropy Inference and the James-Stein Estimator, with Application to ...
Entropy Inference and the James-Stein Estimator, with Application to ...
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
ENTROPY INFERENCE AND THE JAMES-STEIN ESTIMATOR<br />
is a weighted average of <strong>the</strong>se two estima<strong>to</strong>rs, where <strong>the</strong> weight is chosen in a data-driven fashion<br />
such that ˆθ Shrink is improved in terms of mean squared error relative <strong>to</strong> both ˆθ <strong>and</strong> ˆθ Target .<br />
A key advantage of <strong>James</strong>-<strong>Stein</strong>-type shrinkage is that <strong>the</strong> optimal shrinkage intensity λ ⋆ can be<br />
calculated analytically <strong>and</strong> <strong>with</strong>out knowing <strong>the</strong> true value θ, via<br />
λ ⋆ = ∑p k=1 Var(ˆθ k ) − Cov(ˆθ k , ˆθ Target<br />
k<br />
)+Bias(ˆθ k )E(ˆθ k − ˆθ Target<br />
∑ p k=1 E[(ˆθ k − ˆθ Target<br />
k<br />
) 2 ]<br />
k<br />
)<br />
. (10)<br />
A simple estimate of λ ⋆ is obtained by replacing all variances <strong>and</strong> covariances in Equation 10 <strong>with</strong><br />
<strong>the</strong>ir empirical counterparts, followed by truncation of ˆλ ⋆ at 1 (so that ˆλ ⋆ ≤ 1 always holds).<br />
Equation 10 is discussed in detail in Schäfer <strong>and</strong> Strimmer (2005b) <strong>and</strong> Opgen-Rhein <strong>and</strong> Strimmer<br />
(2007a). More specialized versions of it are treated, for example, in Ledoit <strong>and</strong> Wolf (2003) for<br />
unbiased ˆθ <strong>and</strong> in Thompson (1968) (unbiased, univariate case <strong>with</strong> deterministic target). A very<br />
early version (univariate <strong>with</strong> zero target) even predates <strong>the</strong> estima<strong>to</strong>r of <strong>James</strong> <strong>and</strong> <strong>Stein</strong>, see Goodman<br />
(1953). For <strong>the</strong> multinormal setting of <strong>James</strong> <strong>and</strong> <strong>Stein</strong> (1961), Equation 9 <strong>and</strong> Equation 10<br />
reduce <strong>to</strong> <strong>the</strong> shrinkage estima<strong>to</strong>r described in Stigler (1990).<br />
<strong>James</strong>-<strong>Stein</strong> shrinkage has an empirical Bayes interpretation (Efron <strong>and</strong> Morris, 1973). Note,<br />
however, that only <strong>the</strong> first two moments of <strong>the</strong> distributions of ˆθ Target <strong>and</strong> ˆθ need <strong>to</strong> be specified in<br />
Equation 10. Hence, <strong>James</strong>-<strong>Stein</strong> estimation may be viewed as a quasi-empirical Bayes approach<br />
(in <strong>the</strong> same sense as in quasi-likelihood, which also requires only <strong>the</strong> first two moments).<br />
Appendix B. Computer Implementation<br />
The proposed shrinkage estima<strong>to</strong>rs of entropy <strong>and</strong> mutual information, as well as all o<strong>the</strong>r investigated<br />
entropy estima<strong>to</strong>rs, have been implemented in R (R Development Core Team, 2008). A<br />
corresponding R package “entropy” was deposited in <strong>the</strong> R archive CRAN <strong>and</strong> is accessible at <strong>the</strong><br />
URL http://cran.r-project.org/web/packages/entropy/ under <strong>the</strong> GNU General Public<br />
License.<br />
References<br />
A. Agresti <strong>and</strong> D. B. Hitchcock. Bayesian inference for categorical data analysis. Statist. Meth.<br />
Appl., 14:297–330, 2005.<br />
A. J. Butte, P. Tamayo, D. Slonim, T. R. Golub, <strong>and</strong> I. S. Kohane. Discovering functional relationships<br />
between RNA expression <strong>and</strong> chemo<strong>the</strong>rapeutic susceptibility using relevance networks.<br />
Proc. Natl. Acad. Sci. USA, 97:12182–12186, 2000.<br />
A. Chao <strong>and</strong> T.-J. Shen. Nonparametric estimation of Shannon’s index of diversity when <strong>the</strong>re are<br />
unseen species. Environ. Ecol. Stat., 10:429–443, 2003.<br />
A. Dobra, C. Hans, B. Jones, J. R. Nevins, G. Yao, <strong>and</strong> M. West. Sparse graphical models for<br />
exploring gene expression data. J. Multiv. Anal., 90:196–212, 2004.<br />
B. Efron <strong>and</strong> C. N. Morris. <strong>Stein</strong>’s estimation rule <strong>and</strong> its competi<strong>to</strong>rs–an empirical Bayes approach.<br />
J. Amer. Statist. Assoc., 68:117–130, 1973.<br />
1481