Maximum Likelihood Estimation Of Information Measures

2015 IEEE International Symposium on Information Theory (ISIT)(2015)

引用 5|浏览25
暂无评分
摘要
The Maximum Likelihood Estimator (MLE) is widely used in estimating information measures, and involves "plugging-in" the empirical distribution of the data to estimate a given functional of the unknown distribution. In this work we propose a general framework and procedure to analyze the non-asymptotic performance of the MLE in estimating functionals of discrete distributions, under the worst-case mean squared error criterion.We show that existing theory is insufficient for analyzing the bias of the MLE, and propose to apply the theory of approximation using positive linear operators to study this bias. The variance is controlled using the well-known tools from the literature on concentration inequalities. Our techniques completely characterize the maximum L-2 risk incurred by the MLE in estimating the Shannon entropy H(P) = Sigma(S)(i=1) -p(i) ln p(i), and F-alpha(P) = Sigma(S)(i=1) p(i)(alpha) up to a multiplicative constant. As a corollary, for Shannon entropy estimation, we show that it is necessary and sufficient to have n >> S observations for the MLE to be consistent, where S represents the support size. In addition, we obtain that it is necessary and sufficient to consider n >> S-1/alpha samples for the MLE to consistently estimate F-alpha (P), 0 < alpha < 1. The minimax rate-optimal estimators for both problems require S/ln S and S-1/alpha / ln S samples, which implies that the MLE is strictly sub-optimal. When 1 < alpha < 3/2, we show that the maximum L-2 rate of convergence for the MLE is n(-2)(alpha - 1) for infinite support size, while the minimax L-2 rate is (n ln n)(-2(alpha -) (1)). When alpha >= 3/2, the MLE achieves the minimax optimal L-2 convergence rate n(-1) regardless of the support size.
更多
查看译文
关键词
maximum likelihood estimation,MLE,information measure estimation,mean squared error criterion,approximation theory
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要