How biased are maximum entropy models?

Jakob H. Macke, Iain Murray, Peter E. Latham

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Maximum entropy models have become popular statistical models in neuroscience and other areas in biology, and can be useful tools for obtaining estimates of mutual information in biological systems. However, maximum entropy models fit to small data sets can be subject to sampling bias; i.e. the true entropy of the data can be severely underestimated. Here we study the sampling properties of estimates of the entropy obtained from maximum entropy models. We show that if the data is generated by a distribution that lies in the model class, the bias is equal to the number of parameters divided by twice the number of observations. However, in practice, the true distribution is usually outside the model class, and we show here that this misspecification can lead to much larger bias. We provide a perturbative approximation of the maximally expected bias when the true model is out of model class, and we illustrate our results using numerical simulations of an Ising model; i.e. the second-order maximum entropy distribution on binary data.
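The in-class result stated in the abstract (bias equal to the number of parameters divided by twice the number of observations) can be illustrated with a short calculation. The sketch below is not taken from the paper: the function names are hypothetical, and it assumes the entropy is measured in nats, which the abstract does not state. It uses the standard parameter count for a second-order (Ising) model on n binary variables, namely n fields plus n(n-1)/2 pairwise couplings, and returns only the magnitude of the bias; the abstract indicates that the entropy is underestimated.

```python
def ising_parameter_count(n_units: int) -> int:
    """Free parameters of a second-order maximum entropy (Ising) model
    on n_units binary variables: n fields plus n(n-1)/2 couplings."""
    if n_units < 1:
        raise ValueError("n_units must be at least 1")
    return n_units + n_units * (n_units - 1) // 2


def in_class_entropy_bias(n_params: int, n_samples: int) -> float:
    """Magnitude of the bias quoted in the abstract for a well-specified
    model: number of parameters / (2 * number of observations).
    Units are assumed to be nats (an assumption, not from the abstract)."""
    if n_samples < 1:
        raise ValueError("n_samples must be at least 1")
    return n_params / (2.0 * n_samples)


# Hypothetical example values: 10 binary units, 1000 observations.
n_params = ising_parameter_count(10)
bias = in_class_entropy_bias(n_params, 1000)
print(n_params, bias)  # prints: 55 0.0275
```

This covers only the case where the true distribution lies in the model class; the abstract notes that misspecification can lead to much larger bias, which this calculation does not capture.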
Original language: English
Title of host publication: Advances in Neural Information Processing Systems 24
Editors: J. Shawe-Taylor, R.S. Zemel, P. Bartlett, F.C.N. Pereira, K.Q. Weinberger
Pages: 2034-2042
Number of pages: 9
Publication status: Published - 2011
