On Suspicious Coincidences and Pointwise Mutual Information

Research output: Contribution to journalArticlepeer-review

Abstract

Barlow (1985) hypothesized that the co-occurrence of two events A and B is “suspicious” if P(A,B)≫P(A)P(B). We first review classical measures of association for 2 × 2 contingency tables, including Yule's Y (Yule, 1912), which depends only on the odds ratio λ and is independent of the marginal probabilities of the table. We then discuss the mutual information (MI) and pointwise mutual information (PMI), which depend on the ratio P(A,B)/P(A)P(B), as measures of association. We show that once the effect of the marginals is removed, MI and PMI behave similarly to Y as functions of λ. The pointwise mutual information is used extensively in some research communities for flagging suspicious coincidences. We discuss the pros and cons of using it in this way, bearing in mind the sensitivity of the PMI to the marginals, with increased scores for sparser events.
Original languageEnglish
Pages (from-to)2037-2046
Number of pages10
JournalNeural Computation
Volume34
Issue number10
DOIs
Publication statusPublished - 12 Sep 2022

Fingerprint

Dive into the research topics of 'On Suspicious Coincidences and Pointwise Mutual Information'. Together they form a unique fingerprint.

Cite this