Calculating p-values and their significances with the Energy Test for large datasets

W. Barter, C. Burr, C. Parkes

Research output: Contribution to journal, Article, peer-review

Abstract / Description of output

The energy test method is a multi-dimensional test of whether two samples are consistent with arising from the same underlying population, through the calculation of a single test statistic (called the T-value). The method has recently been used in particle physics to search for samples that differ due to CP violation. The generalised extreme value function has previously been used to describe the distribution of T-values under the null hypothesis that the two samples are drawn from the same underlying population. We show that, in a simple test case, the distribution is not sufficiently well described by the generalised extreme value function. We present a new method, where the distribution of T-values under the null hypothesis when comparing two large samples can be found by scaling the distribution found when comparing small samples drawn from the same population. This method can then be used to quickly calculate the p-values associated with the results of the test.
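As a rough illustration of the quantities named in the abstract, the following Python sketch computes a T-value for two samples and estimates a p-value from a null distribution built by permuting the pooled data. Several things here are assumptions and do not come from the abstract: the Gaussian weighting function and its width `delta`, the normalisation of the three sums (a form commonly quoted for the energy test), and all function names. The `scale` argument is a hypothetical placeholder for the scaling step the abstract describes; the abstract does not state the form of that scaling, so none is supplied.

```python
import numpy as np


def energy_test_t(a, b, delta=0.5):
    """T-value for samples a (n, d) and b (m, d); Gaussian weighting is an assumption."""

    def psi_sum(x, y):
        # Sum of exp(-d^2 / (2 delta^2)) over all pairs (x_i, y_j).
        d2 = ((x[:, None, :] - y[None, :, :]) ** 2).sum(axis=-1)
        return np.exp(-d2 / (2.0 * delta**2)).sum()

    n, m = len(a), len(b)
    # Same-sample sums exclude the i == j terms, each of which equals psi(0) = 1.
    t_aa = (psi_sum(a, a) - n) / (2.0 * n * (n - 1))
    t_bb = (psi_sum(b, b) - m) / (2.0 * m * (m - 1))
    t_ab = psi_sum(a, b) / (n * m)
    return t_aa + t_bb - t_ab


def null_t_values(a, b, n_perm=200, delta=0.5, seed=0):
    """T-values under the null hypothesis, from random relabelling of the pooled data."""
    rng = np.random.default_rng(seed)
    pooled = np.concatenate([a, b])
    n = len(a)
    out = np.empty(n_perm)
    for k in range(n_perm):
        perm = rng.permutation(len(pooled))
        out[k] = energy_test_t(pooled[perm[:n]], pooled[perm[n:]], delta)
    return out


def p_value(t_obs, null_t, scale=1.0):
    """Fraction of (optionally rescaled) null T-values at or above t_obs.

    `scale` is a hypothetical placeholder: it would carry a small-sample null
    distribution over to a larger sample size; its value is not given here.
    """
    return float(np.mean(scale * np.asarray(null_t) >= t_obs))
```

A typical use would be to compute `t_obs = energy_test_t(a, b)` on the data, build `null_t = null_t_values(a, b)`, and read off `p_value(t_obs, null_t)`. The pairwise sums cost time and memory quadratic in the sample size, which is why building the null distribution directly from large samples is expensive.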
Original language: English
Article number: P04011
Pages (from-to): 1-8
Number of pages: 8
Journal: Journal of Instrumentation
Volume: 13
Publication status: Published - 6 Apr 2018
