Retrieval of Experiments by Efficient Estimation of Marginal Likelihood

Sohan Seth, John Shawe-Taylor, Samuel Kaski

Research output: Working paper

Abstract

We study the task of retrieving relevant experiments given a query experiment. By experiment, we mean a collection of measurements from a set of `covariates' and the associated `outcomes'. While similar experiments can be retrieved by comparing available `annotations', this approach ignores the valuable information available in the measurements themselves. To incorporate this information in the retrieval task, we suggest employing a retrieval metric that utilizes probabilistic models learned from the measurements. We argue that such a metric is a sensible measure of similarity between two experiments since it permits inclusion of experiment-specific prior knowledge. However, accurate models are often not analytical, and one must resort to storing posterior samples which demands considerable resources. Therefore, we study strategies to select informative posterior samples to reduce the computational load while maintaining the retrieval performance. We demonstrate the efficacy of our approach on simulated data with simple linear regression as the models, and real world datasets.
Original languageEnglish
PublisherArXiv
Number of pages11
Publication statusPublished - 19 Feb 2014

Fingerprint Dive into the research topics of 'Retrieval of Experiments by Efficient Estimation of Marginal Likelihood'. Together they form a unique fingerprint.

Cite this