Inference of Causal Relationships between Biomarkers and Outcomes in High Dimensions

Felix Agakov, Paul McKeigue, Jon Krohn, Jonathan Flint

Research output: Contribution to journalArticlepeer-review

Abstract / Description of output

We describe a unified computational framework for learning causal dependencies between genotypes, biomarkers, and phenotypic outcomes from large-scale data. In contrast to previous studies, our framework allows for noisy measurements, hidden confounders, missing data, and pleiotropic effects of genotypes on outcomes. The method exploits the use of genotypes as “instrumentalvariables”toinfercausalassociationsbetweenphenotypicbiomarkersandoutcomes,withoutrequiringtheassumption that genotypic effects are mediated only through the observed biomarkers. The framework builds on sparse linear methods developed in statistics and machine learning and modified here for inferring structures of richer networks with latent variables. Where the biomarkers are gene transcripts, the method can be used for fine mapping of quantitative trait loci (QTLs) detected ingeneticlinkagestudies. Todemonstrateourmethod,weexamined effects of gene transcript levels in the liver on plasma HDL cholesterol levels in a sample of 260 mice from a heterogeneous stock.
Original languageEnglish
Pages (from-to)1-8
Number of pages8
JournalJournal of Systemics, Cybernetics, and Informatics
Volume9
Issue number6
Publication statusPublished - 2011

Keywords / Materials (for Non-textual outputs)

  • sparse linear models
  • causality
  • structure learning
  • Bayesian networks
  • Mendelian randomization
  • instrumental variables

Fingerprint

Dive into the research topics of 'Inference of Causal Relationships between Biomarkers and Outcomes in High Dimensions'. Together they form a unique fingerprint.

Cite this