Pathogenic loss-of-function, gain-of-function and dominant-negative mutations have profoundly different effects on protein structure: implications for variant effect prediction

  • Ben Livesey (Creator)
  • Joseph Marsh (Creator)
  • Lukas Gerasimavicius (Creator)

Dataset

Abstract

The dataset contains all the protein variants, their annotations and variant effect predictor scores used in the analysis outlined in our study. Abstract of the study: Most known pathogenic mutations occur in protein-coding regions of DNA and change the way proteins are made. Taking protein structure into account has therefore provided great insight into the molecular mechanisms underlying human genetic disease. While there has been much focus on how mutations can disrupt protein structure and thus cause a loss of function (LOF), alternative mechanisms, specifically dominant-negative (DN) and gain-of-function (GOF) effects, are less understood. Here, we have investigated the protein-level effects of pathogenic missense mutations associated with different molecular mechanisms. We observe striking differences between recessive vs dominant, and LOF vs non-LOF mutations, with dominant, non-LOF disease mutations having much milder effects on protein structure, and DN mutations being highly enriched at protein interfaces. We also find that nearly all computational variant effect predictors underperform on non-LOF mutations, even those based solely on sequence conservation. However, we do find that non-LOF mutations could potentially be identified by their tendency to cluster in space. Overall, our work suggests that many pathogenic mutations that act via DN and GOF mutations are likely being missed by current variant prioritisation strategies, but that there is considerable scope to improve computational predictions through consideration of molecular disease mechanisms.

Data Citation

Gerasimavicius, Lukas; Livesey, Benjamin; Marsh, Joseph. (2021). Pathogenic loss-of-function, gain-of-function and dominant-negative mutations have profoundly different effects on protein structure: implications for variant effect prediction, [dataset]. University of Edinburgh. College of Medicine & Veterinary Medicine. Institute of Genetics and Cancer. https://doi.org/10.7488/ds/3153.
Date made available22 Oct 2021
PublisherEdinburgh DataShare

Cite this