Models incorporating chromatin modification data identify functionally important p53 binding sites

Ji-Hyun Lim, Richard D Iggo, Daniel Barker

Research output: Contribution to journalArticlepeer-review

Abstract

Genome-wide prediction of transcription factor binding sites is notoriously difficult. We have developed and applied a logistic regression approach for prediction of binding sites for the p53 transcription factor that incorporates sequence information and chromatin modification data. We tested this by comparison of predicted sites with known binding sites defined by chromatin immunoprecipitation (ChIP), by the location of predictions relative to genes, by the function of nearby genes and by analysis of gene expression data after p53 activation. We compared the predictions made by our novel model with predictions based only on matches to a sequence position weight matrix (PWM). In whole genome assays, the fraction of known sites identified by the two models was similar, suggesting that there was little to be gained from including chromatin modification data. In contrast, there were highly significant and biologically relevant differences between the two models in the location of the predicted binding sites relative to genes, in the function of nearby genes and in the responsiveness of nearby genes to p53 activation. We propose that these contradictory results can be explained by PWM and ChIP data reflecting primarily biophysical properties of protein–DNA interactions, whereas chromatin modification data capture biologically important functional information.
Original languageEnglish
Pages (from-to)5582-5593
Number of pages12
JournalNucleic Acids Research
Volume41
Issue number11
Early online date17 Apr 2013
DOIs
Publication statusPublished - 1 Jun 2013

Fingerprint

Dive into the research topics of 'Models incorporating chromatin modification data identify functionally important p53 binding sites'. Together they form a unique fingerprint.

Cite this