Projects per year
Abstract / Description of output
Background:
Structural variants (SVs) are known to play important roles in a variety of cancers, but their origins and functional consequences are still poorly understood. Many SVs are thought to emerge from errors in the repair processes following DNA double strand breaks (DSBs).
Results:
We used experimentally quantified DSB frequencies in cell lines with matched chromatin and sequence features to derive the first quantitative genome-wide models of DSB susceptibility. These models are accurate and provide novel insights into the mutational mechanisms generating DSBs. Models trained in one cell type can be successfully applied to others, but a substantial proportion of DSBs appear to reflect cell type-specific processes. Using model predictions as a proxy for susceptibility to DSBs in tumors, many SV-enriched regions appear to be poorly explained by selectively neutral mutational bias alone. A substantial number of these regions show unexpectedly high SV breakpoint frequencies given their predicted susceptibility to mutation and are therefore credible targets of positive selection in tumors. These putatively positively selected SV hotspots are enriched for genes previously shown to be oncogenic. In contrast, several hundred regions across the genome show unexpectedly low levels of SVs, given their relatively high susceptibility to mutation. These novel coldspot regions appear to be subject to purifying selection in tumors and are enriched for active promoters and enhancers.
Conclusions:
We conclude that models of DSB susceptibility offer a rigorous approach to the inference of SVs putatively subject to selection in tumors.
Structural variants (SVs) are known to play important roles in a variety of cancers, but their origins and functional consequences are still poorly understood. Many SVs are thought to emerge from errors in the repair processes following DNA double strand breaks (DSBs).
Results:
We used experimentally quantified DSB frequencies in cell lines with matched chromatin and sequence features to derive the first quantitative genome-wide models of DSB susceptibility. These models are accurate and provide novel insights into the mutational mechanisms generating DSBs. Models trained in one cell type can be successfully applied to others, but a substantial proportion of DSBs appear to reflect cell type-specific processes. Using model predictions as a proxy for susceptibility to DSBs in tumors, many SV-enriched regions appear to be poorly explained by selectively neutral mutational bias alone. A substantial number of these regions show unexpectedly high SV breakpoint frequencies given their predicted susceptibility to mutation and are therefore credible targets of positive selection in tumors. These putatively positively selected SV hotspots are enriched for genes previously shown to be oncogenic. In contrast, several hundred regions across the genome show unexpectedly low levels of SVs, given their relatively high susceptibility to mutation. These novel coldspot regions appear to be subject to purifying selection in tumors and are enriched for active promoters and enhancers.
Conclusions:
We conclude that models of DSB susceptibility offer a rigorous approach to the inference of SVs putatively subject to selection in tumors.
Original language | English |
---|---|
Article number | 28 |
Journal | Genome Biology |
Volume | 20 |
Early online date | 8 Feb 2019 |
DOIs | |
Publication status | E-pub ahead of print - 8 Feb 2019 |
Keywords / Materials (for Non-textual outputs)
- : Double strand break
- Chromatin
- Modeling
Fingerprint
Dive into the research topics of 'Modeling double strand break susceptibility to interrogate structural variation in cancer'. Together they form a unique fingerprint.Projects
- 2 Finished
-
-
Precision medicine and the mutational landscape of high grade serous ovarian cancer
1/02/18 → 31/01/21
Project: Research
Datasets
-
Supplementary files for study on modeling DSB with random forests
Ballinger, T. (Creator), Edinburgh DataShare, 5 May 2021
DOI: 10.7488/ds/2365
Dataset