Edinburgh Research Explorer

Statistical modeling of isoform splicing dynamics from RNA-seq time series data

Research output: Contribution to journal › Article

Original language: English
Pages (from-to): 2965-2972
Number of pages: 8
Journal: Bioinformatics
Volume: 32
Issue number: 19
Early online date: 17 Jun 2016
State: Published - 1 Oct 2016

Abstract

Motivation: Isoform quantification is an important goal of RNA-seq experiments, yet it remains problematic for genes with low expression or several isoforms. These difficulties may in principle be ameliorated by exploiting correlated experimental designs, such as time series or dose-response experiments. Time series RNA-seq experiments, in particular, are becoming increasingly popular, yet there are no methods that explicitly leverage the experimental design to improve isoform quantification.

Results: Here, we present DICEseq, the first isoform quantification method tailored to correlated RNA-seq experiments. DICEseq explicitly models the correlations between different RNA-seq experiments to aid the quantification of isoforms across experiments. Numerical experiments on simulated datasets show that DICEseq yields more accurate results than state-of-the-art methods, an advantage that can become considerable at low coverage levels. On real datasets, our results show that DICEseq provides substantially more reproducible and robust quantifications, increasing the correlation of estimates from replicate datasets by up to 10% on genes with low or moderate expression levels (bottom third of all genes). Furthermore, DICEseq makes it possible to quantify the trade-off between temporal sampling of RNA and depth of sequencing, frequently an important choice when planning experiments. Our results have strong implications for the design of RNA-seq experiments, and offer a novel tool for improved analysis of such datasets.

Availability and Implementation: Python code is freely available at http://diceseq.sf.net.
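The idea of borrowing strength across correlated time points can be illustrated with a small sketch. It is not DICEseq's implementation and does not use its API. It assumes, for illustration only, a Gaussian process prior with a squared-exponential kernel over the logit of one isoform's proportion, and it smooths noisy per-time-point estimates with the standard GP posterior mean. The function names, the kernel choice, and all parameter values are hypothetical; the abstract does not specify the model.

```python
import numpy as np


def rbf_kernel(t1, t2, length_scale, variance):
    """Squared-exponential covariance between two sets of time points."""
    diff = t1[:, None] - t2[None, :]
    return variance * np.exp(-0.5 * (diff / length_scale) ** 2)


def smooth_isoform_proportions(times, psi_hat, noise_var,
                               length_scale=2.0, variance=1.0):
    """Smooth independent per-time-point isoform proportion estimates.

    Hypothetical helper, not part of DICEseq. Assumes a GP prior on the
    logit of the proportion and Gaussian noise on the logit-scale estimates.
    """
    times = np.asarray(times, dtype=float)
    # Clip away from 0 and 1 so the logit transform stays finite.
    psi = np.clip(np.asarray(psi_hat, dtype=float), 1e-3, 1 - 1e-3)
    y = np.log(psi / (1.0 - psi))
    prior_mean = y.mean()  # constant prior mean taken from the data

    K = rbf_kernel(times, times, length_scale, variance)
    # noise_var may be a scalar or one value per time point
    # (larger values model lower sequencing coverage).
    noise = np.broadcast_to(np.asarray(noise_var, dtype=float), times.shape)

    # GP posterior mean at the observed time points.
    post = prior_mean + K @ np.linalg.solve(K + np.diag(noise), y - prior_mean)
    return 1.0 / (1.0 + np.exp(-post))  # back to the proportion scale


# Made-up noisy estimates for one isoform at six time points.
times = np.array([0.0, 1.0, 2.0, 3.0, 4.0, 5.0])
psi_hat = np.array([0.30, 0.55, 0.35, 0.60, 0.50, 0.70])
smoothed = smooth_isoform_proportions(times, psi_hat, noise_var=0.5)
```

In this sketch a larger `noise_var` pulls each estimate more strongly toward its temporal neighbours, which gives a rough intuition for why modeling the correlations matters most at low coverage.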

ID: 28992447