Abstract / Description of output
Deep learning, a powerful methodology for data-driven modelling, has been shown to be useful in tackling several problems in the biomedical domain. However, deep neural architectures lack interpretability of how predictions from them are made on any test input. While several approaches to "opening the black box" are being developed, their application to biological and medical data is very much as its infancy. Here, we consider the specific problem of protein secondary structure prediction using the techniques of saliency maps to explain decisions of a deep neural network. The analysis leads to two important observations: (a) one-hot-encoded amino-acids are irrelevant in the presence of PSSM values as extra features; and (b) in predicting α-helices at any position, amino-acids to the right are far more important than those to the left. The latter observation may have a biological basis relating to the synthesis of proteins by ribosome movement from left to right, sequentially adding amino-acids.
Original language | English |
---|---|
Title of host publication | ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) |
Publisher | Institute of Electrical and Electronics Engineers |
Pages | 1249-1253 |
Number of pages | 5 |
ISBN (Electronic) | 978-1-4799-8131-1 |
ISBN (Print) | 978-1-4799-8132-8 |
DOIs | |
Publication status | Published - 17 May 2019 |
Event | 2019 IEEE International Conference on Acoustics, Speech, and Signal Processing - Brighton, United Kingdom Duration: 12 May 2019 → 17 May 2019 Conference number: 44 https://www.2019.ieeeicassp.org/2019.ieeeicassp.org/index.html |
Publication series
Name | |
---|---|
ISSN (Print) | 1520-6149 |
ISSN (Electronic) | 2379-190X |
Conference
Conference | 2019 IEEE International Conference on Acoustics, Speech, and Signal Processing |
---|---|
Abbreviated title | ICASSP 2019 |
Country/Territory | United Kingdom |
City | Brighton |
Period | 12/05/19 → 17/05/19 |
Internet address |
Keywords / Materials (for Non-textual outputs)
- interpretability
- saliency maps
- protein secondary structure prediction
- convolutional neural networks