Improving representations of genomic sequence motifs in convolutional networks with exponential activations.

Nat Mach Intell

Department of Biostatistics, T.H. Chan School of Public Health, Harvard University, Boston, MA, USA.

Published: March 2021

Deep convolutional neural networks (CNNs) trained on regulatory genomic sequences tend to build representations in a distributed manner, making it challenging to extract learned features that are biologically meaningful, such as sequence motifs. Here we perform a comprehensive analysis on synthetic sequences to investigate the role that CNN activations play in model interpretability. We show that applying an exponential activation to first-layer filters consistently leads to more interpretable and robust representations of motifs than other commonly used activations. Strikingly, we demonstrate that CNNs with better test performance do not necessarily yield more interpretable representations with attribution methods. We find that CNNs with exponential activations significantly improve the efficacy of recovering biologically meaningful representations with attribution methods. We demonstrate that these results generalise to real DNA sequences across several datasets. Together, this work demonstrates how a small modification to existing CNNs, i.e. setting exponential activations in the first layer, can significantly improve the robustness and interpretability of learned representations, directly in convolutional filters and indirectly with attribution methods.
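
The modification named in the abstract, an exponential activation on first-layer filters, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the motif, the example sequence, the hand-built filter and all helper names are hypothetical, chosen only to show how an exponential activation treats a filter scan differently from a ReLU.

```python
import numpy as np

ALPHABET = "ACGT"

def one_hot(seq):
    """Encode a DNA string as an (L, 4) one-hot array."""
    idx = [ALPHABET.index(base) for base in seq]
    return np.eye(4)[idx]

def conv1d_scan(x, filt):
    """Valid cross-correlation of an (L, 4) sequence with a (k, 4) filter."""
    k = filt.shape[0]
    n_windows = x.shape[0] - k + 1
    return np.array([np.sum(x[i:i + k] * filt) for i in range(n_windows)])

def relu(z):
    return np.maximum(z, 0.0)

def exponential(z):
    return np.exp(z)

# Hypothetical first-layer filter for the motif "TGCA":
# +1 where the base matches the motif, -1 everywhere else.
motif = "TGCA"
filt = 2.0 * one_hot(motif) - 1.0

# Hypothetical input sequence containing the motif at offset 2.
seq = "AATGCATTGAAC"
scores = conv1d_scan(one_hot(seq), filt)

relu_out = relu(scores)        # grows linearly with the match score
exp_out = exponential(scores)  # grows exponentially with the match score

best = int(np.argmax(exp_out))  # window with the strongest activation
print("pre-activation scores:", scores)
print("ReLU:", relu_out)
print("exp :", np.round(exp_out, 3))
print("strongest window starts at offset", best)
```

In this toy scan the exponential output is large only at the full motif match and stays close to zero for poorly matching windows. Whether this is the mechanism behind the interpretability gains reported above is for the paper itself to establish; the sketch only shows the activation swap.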

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8315445
DOI: http://dx.doi.org/10.1038/s42256-020-00291-x
