Multinomial Convolutions for Joint Modeling of Regulatory Motifs and Sequence Activity Readouts.

Minjun Park Salvi Singh Samin Rahman Khan Mohammed Abid Abrar Francisco Grisanti M Sohel Rahman Md Abul Hassan Samee

Genes (Basel)

Department of Integrative Physiology, Baylor College of Medicine, Houston, TX 77030, USA.

Published: September 2022

A common goal in the convolutional neural network (CNN) modeling of genomic data is to discover specific sequence motifs. Post hoc analysis methods aid in this task but are dependent on parameters whose optimal values are unclear and applying the discovered motifs to new genomic data is not straightforward. As an alternative, we propose to learn convolutions as multinomial distributions, thus streamlining interpretable motif discovery with CNN model fitting. We developed MuSeAM (Multinomial CNNs for Sequence Activity Modeling) by implementing multinomial convolutions in a CNN model. Through benchmarking, we demonstrate the efficacy of MuSeAM in accurately modeling genomic data while fitting multinomial convolutions that recapitulate known transcription factor motifs.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9498894	PMC
http://dx.doi.org/10.3390/genes13091614	DOI Listing

Publication Analysis

Top Keywords

multinomial convolutions

genomic data

sequence activity

modeling genomic

cnn model

multinomial

convolutions joint

modeling

joint modeling

modeling regulatory

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!