Inherently interpretable position-aware convolutional motif kernel networks for biological sequencing data.

Sci Rep

Methods in Medical Informatics, Department of Computer Science, University of Tübingen, Sand 14, Tübingen, 72076, Germany.

Published: October 2023

Artificial neural networks show promising performance in detecting correlations within data that are associated with specific outcomes. However, the black-box nature of such models can hinder the knowledge advancement in research fields by obscuring the decision process and preventing scientist to fully conceptualize predicted outcomes. Furthermore, domain experts like healthcare providers need explainable predictions to assess whether a predicted outcome can be trusted in high stakes scenarios and to help them integrating a model into their own routine. Therefore, interpretable models play a crucial role for the incorporation of machine learning into high stakes scenarios like healthcare. In this paper we introduce Convolutional Motif Kernel Networks, a neural network architecture that involves learning a feature representation within a subspace of the reproducing kernel Hilbert space of the position-aware motif kernel function. The resulting model enables to directly interpret and evaluate prediction outcomes by providing a biologically and medically meaningful explanation without the need for additional post-hoc analysis. We show that our model is able to robustly learn on small datasets and reaches state-of-the-art performance on relevant healthcare prediction tasks. Our proposed method can be utilized on DNA and protein sequences. Furthermore, we show that the proposed method learns biologically meaningful concepts directly from data using an end-to-end learning scheme.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10567796PMC
http://dx.doi.org/10.1038/s41598-023-44175-7DOI Listing

Publication Analysis

Top Keywords

motif kernel
12
convolutional motif
8
kernel networks
8
high stakes
8
stakes scenarios
8
proposed method
8
inherently interpretable
4
interpretable position-aware
4
position-aware convolutional
4
kernel
4

Similar Publications

Increasing the diversity of bio-based polymers is needed to address the combined problems of plastic pollution and greenhouse gas emissions. The magnitude of the problems necessitates rapid discovery of new materials; however, identification of appropriate chemistries maybe slow using current iterative methods. Machine learning (ML) methods could significantly expedite new material discovery and property identification.

View Article and Find Full Text PDF

Systematic Survey and Analysis Reveal Jasmonate ZIM-Domain Gene Family in Under High Temperature.

Plants (Basel)

November 2024

School of Pharmaceutical Sciences, Academy of Chinese Medical Sciences, Zhejiang Chinese Medical University, Hangzhou 310053, China.

Article Synopsis
  • Jasmonate ZIM-domain (JAZ) proteins are key regulators in the jasmonic acid (JA) signaling pathway, impacting plant defense, growth, and crosstalk with other hormones under stress conditions.
  • This study identified 20 JAZ family proteins, organized into six groups, located primarily in the nucleus, and examined their gene expression across different plant organs under high-temperature stress through transcriptomic analysis.
  • Findings revealed widespread differential expression of JAZ genes, with specific roles linked to stress responses, particularly highlighting one gene's unique regulation by heat stress and plant hormones like ABA and MeJA, paving the way for further exploration of JAZ family functions in plant stress adaptation.
View Article and Find Full Text PDF

Characterization of pecan PEBP family genes and the potential regulation role of CiPEBP-like1 in fatty acid synthesis.

Plant Sci

February 2025

State Key Laboratory of Subtropical Silviculture, Zhejiang A&F University, Lin'an District, Hangzhou, Zhejiang 311300, China. Electronic address:

Phosphatidyl ethanolamine-binding protein (PEBP) plays important roles in plant growth and development. However, few studies have investigated the PEBP gene family in pecan (Carya illinoinensis), particularly the function of the PEBP-like subfamily. In this study, we identified 12 PEBP genes from the pecan genome and classified them into four subfamilies: MFT-like, FT-like, TFL1-like and PEBP-like.

View Article and Find Full Text PDF

Terahertz Imaging Detects Oral Cariogenic Microbial Domains Characteristics.

J Dent Res

December 2024

State Key Laboratory of Oral Diseases & National Center for Stomatology & National Clinical Research Center for Oral Diseases, West China Hospital of Stomatology, Sichuan University, Chengdu, Sichuan, China.

Dental caries, associated with plaque biofilm, is highly prevalent and significantly burdens public health. is the main cariogenic bacteria that adheres to the tooth surface and forms an abundant extracellular polysaccharide matrix (EPS) as a cariogenic biofilm scaffold. RNase III-encoding gene () and a putative chromosome segregation protein-encoding gene () are potentially associated with EPS production.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!