Sequence-based neural networks can learn to make accurate predictions from large biological datasets, but model interpretation remains challenging. Many existing feature attribution methods are optimized for continuous rather than discrete input patterns and assess individual feature importance in isolation, making them ill-suited for interpreting non-linear interactions in molecular sequences. Building on work in computer vision and natural language processing, we developed an approach based on deep learning - Scrambler networks - wherein the most salient sequence positions are identified with learned input masks. Scramblers learn to predict Position-Specific Scoring Matrices () where unimportant nucleotides or residues are scrambled by raising their entropy. We apply Scramblers to interpret the effects of genetic variants, uncover non-linear interactions between cis-regulatory elements, explain binding specificity for protein-protein interactions, and identify structural determinants of designed proteins. We show that Scramblers enable efficient attribution across large datasets and result in high-quality explanations, often outperforming state-of-the-art methods.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9373874 | PMC |
http://dx.doi.org/10.1038/s42256-021-00428-6 | DOI Listing |
Physiol Rev
January 2025
Department of Sport, Exercise and Health, University of Basel, Basel, Switzerland.
Physical activity is a meaningful part of life, which starts before birth and lasts until death. There are many health benefits to be derived from physical activity, hence, regular engagement is recommended on a weekly basis. However, these recommendations are often not met.
View Article and Find Full Text PDFPLoS One
January 2025
Department of Computer Science, Khalifa University, Abu Dhabi, UAE.
A methodology is proposed, which addresses the caveat that line-of-sight emission spectroscopy presents in that it cannot provide spatially resolved temperature measurements in non-homogeneous temperature fields. The aim of this research is to explore the use of data-driven models in measuring temperature distributions in a spatially resolved manner using emission spectroscopy data. Two categories of data-driven methods are analyzed: (i) Feature engineering and classical machine learning algorithms, and (ii) end-to-end convolutional neural networks (CNN).
View Article and Find Full Text PDFPLoS One
January 2025
Data Management, Modelling and Geo-Information Unit, International Centre of Insect Physiology and Ecology, Kenya.
Organic fertilizers have been identified as a sustainable agricultural practice that can enhance productivity and reduce environmental impact. Recently, the European Union defined and accepted insect frass as an innovative and emerging organic fertilizer. In the wider domain of organic fertilizers, mathematical and computational models have been developed to optimize their production and application conditions.
View Article and Find Full Text PDFSci Adv
January 2025
Department of Zoology, University of Cambridge, Cambridge, UK.
The evolutionary origin of the vertebrate brain remains a major subject of debate, as its development from a dorsal tubular neuroepithelium is unique to chordates. To shed light on the evolutionary emergence of the vertebrate brain, we compared anterior neuroectoderm development across deuterostome species, using available single-cell datasets from sea urchin, amphioxus, and zebrafish embryos. We identified a conserved gene co-expression module, comparable to the anterior gene regulatory network (aGRN) controlling apical organ development in ambulacrarians, and spatially mapped it by multiplexed in situ hybridization to the developing retina and hypothalamus of chordates.
View Article and Find Full Text PDFPLoS One
January 2025
School of Information Science and Engineering, Xinjiang University, Urumqi, China.
Anomaly detection is crucial in areas such as financial fraud identification, cybersecurity defense, and health monitoring, as it directly affects the accuracy and security of decision-making. Existing generative adversarial nets (GANs)-based anomaly detection methods overlook the importance of local density, limiting their effectiveness in detecting anomaly objects in complex data distributions. To address this challenge, we introduce a generative adversarial local density-based anomaly detection (GALD) method, which combines the data distribution modeling capabilities of GANs with local synthetic density analysis.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!