Designing biological sequences is an important challenge that requires satisfying complex constraints and thus is a natural problem to address with deep generative modeling. Diffusion generative models have achieved considerable success in many applications. Score-based generative stochastic differential equations (SDE) model is a continuous-time diffusion model framework that enjoys many benefits, but the originally proposed SDEs are not naturally designed for modeling discrete data. To develop generative SDE models for discrete data such as biological sequences, here we introduce a diffusion process defined in the probability simplex space with stationary distribution being the Dirichlet distribution. This makes diffusion in continuous space natural for modeling discrete data. We refer to this approach as Dirchlet diffusion score model. We demonstrate that this technique can generate samples that satisfy hard constraints using a Sudoku generation task. This generative model can also solve Sudoku, including hard puzzles, without additional training. Finally, we applied this approach to develop the first human promoter DNA sequence design model and showed that designed sequences share similar properties with natural promoter sequences.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10246113PMC

Publication Analysis

Top Keywords

discrete data
12
diffusion score
8
score model
8
biological sequences
8
modeling discrete
8
model
6
generative
5
diffusion
5
dirichlet diffusion
4
model biological
4

Similar Publications

Parent-Adult Child Relationships and Repartnering After Gray Divorce.

J Gerontol B Psychol Sci Soc Sci

January 2025

Ryan White Center for Pediatric Infectious Diseases and Global Health, Indiana University School of Medicine. Indianapolis, Indiana, USA.

Objectives: The rise in gray divorce has catalyzed repartnering in later life. However, the antecedents of older adult repartnering remain poorly understood, particularly the potential role of adult children. A form of ambiguous loss, marital disruption often leads to family boundary ambiguity, thereby weakening family ties.

View Article and Find Full Text PDF

Modeling the effects of thin filament near-neighbor cooperative interactions in mammalian myocardium.

J Gen Physiol

March 2025

Department of Animal, Veterinary, and Food Sciences, College of Agricultural and Life Sciences, University of Idaho, Moscow, ID, USA.

The mechanisms underlying cooperative activation and inactivation of myocardial force extend from local, near-neighbor interactions involving troponin-tropomyosin regulatory units (RU) and crossbridges (XB) to more global interactions across the sarcomere. To better understand these mechanisms in the hearts of small and large mammals, we undertook a simplified mathematical approach to assess the contribution of three types of near-neighbor cooperative interactions, i.e.

View Article and Find Full Text PDF

Affective experiences within academic contexts significantly influence educational outcomes. Despite this, the literature reveals a gap in generalising these effects to specific classroom activities, partly arising from the absence of suitable instruments to measure emotions in situational educational scenarios. Our study introduces an experience sampling method to measure sixteen discrete emotional states, deriving two scales for positive and negative activating emotions.

View Article and Find Full Text PDF

Introduction: Children are among the most vulnerable populations affected by armed conflicts, yet there is limited data on the preparedness of military medical personnel to care for pediatric combat trauma casualties in austere or large-scale combat operations. This study aimed to assess the confidence, training needs, and resource requirements of military medical providers who have managed pediatric patients during deployment.

Materials And Methods: This IRB-exempt, cross-sectional mixed-methods study used a survey created via a modified Delphi method with input from subject matter experts.

View Article and Find Full Text PDF

Daylight Saving Time and Automobile Accidents: Evidence From Chile.

Health Econ

January 2025

Big Data Analysis Department, Central Bank of Chile, Santiago, Chile.

Under the evidence that the Daylight Saving Time (DST) regime does not accomplish its primary goal of saving energy, I analyze one of the main side effects, automobile accidents in Chile between 2002 and 2018. I use a Regression Discontinuity Design (RDD) exploiting the discrete nature of the transition into DST and a Difference-in-Difference (DID) approach, taking advantage of the changes in dates that the policy starts and ends over the years. I find a 2.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!