Deep Learning for Reintegrating Biology.

Integr Comp Biol

Department of Electrical and Computer Engineering, University of Southern California, 3740 McClintock Avenue, Los Angeles, CA 90089, USA.

Published: February 2022

The goal of this vision paper is to investigate the possible role that advanced machine learning techniques, especially deep learning (DL), could play in the reintegration of various biological disciplines. To achieve this goal, a series of operational, but admittedly very simplistic, conceptualizations have been introduced: Life has been taken as a multidimensional phenomenon that inhabits three physical dimensions (time, space, and scale) and biological research as establishing connection between different points in the domain of life. Each of these points hence denotes a position in time, space, and scale at which a life phenomenon of interest takes place. Using these conceptualizations, fragmentation of biology can be seen as the result of too few and especially too short-ranged connections. Reintegrating biology could then be accomplished by establishing more, longer ranged connections. DL methods appear to be very well suited for addressing this particular need at this particular time. Notwithstanding the numerous unsubstantiated claims regarding the capabilities of AI, DL networks represent a major advance in the ability to find complex relationships inside large data sets that would have not been accessible with traditional data analytic methods or to a human observer. In addition, ongoing advances in the automation of taking measurements from phenomena on all levels of biological organization continue to increase the number of large quantitative data sets that are available. These increasingly common data sets could serve as anchor points for making long-range connections by virtue of DL. However, connections within the domain of life are likely to be structured in a highly nonuniform fashion and hence it is necessary to develop methods, for example, theoretical, computational, and experimental, to determine linkage of biological data sets most likely to provide useful insights on a biological problem using DL. Finally, specific DL approaches and architectures should be developed to match the needs of reintegrating biology.

Download full-text PDF

Source
http://dx.doi.org/10.1093/icb/icab015DOI Listing

Publication Analysis

Top Keywords

data sets
16
reintegrating biology
12
deep learning
8
time space
8
space scale
8
domain life
8
biological
5
data
5
learning reintegrating
4
biology
4

Similar Publications

Energy decomposition analysis (EDA) based on density functional theory (DFT) and self-consistent field (SCF) calculations has become widely used for understanding intermolecular interactions. This work reports a new approach to EDA for post-SCF wave functions based on closed-shell restricted second-order Mo̷ller-Plesset (MP2) together with an efficient implementation that generalizes the successful SCF-level second-generation absolutely localized molecular orbital EDA approach, ALMO-EDA-II, and improves upon MP2 ALMO-EDA-I. The new MP2 ALMO-EDA-II provides distinct energy contributions for a frozen interaction energy containing permanent electrostatics and Pauli repulsions, polarized energy-yielding induced electrostatics, dispersion-corrected energy, and the fully relaxed energy, which describes charge transfer.

View Article and Find Full Text PDF

Extending Density-Corrected Density Functional Theory to Large Molecular Systems.

J Phys Chem Lett

January 2025

Department of Chemistry, Yonsei University, 50 Yonsei-ro, Seodaemun-gu, Seoul 03722, Korea.

Practical density-corrected density functional theory (DC-DFT) calculations rely on Hartree-Fock (HF) densities, which can be computationally expensive for systems with over a hundred atoms. We extend the applicability of HF-DFT using the dual-basis method, where the density matrix from a smaller basis set is used to estimate the HF solution on a larger basis set. Benchmarks on many systems, including the GMTKN55 database for main-group chemistry, and the L7 and S6L data sets for large molecular systems demonstrate the efficacy of our approach.

View Article and Find Full Text PDF

Background: The accurate and prompt diagnosis of infections is essential for improving patient outcomes and preventing bacterial drug resistance. Host gene expression profiling as an approach to infection diagnosis holds great potential in assisting early and accurate diagnosis of infection.

Methods: To improve the precision of infection diagnosis, we developed InfectDiagno, a rank-based ensemble machine learning algorithm for infection diagnosis via host gene expression patterns.

View Article and Find Full Text PDF

The effects of warming on loggerhead turtle nesting counts.

J Anim Ecol

January 2025

Faculdade de Ciências da Universidade do Porto, Centro de Investigação em Ciências Geo-Espaciais (CICGE), Vila Nova de Gaia, Portugal.

Global trends in marine turtle nesting numbers vary by region, influenced by environmental or anthropogenic factors. Our study investigates the potential role of past temperature fluctuations on these trends, particularly whether warmer beaches are linked to increased nesting due to higher female production (since sea turtles have temperature-dependent sex determination). We selected the loggerhead turtle (Caretta caretta) due to its wide distribution, strong philopatry and vulnerability to environmental changes.

View Article and Find Full Text PDF

Permafrost soils store vast amounts of organic carbon, and their thawing due to climate warming accelerates the release of carbon as methane and carbon dioxide, exacerbating global climate change. Understanding the distribution of greenhouse gases trapped in these soils and predicting their behavior upon thawing is essential for accurately modeling climate feedbacks. This study presents an integrated biogeochemical and microbial dataset from ~1.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!