Molecular machine learning has matured rapidly over the last few years. Improved methods and the availability of larger datasets have enabled machine learning algorithms to make increasingly accurate predictions about molecular properties. However, algorithmic progress has been limited by the lack of a standard benchmark for comparing the efficacy of proposed methods; most new algorithms are benchmarked on different datasets, making it challenging to gauge their quality. This work introduces MoleculeNet, a large-scale benchmark for molecular machine learning. MoleculeNet curates multiple public datasets, establishes metrics for evaluation, and offers high-quality open-source implementations of multiple previously proposed molecular featurization and learning algorithms (released as part of the DeepChem open-source library). MoleculeNet benchmarks demonstrate that learnable representations are powerful tools for molecular machine learning and broadly offer the best performance. However, this result comes with caveats. Learnable representations still struggle with complex tasks under data scarcity and with highly imbalanced classification. For quantum mechanical and biophysical datasets, the use of physics-aware featurizations can be more important than the choice of a particular learning algorithm.
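The abstract's points about establishing evaluation metrics and about highly imbalanced classification can be illustrated with a small sketch. The code below is not taken from MoleculeNet or DeepChem; the toy labels, the helper names `accuracy` and `roc_auc`, and the choice of ROC-AUC as the comparison metric are all assumptions made for illustration. It shows why plain accuracy can mislead when active compounds are rare.

```python
# Illustrative only: toy data and helper names are hypothetical,
# not part of MoleculeNet or DeepChem.

def accuracy(labels, scores, threshold=0.5):
    """Fraction of examples whose thresholded score matches the label."""
    preds = [1 if s >= threshold else 0 for s in scores]
    return sum(p == y for p, y in zip(preds, labels)) / len(labels)


def roc_auc(labels, scores):
    """Rank-based ROC-AUC: the probability that a randomly chosen positive
    is scored above a randomly chosen negative (ties count as one half)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("ROC-AUC needs at least one positive and one negative")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical screen: 2 active compounds among 100.
labels = [1] * 2 + [0] * 98

# A "model" that ignores its input and always predicts inactive.
constant_scores = [0.0] * 100

print("accuracy:", accuracy(labels, constant_scores))
print("roc_auc: ", roc_auc(labels, constant_scores))
```

On this toy data the constant predictor is correct on 98 of 100 compounds, yet its ROC-AUC sits at chance level, which is one reason a benchmark has to fix its metric per dataset rather than leave the choice to each method's authors.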


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5868307
DOI: http://dx.doi.org/10.1039/c7sc02664a


