Automated electrosynthesis reaction mining with multimodal large language models (MLLMs).

Chem Sci

Department of Chemistry, University of Toronto, Lash Miller Chemical Laboratories 80 St. George Street ON M5S 3H6 Toronto Canada

Published: October 2024

Leveraging the chemical data available in legacy formats such as publications and patents is a significant challenge for the community. Automated reaction mining offers a promising solution to unleash this knowledge into a learnable digital form and therefore help expedite materials and reaction discovery. However, existing reaction mining toolkits are limited to single input modalities (text or images) and cannot effectively integrate heterogeneous data that is scattered across text, tables, and figures. In this work, we go beyond single input modalities and explore multimodal large language models (MLLMs) for the analysis of diverse data inputs for automated electrosynthesis reaction mining. We compiled a test dataset of 65 articles (MERMES-T24 set) and employed it to benchmark five prominent MLLMs against two critical tasks: (i) reaction diagram parsing and (ii) resolving cross-modality data interdependencies. The frontrunner MLLM achieved ≥96% accuracy in both tasks, with the strategic integration of single-shot visual prompts and image pre-processing techniques. We integrate this capability into a toolkit named MERMES (multimodal reaction mining pipeline for electrosynthesis). Our toolkit functions as an end-to-end MLLM-powered pipeline that integrates article retrieval, information extraction and multimodal analysis for streamlining and automating knowledge extraction. This work lays the groundwork for the increased utilization of MLLMs to accelerate the digitization of chemistry knowledge for data-driven research.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11462585PMC
http://dx.doi.org/10.1039/d4sc04630gDOI Listing

Publication Analysis

Top Keywords

reaction mining
20
automated electrosynthesis
8
electrosynthesis reaction
8
multimodal large
8
large language
8
language models
8
models mllms
8
single input
8
input modalities
8
reaction
7

Similar Publications

Undoped ruthenium oxide as a stable catalyst for the acidic oxygen evolution reaction.

Nat Commun

January 2025

WA School of Mines: Minerals, Energy and Chemical Engineering (WASM-MECE), Curtin University, Perth, WA, 6102, Australia.

Reducing green hydrogen production cost is critical for its widespread application. Proton-exchange-membrane water electrolyzers are among the most promising technologies, and significant research has been focused on developing more active, durable, and cost-effective catalysts to replace expensive iridium in the anode. Ruthenium oxide is a leading alternative while its stability is inadequate.

View Article and Find Full Text PDF

Background: For patients who experience atypical neurogenic pain thought to be complex regional pain syndrome (CRPS) after Dupuytren's fasciectomy early recognition has been reported to improve outcomes. Furthermore, given the progressive nature of Dupuytren's, individuals with a history of CRPS have been "at risk" for further surgical intervention.

Purpose: To familiarize therapists with a Budapest criteria (BC) checklist for early diagnosis of CRPS, describe how tracking sudomotor/vasomotor signs alongside differences in skin temperature were used to monitor vasomotor instability and intervention effectiveness for a patient with atypical pain after fasciectomy and to detail management of the same patient with a CRPS history who had collagenase clostridium histolyticum (CCH) injection of her other hand without exacerbating CRPS.

View Article and Find Full Text PDF

Alkali metal doping is a new and promising approach to enhance the photo/electrocatalytic activity of NiS-based catalyst systems. This work investigates the impact of sodium on the structural, electronic, and catalytic properties of NiS. Comprehensive characterization techniques demonstrate that Na-doping causes significant changes in the NiS lattice and surface chemistry translating into a larger bandgap than NiS.

View Article and Find Full Text PDF

First-principles study of CO and HO adsorption on the anatase TiO(101) surface: effect of Au doping.

Phys Chem Chem Phys

January 2025

Shanxi Coal International Energy Group Co., Ltd., Taiyuan 030000, China.

Photocatalytic reduction of CO will play a major role in future energy and environmental crisis. To investigate the adsorption mechanisms of CO and HO molecules involved in the catalytic process on the surface of anatase titanium dioxide 101 (TiO(101)) and the influence of Au atom doping on their adsorption, first-principles density functional theory calculations were used. The results show that 1.

View Article and Find Full Text PDF

Oxygen evolution reaction (OER) is a cornerstone of various electrochemical energy conversion and storage systems, including water splitting, CO/N reduction, reversible fuel cells, and rechargeable metal-air batteries. OER typically proceeds through three primary mechanisms: adsorbate evolution mechanism (AEM), lattice oxygen oxidation mechanism (LOM), and oxide path mechanism (OPM). Unlike AEM and LOM, the OPM proceeds via direct oxygen-oxygen radical coupling that can bypass linear scaling relationships of reaction intermediates in AEM and avoid catalyst structural collapse in LOM, thereby enabling enhanced catalytic activity and stability.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!