Efficiently extracting data from tables in the scientific literature is pivotal for building large-scale databases. However, tables in materials science papers take highly diverse forms, which makes rule-based extraction ineffective. To overcome this challenge, this study presents MaTableGPT, a GPT-based extractor of table data from the materials science literature. MaTableGPT relies on two groups of strategies: table data representation and table splitting, which improve GPT comprehension, and follow-up questions, which filter out hallucinated information. When applied to a large body of water splitting catalysis literature, MaTableGPT achieves an extraction accuracy (total F1 score) of up to 96.8%. Through comprehensive evaluation of GPT usage cost, labeling cost, and extraction accuracy for zero-shot, few-shot, and fine-tuning learning methods, the study presents a Pareto-front mapping in which few-shot learning is the most balanced solution, owing to both its high extraction accuracy (total F1 score >95%) and its low cost (a GPT usage cost of 5.97 US dollars and a labeling cost of 10 input/output paired examples). Statistical analyses of the database generated by MaTableGPT reveal the distribution of overpotential and elemental utilization across the catalysts reported in the water splitting literature.
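
The Pareto-front comparison described above can be illustrated with a minimal sketch. It assumes each learning method is summarized by three numbers (GPT usage cost, labeling cost, and total F1 score) and that a method is on the front when no other method is at least as good on all three and strictly better on one. The `Method` type, the `pareto_front` function, and every value in `candidates` are hypothetical placeholders for illustration; they are not the paper's code or measurements.

```python
from typing import List, NamedTuple


class Method(NamedTuple):
    name: str
    gpt_cost: float    # GPT usage cost, lower is better
    label_cost: float  # number of labeled examples, lower is better
    f1: float          # total F1 score, higher is better


def dominates(a: Method, b: Method) -> bool:
    """True if `a` is no worse than `b` everywhere and strictly better somewhere."""
    no_worse = (a.gpt_cost <= b.gpt_cost
                and a.label_cost <= b.label_cost
                and a.f1 >= b.f1)
    strictly_better = (a.gpt_cost < b.gpt_cost
                       or a.label_cost < b.label_cost
                       or a.f1 > b.f1)
    return no_worse and strictly_better


def pareto_front(methods: List[Method]) -> List[Method]:
    """Return the methods that no other method dominates."""
    return [m for m in methods
            if not any(dominates(o, m) for o in methods)]


# Placeholder values for illustration only; NOT taken from the paper.
candidates = [
    Method("A", gpt_cost=1.0, label_cost=0, f1=0.80),
    Method("B", gpt_cost=5.0, label_cost=10, f1=0.95),
    Method("C", gpt_cost=6.0, label_cost=20, f1=0.90),  # dominated by B
    Method("D", gpt_cost=20.0, label_cost=500, f1=0.97),
]

front_names = [m.name for m in pareto_front(candidates)]
print(front_names)
```

With these placeholder inputs, C drops out because B is cheaper on both cost axes and more accurate, while A, B, and D each remain because they win on at least one axis. Choosing a "most balanced" point among the survivors is a separate judgment that this sketch does not attempt.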

Source
http://dx.doi.org/10.1002/advs.202408221
