Recent advances in large language models (LLMs) such as ChatGPT and LLaMA have hinted at their potential to transform medical applications, yet their use in clinical settings often reveals limitations stemming from a lack of training on medical-specific data. To address this challenge, this study introduces Me-LLaMA, a novel family of medical LLMs comprising the foundation models Me-LLaMA 13/70B and their chat-enhanced versions, Me-LLaMA 13/70B-chat, developed through continual pre-training and instruction tuning of LLaMA2 on large medical datasets. Our methodology leverages a comprehensive domain-specific data suite: a large-scale continual pre-training dataset of 129B tokens, an instruction-tuning dataset of 214k samples, and a new medical evaluation benchmark (MIBE) covering six critical medical tasks across 12 datasets. Our extensive evaluation on MIBE shows that Me-LLaMA models achieve better overall performance than existing open-source medical LLMs in zero-shot, few-shot, and supervised settings. With task-specific instruction tuning, Me-LLaMA models outperform ChatGPT on 7 of 8 datasets and GPT-4 on 5 of 8 datasets. In addition, we investigated the catastrophic forgetting problem, and our results show that Me-LLaMA models mitigate it better than other open-source medical LLMs. Me-LLaMA is one of the largest open-source medical foundation LLMs trained on both biomedical and clinical data. It exhibits superior performance across both general and medical tasks compared to other open-source medical LLMs, making it an attractive choice for medical AI applications. We release our models, datasets, and evaluation scripts at: https://github.com/BIDS-Xu-Lab/Me-LLaMA.
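The abstract does not specify the prompt template used for the 214k-sample instruction-tuning dataset, so as an illustrative assumption, the sketch below renders a single instruction-tuning sample in the widely used Alpaca-style format (the `### Instruction:` / `### Input:` / `### Response:` sections are that convention's, not necessarily Me-LLaMA's):

```python
def format_instruction_sample(instruction: str, input_text: str, output: str) -> str:
    """Render one instruction-tuning sample as a single training string.

    Assumption: this follows the common Alpaca-style template; the exact
    Me-LLaMA prompt format is not given in the abstract.
    """
    if input_text:
        return (
            "Below is an instruction that describes a task, paired with an "
            "input that provides further context. Write a response that "
            "appropriately completes the request.\n\n"
            f"### Instruction:\n{instruction}\n\n"
            f"### Input:\n{input_text}\n\n"
            f"### Response:\n{output}"
        )
    return (
        "Below is an instruction that describes a task. Write a response "
        "that appropriately completes the request.\n\n"
        f"### Instruction:\n{instruction}\n\n"
        f"### Response:\n{output}"
    )

# Hypothetical medical sample for illustration only.
sample = format_instruction_sample(
    "Summarize the patient note.",
    "45-year-old male presenting with chest pain.",
    "Middle-aged man with chest pain.",
)
```

Strings produced this way would then be tokenized and used for supervised fine-tuning of the base model.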

Source:
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11142305
DOI: http://dx.doi.org/10.21203/rs.3.rs-4240043/v1

Similar Publications

Objective: Medical laboratory data, together with prescribing and hospitalisation records, are three of the most widely used electronic health records (EHRs) for data-driven health research. In Scotland, hospitalisation, prescribing, and death-register data are available nationally, whereas laboratory data are captured, stored, and reported by local health board systems with significant heterogeneity. For researchers and other users of this regionally curated data, working with laboratory datasets across regional cohorts requires substantial effort and time.


Spatially resolved omics (SRO) technologies enable the identification of cell types while preserving their organization within tissues. Application of such technologies offers the opportunity to delineate cell-type spatial relationships, particularly across different length scales, and enhance our understanding of tissue organization and function. To quantify such multi-scale cell-type spatial relationships, we present CRAWDAD, Cell-type Relationship Analysis Workflow Done Across Distances, as an open-source R package.
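As a toy illustration of quantifying cell-type spatial relationships across length scales (not the CRAWDAD algorithm itself, which is an R package with trend-based statistics), the hypothetical sketch below counts how many pairs of two cell types fall within each of several distances:

```python
import math

def neighbor_counts(cells, type_a, type_b, radii):
    """Count (type_a, type_b) cell pairs within each distance scale.

    cells: list of (x, y, cell_type) tuples.
    Returns {radius: pair_count} -- a crude stand-in for the multi-scale
    relationship statistics a tool like CRAWDAD computes; illustrative only.
    """
    a_pts = [(x, y) for x, y, t in cells if t == type_a]
    b_pts = [(x, y) for x, y, t in cells if t == type_b]
    counts = {}
    for r in radii:
        n = 0
        for ax, ay in a_pts:
            for bx, by in b_pts:
                if math.hypot(ax - bx, ay - by) <= r:
                    n += 1
        counts[r] = n
    return counts
```

Comparing such counts at small versus large radii is one way to see whether two cell types co-localize only locally or across the whole tissue.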

Article Synopsis
  • Acute kidney injury (AKI) affects a significant number of critically ill patients, with the lack of standardized tools for implementing KDIGO criteria creating challenges for researchers.
  • The pyAKI pipeline was developed to address these issues, using the MIMIC-IV database to establish a standardized model for consistent AKI diagnosis.
  • Validation tests showed that pyAKI performs better than human annotations, achieving perfect accuracy and offering a valuable resource for clinicians and data scientists in AKI research.
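The KDIGO serum-creatinine criteria that pipelines like pyAKI operationalize can be sketched in a few lines. The function below is a deliberately simplified illustration (not the pyAKI implementation): it flags AKI when creatinine rises by at least 0.3 mg/dL within 48 hours or reaches at least 1.5 times an earlier value within 7 days:

```python
from datetime import datetime, timedelta

def creatinine_aki(measurements):
    """Simplified KDIGO creatinine criterion for AKI (illustrative sketch).

    measurements: time-sorted list of (timestamp, creatinine_mg_dl).
    Flags AKI on an absolute rise >= 0.3 mg/dL within 48 h, or a relative
    rise >= 1.5x a prior value within 7 days. Real implementations such as
    pyAKI also handle urine output, baselines, and staging.
    """
    for i, (t_i, c_i) in enumerate(measurements):
        for t_j, c_j in measurements[:i]:
            dt = t_i - t_j
            if dt <= timedelta(hours=48) and c_i - c_j >= 0.3:
                return True
            if dt <= timedelta(days=7) and c_j > 0 and c_i / c_j >= 1.5:
                return True
    return False
```

Standardizing exactly this kind of windowed comparison is what removes annotation inconsistency across studies.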

Microsatellite instability (MSI) is a critical phenotype of cancer genomes and an FDA-recognized biomarker that can guide treatment with immune checkpoint inhibitors. Previous work has demonstrated that next-generation sequencing data can be used to identify samples with MSI-high phenotype. However, low tumor purity, as frequently observed in routine clinical samples, poses a challenge to the sensitivity of existing algorithms.
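To make the sensitivity problem concrete: MSI callers look for loci where sequencing reads show repeat lengths shifted away from the reference. The toy sketch below (an assumption-laden simplification, not any published algorithm) flags a locus as unstable when the fraction of length-shifted reads exceeds a threshold; with low tumor purity, normal-cell reads dilute that fraction, which is exactly why sensitivity drops:

```python
def locus_unstable(read_lengths, ref_length, min_shift_frac=0.2):
    """Toy instability call for one microsatellite locus.

    read_lengths: observed repeat lengths from tumor reads at this locus.
    Calls the locus unstable when the fraction of reads deviating from the
    reference repeat length exceeds min_shift_frac. Real tools compare full
    tumor/normal length distributions; this is illustrative only.
    """
    if not read_lengths:
        return False
    shifted = sum(1 for n in read_lengths if n != ref_length)
    return shifted / len(read_lengths) > min_shift_frac

def msi_high(loci_calls, min_unstable_frac=0.3):
    """Sample-level call: MSI-high if enough loci are unstable."""
    if not loci_calls:
        return False
    return sum(loci_calls) / len(loci_calls) > min_unstable_frac
```

The thresholds here are hypothetical; choosing them robustly under varying tumor purity is the hard part the paragraph above alludes to.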


Cost-effectiveness models of non-small cell lung cancer: A systematic literature review.

J Manag Care Spec Pharm

January 2025

Medical Oncology Department 1, Fondazione IRCCS Istituto Nazionale Tumori, Milan, Italy (Prelaj).

Background: Non-small cell lung cancer (NSCLC) presents a formidable global health challenge owing to significant morbidity, high mortality rates, and substantial economic burden. Recent advances in targeted therapies and immunotherapies have transformed NSCLC treatment, but efficacy varies across patients. Tailoring treatment to patients can improve outcomes and potentially improve cost-effectiveness (ie, value for money) as well.
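The core quantity such cost-effectiveness models report is the incremental cost-effectiveness ratio (ICER): the extra cost of a new treatment per extra unit of health effect (commonly a quality-adjusted life year, QALY). A minimal sketch, with made-up numbers for illustration:

```python
def icer(cost_new, effect_new, cost_old, effect_old):
    """Incremental cost-effectiveness ratio.

    Extra cost per extra unit of health effect (e.g. per QALY gained).
    Returns None when the effect difference is zero, since the ratio is
    then undefined.
    """
    d_cost = cost_new - cost_old
    d_effect = effect_new - effect_old
    if d_effect == 0:
        return None
    return d_cost / d_effect

# Hypothetical example: a therapy costing $30,000 more and adding 0.5 QALYs
# yields an ICER of $60,000 per QALY.
example = icer(cost_new=60000, effect_new=1.5, cost_old=30000, effect_old=1.0)
```

Comparing the ICER against a willingness-to-pay threshold is how "value for money" is judged in the models the review surveys.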

