The delivery of accurate diagnoses is crucial in healthcare and represents the gateway to appropriate and timely treatment. Although recent large language models (LLMs) have demonstrated impressive capabilities in few-shot or zero-shot learning, their effectiveness in clinical diagnosis remains unproven. Here we present MedFound, a generalist medical language model with 176 billion parameters, pre-trained on a large-scale corpus derived from diverse medical text and real-world clinical records. We further fine-tuned MedFound to learn physicians' inferential diagnosis with a self-bootstrapping strategy-based chain-of-thought approach and introduced a unified preference alignment framework to align it with standard clinical practice. Extensive experiments demonstrate that our medical LLM outperforms other baseline LLMs and specialized models in in-distribution (common diseases), out-of-distribution (external validation) and long-tailed distribution (rare diseases) scenarios across eight specialties. Further ablation studies indicate the effectiveness of key components in our medical LLM training approach. We conducted a comprehensive evaluation of the clinical applicability of LLMs for diagnosis involving artificial intelligence (AI) versus physician comparison, AI-assistance study and human evaluation framework. Our proposed framework incorporates eight clinical evaluation metrics, covering capabilities such as medical record summarization, diagnostic reasoning and risk management. Our findings demonstrate the model's feasibility in assisting physicians with disease diagnosis as part of the clinical workflow.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1038/s41591-024-03416-6 | DOI Listing |
Global Health
January 2025
Research Group: Implementation Research, Bernhard Nocht Institute for Tropical Medicine, Hamburg, Germany.
Background: Adequate knowledge and awareness regarding diseases are essential for appropriate, high-quality healthcare. Female Genital Schistosomiasis (FGS) is a non-sexually transmitted gynaecological disease that is caused by the presence of Schistosoma haematobium eggs in the female genital tract and the resulting immune response that causes tissue damage. It is estimated to affect 56 million women, mostly in sub-Saharan Africa (SSA), where healthcare workers (HCWs) have limited awareness and knowledge of FGS.
View Article and Find Full Text PDFBMC Public Health
January 2025
School of Public Health, Faculty of Health, University of Technology Sydney, Level 8, Building 10, 235-253 Jones St, Ultimo, NSW, Australia.
Background: While some general patterns and trends of health information seeking and literacy in the Australian population are known, there is a need to understand these behaviours and skills specific to the focus areas outlined in the National Preventive Health Strategy (NPHS).
Methods: In response, this study employed a cross-sectional online survey of adults in the Australian general population (n = 1509) to investigate their knowledge and health information seeking behaviour regarding the NPHS' seven focus areas. It also explored primary care practitioners as a preventive health information source.
BMC Pregnancy Childbirth
January 2025
School of Health Sciences, College of Health, Medicine and Wellbeing, University of Newcastle, Callaghan, New South Wales, NSW, 2308, Australia.
Background: Women with a history of hypertensive disorders of pregnancy (HDP), including chronic hypertension, gestational hypertension, and preeclampsia have an increased risk of cardiovascular disease (CVD). Current research suggests that general practitioners are unaware of women's HDP history, and although ideally placed to follow-up with these women, there is limited understanding of current CVD prevention practices in women after HDP. Additionally, preeclampsia confers a higher CVD risk compared to other types of HDP, and Australian research suggests that lower socioeconomic status (SES) is associated with a higher incidence of both HDP and CVD.
View Article and Find Full Text PDFNat Med
January 2025
State Key Laboratory of Networking and Switching Technology, Beijing University of Posts and Telecommunications, Beijing, China.
The delivery of accurate diagnoses is crucial in healthcare and represents the gateway to appropriate and timely treatment. Although recent large language models (LLMs) have demonstrated impressive capabilities in few-shot or zero-shot learning, their effectiveness in clinical diagnosis remains unproven. Here we present MedFound, a generalist medical language model with 176 billion parameters, pre-trained on a large-scale corpus derived from diverse medical text and real-world clinical records.
View Article and Find Full Text PDFNat Med
January 2025
Google Research, Mountain View, CA, USA.
Large language models (LLMs) have shown promise in medical question answering, with Med-PaLM being the first to exceed a 'passing' score in United States Medical Licensing Examination style questions. However, challenges remain in long-form medical question answering and handling real-world workflows. Here, we present Med-PaLM 2, which bridges these gaps with a combination of base LLM improvements, medical domain fine-tuning and new strategies for improving reasoning and grounding through ensemble refinement and chain of retrieval.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!