AI Article Synopsis

  • Large language models (LLMs) show promise for clinical use, but issues with errors and harmful outputs limit their application, especially in rare diseases (RDs).
  • The study presents RDguru, an AI agent built on the LangChain framework using GPT-3.5-turbo, which provides advanced medical consultations, knowledge Q&A, and a unique diagnostic model for improved accuracy.
  • RDguru outperforms ChatGPT and current top methods by effectively integrating multiple diagnostic strategies and enhancing clinical decision-making in rare disease cases.

Article Abstract

Large language models (LLMs) hold significant promise in clinical practice, yet their real-world adoption is constrained by their propensity to produce erroneous and occasionally harmful outputs, particularly in the intricate domain of rare diseases (RDs). This study introduces RDguru, a conversational intelligent agent leveraging the LangChain framework and powered by GPT-3.5-turbo. RDguru offers a comprehensive suite of functionalities, encompassing evidence-traceable knowledge Q&A and professional medical consultations for differential diagnosis (DDX), integrating authoritative knowledge sources and reliable tools. A novel multi-source fusion diagnostic model, rooted in deep Q-network, amalgamates three diagnostic recommendation strategies (GPT-4, PheLR, and phenotype matching) to enhance diagnostic recall during medical consultations. Through tailored tools and advanced algorithms for retrieval-augmented generation, RDguru excels in knowledge Q&A, automated phenotype annotation, and RD DDX. A multi-aspect Q&A analysis demonstrates RDguru outperforms ChatGPT in generating descriptions aligned with authoritative knowledge, quantified by ROUGE scores, GPT-4-based automatic rating, and RAGAs evaluation metrics. Testing on 238 published RD cases reveals that RDguru's top 5 multi-source fusion diagnoses recapture 63.87% of actual diagnoses, marking a 5.47% improvement over the state-of-the-art diagnostic method PheLR. Furthermore, RDguru's consultation strategy proves effective in eliciting diagnostically beneficial phenotypes and refining the prioritization of genuine diagnoses through multi-round phenotype-orient questioning. Evaluations against established benchmarks and real-world patient data demonstrate RDguru's efficacy and reliability, highlighting its potential to enhance clinical decision-making in the realm of RDs.

Download full-text PDF

Source
http://dx.doi.org/10.1109/JBHI.2024.3464555DOI Listing

Publication Analysis

Top Keywords

rdguru conversational
8
conversational intelligent
8
intelligent agent
8
rare diseases
8
knowledge q&a
8
medical consultations
8
authoritative knowledge
8
multi-source fusion
8
rdguru
5
agent rare
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!