OphGLM: An ophthalmology large language-and-vision assistant.

Zhuo Deng Weihao Gao Chucheng Chen Zhiyuan Niu Zheng Gong Ruiheng Zhang Zhenjie Cao Fang Li Zhaoyi Ma Wenbin Wei Lan Ma

Artif Intell Med

Shenzhen International Graduate School, Tsinghua University, Shenzhen, China. Electronic address:

Published: November 2024

Vision computer-aided diagnostic methods have been used in early ophthalmic disease screening and diagnosis. However, the limited output formats of these methods lead to poor human-computer interaction and low clinical applicability value. Thus, ophthalmic visual question answering is worth studying. Unfortunately, no practical solutions exist before Large Language Models(LLMs). In this paper, we investigate the ophthalmic visual diagnostic interaction problem. We construct an ophthalmology large language-and-vision assistant, OphGLM, consisting of an image encoder, a text encoder, a fusion module, and an LLM module. We establish a new Chinese ophthalmic fine-tuning dataset, FundusTuning-CN, including the fundus instruction and conversation sets. Based on FundusTuning-CN, we establish a novel LLM-tuning strategy to introduce visual model understanding and ophthalmic knowledge into LLMs at a low cost and high efficiency. Leveraging the pre-training of the image encoder, OphGLM demonstrates strong visual understanding and surpasses open-source visual language models in common fundus disease classification tasks. The FundusTuning-CN enables OphGLM to surpass open-source medical LLMs in both ophthalmic knowledge and interactive capabilities. Our proposed OphGLM has the potential to revolutionize clinical applications in ophthalmology. The dataset, code, and models will be publicly available at https://github.com/ML-AILab/OphGLM.

Download full-text PDF	Source
http://dx.doi.org/10.1016/j.artmed.2024.103001	DOI Listing

Publication Analysis

Top Keywords

ophthalmology large

large language-and-vision

language-and-vision assistant

ophthalmic visual

image encoder

ophthalmic knowledge

ophthalmic

ophglm

visual

ophglm ophthalmology

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!

A PHP Error was encountered