Objective: This study aimed to investigate the accuracy of large language models (LLMs), specifically ChatGPT and Claude, in surgical decision-making and radiological assessment for spine pathologies compared to experienced spine surgeons.

Methods: The study employed a comparative analysis between the LLMs and a panel of attending spine surgeons. Five written clinical scenarios encompassing various spine pathologies were presented to the LLMs and surgeons, who provided recommended surgical treatment plans. Additionally, magnetic resonance imaging images depicting spine pathologies were analyzed by the LLMs and surgeons to assess their radiological interpretation abilities. Spino-pelvic parameters were estimated from a scoliosis radiograph by the LLMs.

Results: Qualitative content analysis revealed limitations in the LLMs' consideration of patient-specific factors and the breadth of treatment options. Both ChatGPT and Claude provided detailed descriptions of magnetic resonance imaging findings but differed from the surgeons in terms of specific levels and severity of pathologies. The LLMs acknowledged the limitations of accurately measuring spino-pelvic parameters without specialized tools. The accuracy of surgical decision-making for the LLMs (20%) was lower than that of the attending surgeons (100%). Statistical analysis showed no significant differences in accuracy between the groups.

Conclusions: The study highlights the potential of LLMs in assisting with radiological interpretation and surgical decision-making in spine surgery. However, the current limitations, such as the lack of consideration for patient-specific factors and inaccuracies in treatment recommendations, emphasize the need for further refinement and validation of these artificial intelligence (AI) models. Continued collaboration between AI researchers and clinical experts is crucial to address these challenges and realize the full potential of AI in spine surgery.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.wneu.2024.11.114DOI Listing

Publication Analysis

Top Keywords

surgical decision-making
16
spine pathologies
16
spine
9
comparative analysis
8
large language
8
language models
8
spine surgeons
8
decision-making radiological
8
radiological assessment
8
assessment spine
8

Similar Publications

laparoscopy has emerged as a pivotal tool for the management of acute abdominal pathologies. It provides diagnostic and therapeutic advantages, enabling surgeons to evaluate and address diverse acute abdominal conditions using minimally invasive techniques. The aim of this consensus was to obtain evidence-based guidance for surgeons regarding the utilization of laparoscopy in emergency medical settings, and has been divided into trauma and non-trauma emergencies.

View Article and Find Full Text PDF

Age, frequency, and strategy optimization for organized colorectal cancer screening: a decision analysis conducted in China for the years 2023-2038.

BMC Cancer

December 2024

Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences, School of Basic Medicine, Peking Union Medical College, No. 5 Dongdansantiao Street, Dongcheng District, Beijing, 100005, China.

Background: The colorectal cancer mortality rate in China has exceeded that in many developing countries and is expected to further increase owing to multiple factors, including the aging population. However, the optimal policy for colorectal cancer screening is unknown.

Methods: We synthesized the most up-to-date data using a 12-state Markov model populated with a cohort of Chinese men and women born during 1949-1988, and evaluated 16 conventional and 40 risk-tailored schemes for colorectal cancer screening, considering possible combinations of age (starting at 40 + years and ending at 75 years), frequency, and strategy (standard colonoscopy, fecal immunochemical testing with colonoscopy if positive, or risk-tailored).

View Article and Find Full Text PDF

Prognostic implications of immune classification based on PD-L1 expression and tumor-infiltrating lymphocytes in endocervical adenocarcinoma.

Transl Oncol

December 2024

Sun Yat-sen University Cancer Center, State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Guangzhou 510060, China; Department of Pathology, Sun Yat-sen University Cancer Center, Guangzhou 510060, China. Electronic address:

Background: Endocervical adenocarcinoma (ECA) comprises a heterogeneous group of diseases whose incidence has increased significantly in recent decades. ECA can be histologically classified into human papillomavirus-associated (HPVA) and non-HPVA (NHPVA) types. Given the variability in pathological features and clinical behavior between the subtypes, evaluating their respective immune microenvironments is essential.

View Article and Find Full Text PDF

Objective: The primary objective of this study is to investigate various applications of artificial intelligence (AI) and statistical methodologies for analyzing and managing peritoneal metastases (PM) caused by gastrointestinal cancers.

Methods: Relevant keywords and search criteria were comprehensively researched on PubMed and Google Scholar to identify articles and reviews related to the topic. The AI approaches considered were conventional machine learning (ML) and deep learning (DL) models, and the relevant statistical approaches included biostatistics and logistic models.

View Article and Find Full Text PDF

Background: Surgery is the best approach to treat endometrial cancer (EC); however, there is currently a deficiency in effective scoring systems for predicting EC recurrence post-surgical resection. This study aims to develop a clinicopathological-inflammatory parameters-based nomogram to accurately predict the postoperative recurrence-free survival (RFS) rate of EC patients.

Methods: A training set containing 1068 patients and an independent validation set consisting of 537 patients were employed in this retrospective study.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!