Objective: This study aimed to investigate the accuracy of large language models (LLMs), specifically ChatGPT and Claude, in surgical decision-making and radiological assessment for spine pathologies compared to experienced spine surgeons.
Methods: The study employed a comparative analysis between the LLMs and a panel of attending spine surgeons. Five written clinical scenarios encompassing various spine pathologies were presented to the LLMs and surgeons, who provided recommended surgical treatment plans. Additionally, magnetic resonance imaging images depicting spine pathologies were analyzed by the LLMs and surgeons to assess their radiological interpretation abilities. Spino-pelvic parameters were estimated from a scoliosis radiograph by the LLMs.
Results: Qualitative content analysis revealed limitations in the LLMs' consideration of patient-specific factors and the breadth of treatment options. Both ChatGPT and Claude provided detailed descriptions of magnetic resonance imaging findings but differed from the surgeons in terms of specific levels and severity of pathologies. The LLMs acknowledged the limitations of accurately measuring spino-pelvic parameters without specialized tools. The accuracy of surgical decision-making for the LLMs (20%) was lower than that of the attending surgeons (100%). Statistical analysis showed no significant differences in accuracy between the groups.
Conclusions: The study highlights the potential of LLMs in assisting with radiological interpretation and surgical decision-making in spine surgery. However, the current limitations, such as the lack of consideration for patient-specific factors and inaccuracies in treatment recommendations, emphasize the need for further refinement and validation of these artificial intelligence (AI) models. Continued collaboration between AI researchers and clinical experts is crucial to address these challenges and realize the full potential of AI in spine surgery.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1016/j.wneu.2024.11.114 | DOI Listing |
laparoscopy has emerged as a pivotal tool for the management of acute abdominal pathologies. It provides diagnostic and therapeutic advantages, enabling surgeons to evaluate and address diverse acute abdominal conditions using minimally invasive techniques. The aim of this consensus was to obtain evidence-based guidance for surgeons regarding the utilization of laparoscopy in emergency medical settings, and has been divided into trauma and non-trauma emergencies.
View Article and Find Full Text PDFBMC Cancer
December 2024
Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences, School of Basic Medicine, Peking Union Medical College, No. 5 Dongdansantiao Street, Dongcheng District, Beijing, 100005, China.
Background: The colorectal cancer mortality rate in China has exceeded that in many developing countries and is expected to further increase owing to multiple factors, including the aging population. However, the optimal policy for colorectal cancer screening is unknown.
Methods: We synthesized the most up-to-date data using a 12-state Markov model populated with a cohort of Chinese men and women born during 1949-1988, and evaluated 16 conventional and 40 risk-tailored schemes for colorectal cancer screening, considering possible combinations of age (starting at 40 + years and ending at 75 years), frequency, and strategy (standard colonoscopy, fecal immunochemical testing with colonoscopy if positive, or risk-tailored).
Transl Oncol
December 2024
Sun Yat-sen University Cancer Center, State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Guangzhou 510060, China; Department of Pathology, Sun Yat-sen University Cancer Center, Guangzhou 510060, China. Electronic address:
Background: Endocervical adenocarcinoma (ECA) comprises a heterogeneous group of diseases whose incidence has increased significantly in recent decades. ECA can be histologically classified into human papillomavirus-associated (HPVA) and non-HPVA (NHPVA) types. Given the variability in pathological features and clinical behavior between the subtypes, evaluating their respective immune microenvironments is essential.
View Article and Find Full Text PDFBrief Funct Genomics
December 2024
Department of Machine Learning, H. Lee Moffitt Cancer Center and Research Institute, 12902 Magnolia Drive, Tampa FL 33612, United States.
Objective: The primary objective of this study is to investigate various applications of artificial intelligence (AI) and statistical methodologies for analyzing and managing peritoneal metastases (PM) caused by gastrointestinal cancers.
Methods: Relevant keywords and search criteria were comprehensively researched on PubMed and Google Scholar to identify articles and reviews related to the topic. The AI approaches considered were conventional machine learning (ML) and deep learning (DL) models, and the relevant statistical approaches included biostatistics and logistic models.
J Inflamm Res
December 2024
Department of Gynecology, The First Affiliated Hospital of Chongqing Medical University, Chongqing, 400016, People's Republic of China.
Background: Surgery is the best approach to treat endometrial cancer (EC); however, there is currently a deficiency in effective scoring systems for predicting EC recurrence post-surgical resection. This study aims to develop a clinicopathological-inflammatory parameters-based nomogram to accurately predict the postoperative recurrence-free survival (RFS) rate of EC patients.
Methods: A training set containing 1068 patients and an independent validation set consisting of 537 patients were employed in this retrospective study.
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!