Autonomous Medical Evaluation for Guideline Adherence (AMEGA) is a comprehensive benchmark designed to evaluate large language models' adherence to medical guidelines across 20 diagnostic scenarios spanning 13 specialties. It includes an evaluation framework and methodology to assess models' capabilities in medical reasoning, differential diagnosis, treatment planning, and guideline adherence, using open-ended questions that mirror real-world clinical interactions. It includes 135 questions and 1337 weighted scoring elements designed to assess comprehensive medical knowledge.
View Article and Find Full Text PDFPurpose: Large language models (LLMs) promise to streamline radiology reporting. With the release of OpenAI's GPT-4o (Generative Pre-trained Transformers-4 omni), which processes not only text but also speech, multimodal LLMs might now also be used as medical speech recognition software for radiology reporting in multiple languages. This proof-of-concept study investigates the feasibility of using GPT-4o for automated voice-to-text transcription of radiology reports in English and German.
View Article and Find Full Text PDFStructured reporting (SR) has long been a goal in radiology to standardize and improve the quality of radiology reports. Despite evidence that SR reduces errors, enhances comprehensiveness, and increases adherence to guidelines, its widespread adoption has been limited. Recently, large language models (LLMs) have emerged as a promising solution to automate and facilitate SR.
View Article and Find Full Text PDFBackground: To investigate the reproducibility of automated volumetric bone mineral density (vBMD) measurements from routine thoracoabdominal computed tomography (CT) assessed with segmentations by a convolutional neural network and automated correction of contrast phases, on diverse scanners, with scanner-specific asynchronous or scanner-agnostic calibrations.
Methods: We obtained 679 observations from 278 CT scans in 121 patients (77 males, 63.6%) studied from 04/2019 to 06/2020.
Automated segmentation of brain white matter lesions is crucial for both clinical assessment and scientific research in multiple sclerosis (MS). Over a decade ago, we introduced an engineered lesion segmentation tool, LST. While recent lesion segmentation approaches have leveraged artificial intelligence (AI), they often remain proprietary and difficult to adopt.
View Article and Find Full Text PDFAutomated segmentation of brain white matter lesions is crucial for both clinical assessment and scientific research in multiple sclerosis (MS). Over a decade ago, we introduced an engineered lesion segmentation tool, LST. While recent lesion segmentation approaches have leveraged artificial intelligence (AI), they often remain proprietary and difficult to adopt.
View Article and Find Full Text PDFBackground: To investigate reproducibility of texture features and volumetric bone mineral density (vBMD) extracted from trabecular bone in the thoracolumbar spine in routine clinical multi-detector computed tomography (MDCT) data in a single scanner environment.
Methods: Patients who underwent two routine clinical thoraco-abdominal MDCT exams at a single scanner with a time interval of 6 to 26 months (n=203, 131 males; time interval mean, 13 months; median, 12 months) were included in this observational study. Exclusion criteria were metabolic and hematological disorders, bone metastases, use of bone-active medications, and history of osteoporotic vertebral fractures (VFs) or prior diagnosis of osteoporosis.