We developed a named entity (NE) framework for information extraction from semi-structured clinical notes retrieved from The Cancer Genome Atlas-Thyroid Cancer (TCGA-THCA) database and examined Large Language Models (LLMs) strategies to classify the 8 edition of American Joint Committee on Cancer (AJCC) staging and American Thyroid Association (ATA) risk category for patients with well-differentiated thyroid cancer. The NE framework consisted of annotation guidelines development, ground truth labelling, prompting approaches, and evaluation codes. Four LLMs (Mistral-7B-Instruct, Llama-3.1-8B-Instruct, Gemma-2-9B-Instruct, and Qwen2.5-7B-Instruct) were offline utilised for information extraction, comparing with expert-curated ground truth. Our framework was developed using 50 TCGA-THCA pathology notes. 289 TCGA-THCA notes and 35 pseudo-clinical cases were used for validation. Taking an ensemble-like majority-vote strategy achieved satisfactory performance for AJCC and ATA in both development and validation sets. Our framework and ensemble classifier optimised efficiency and accuracy of classifying stage and risk category in thyroid cancer patients.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11873034PMC
http://dx.doi.org/10.1038/s41746-025-01528-yDOI Listing

Publication Analysis

Top Keywords

thyroid cancer
12
named entity
8
entity framework
8
large language
8
language models
8
risk category
8
ground truth
8
cancer
6
framework
5
developing named
4

Similar Publications

Ocular motor cranial neuropathy and risk of thyroid cancer: A Korean population-based study.

PLoS One

March 2025

Department of Ophthalmology, Hallym University School of Medicine, Dongtan Sacred Heart Hospital, Hwaseong, Republic of Korea.

This study investigates whether ocular motor cranial neuropathy (OMCN) can predict the onset of thyroid cancer given its association with common cardiovascular risk factors including obesity, diabetes mellitus (DM), hypertension, and dyslipidemia. We conducted a retrospective, nationwide, population-based cohort study utilizing data from the Korean National Health Insurance Service. Individuals comprised those aged ≥ 20 years diagnosed with OMCN between 2010 and 2017.

View Article and Find Full Text PDF

SCN3B is an Anti-breast Cancer Molecule with Migration Inhibition Effect.

Biochem Genet

March 2025

Department of Gynecology, People's Hospital of Jianshi, Enshi Tujia and Miao Autonomous Prefecture, Enshi City, Hubei Province, China.

Breast cancer is a prevalent and highly heterogeneous malignancy that continues to be a major global health concern. Voltage-gated sodium channels are primarily known for their role in neuronal excitability, but emerging evidence suggests their involvement in the pathogenesis of various cancers, including breast cancer. However, the effect of β-subunits on breast cancer cells is not yet studied.

View Article and Find Full Text PDF

Background: Thyroid cancer is a prevalent malignant tumor, especially with a higher incidence in women. Tumor microenvironment changes induced by inflammation and alterations in metabolic characteristics are critical in the development of thyroid cancer. Nevertheless, their causal relationships remain unclear.

View Article and Find Full Text PDF

This study unveils PKM2 as a master metabolic coordinator in triple-negative breast cancer (TNBC), governing the glycolysis-lipolysis balance through the AMPK/KLF4/ACADVL axis. We demonstrate stage-specific PKM2 upregulation in TNBC, with CRISPR/Cas9 knockout inducing dual metabolic reprogramming-suppressed glycolysis and activated lipid catabolism. Mechanistically, PKM2 ablation triggers AMPK-dependent nuclear translocation of KLF4, which directly activates ACADVL (mitochondrial β-oxidation rate-limiting enzyme), explaining lipid droplet depletion.

View Article and Find Full Text PDF

Validation of Diagnostic Utility of Washout CYFRA 21-1 in Lymph Node Metastasis of Thyroid Cancer.

Clin Cancer Res

March 2025

Seoul St. Mary's Hospital, College of Medicine, The Catholic University of Korea, Seoul, Republic of Korea, Seoul, Korea (South), Republic of.

Purpose: Traditional methods, fine-needle aspiration cytology (FNAC) and washout thyroglobulin (Tg), do not always provide sufficient accuracy for diagnosing lymph node (LN) metastasis in thyroid cancer. This study aimed to validate the diagnostic performance of washout cytokeratin fragment 21-1 (CYFRA 21-1) as a complementary biomarker for diagnosing metastatic LNs in thyroid cancer and to explore its relationship with molecular analysis and distant metastasis.

Patients And Methods: In this retrospective cohort study involving 230 LNs in 224 patients with PTC, FNAC, washout Tg, and CYFRA 21-1 levels were measured in suspicious LNs.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!