Purpose: Typically stored as unstructured notes, surgical pathology reports contain data elements valuable to cancer research that require labor-intensive manual extraction. Although studies have described natural language processing (NLP) of surgical pathology reports to automate information extraction, efforts have focused on specific cancer subtypes rather than across multiple oncologic domains. To address this gap, we developed and evaluated an NLP method to extract tumor staging and diagnosis information across multiple cancer subtypes.
Methods: The NLP pipeline was implemented on an open-source framework called Leo. We used a total of 555,681 surgical pathology reports of 329,076 patients to develop the pipeline and evaluated our approach on subsets of reports from patients with breast, prostate, colorectal, and randomly selected cancer subtypes.
Results: Averaged across all four cancer subtypes, the NLP pipeline achieved an accuracy of 1.00 for International Classification of Diseases, Tenth Revision codes, 0.89 for T staging, 0.90 for N staging, and 0.97 for M staging. It achieved an F1 score of 1.00 for International Classification of Diseases, Tenth Revision codes, 0.88 for T staging, 0.90 for N staging, and 0.24 for M staging.
Conclusion: The NLP pipeline was developed to extract tumor staging and diagnosis information across multiple cancer subtypes to support the research enterprise in our institution. Although it was not possible to demonstrate generalizability of our NLP pipeline to other institutions, other institutions may find value in adopting a similar NLP approach-and reusing code available at GitHub-to support the oncology research enterprise with elements extracted from surgical pathology reports.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8812635 | PMC |
http://dx.doi.org/10.1200/CCI.21.00065 | DOI Listing |
Achondroplasia, the most prevalent short-stature disorder, is caused by missense variants overactivating the fibroblast growth factor receptor 3 (FGFR3). As current surgical and pharmaceutical treatments only partially improve some disease features, we sought to explore a genetic approach. We show that an enhancer located 29 kb upstream of mouse Fgfr3 (-29E) is sufficient to confer a transgenic mouse reporter with a domain of expression in cartilage matching that of Fgfr3.
View Article and Find Full Text PDFWorld J Gastrointest Oncol
January 2025
Faculty of Medicine, Carol Davila University of Medicine and Pharmacy, Bucharest 050474, Romania.
Background: Pancreatic ductal adenocarcinoma (PDAC) is an aggressive lethal malignancy with limited options for treatment and a 5-year survival rate of 11% in the United States. As for other types of tumors, such as colorectal cancer, aberrant lipid synthesis and reprogrammed lipid metabolism have been suggested to be associated with PDAC development and progression.
Aim: To identify the possible involvement of lipid metabolism in PDAC by analyzing in tumoral and non-tumoral tissues the expression level of the most relevant genes involved in the long-chain fatty acid (FA) import into cell.
World J Gastrointest Oncol
January 2025
Department of General Surgery, Hospital General de Requena, Requena 46340, Spain.
In this editorial we examine the article by Wu published in the . Surgical resection for peritoneal metastases from colorectal cancer (CRC) has been gradually accepted in the medical oncology community. A randomized trial (PRODIGE 7) on cytoreductive surgery (CRS) with hyperthermic intraperitoneal chemotherapy (HIPEC) failed to prove any benefit of oxaliplatin in the overall survival of patients with peritoneal metastases from colorectal origin.
View Article and Find Full Text PDFWorld J Gastrointest Oncol
January 2025
Department of Hepatobiliary and Pancreaticosplenic Surgery, Jingzhou Hospital Affiliated to Yangtze University, Jingzhou 434100, Hubei Province, China.
Background: The liver, as the main target organ for hematogenous metastasis of colorectal cancer, early and accurate prediction of liver metastasis is crucial for the diagnosis and treatment of patients. Herein, this study aims to investigate the application value of a combined machine learning (ML) based model based on the multiparameter magnetic resonance imaging for prediction of rectal metachronous liver metastasis (MLM).
Aim: To investigate the efficacy of radiomics based on multiparametric magnetic resonance imaging images of preoperative first diagnosed rectal cancer in predicting MLM from rectal cancer.
World J Gastrointest Oncol
January 2025
Pathology Department, Xuanhan County People's Hospital, Dazhou 636150, Sichuan Province, China.
Background: Pancreatic cancer remains one of the most lethal malignancies worldwide, with a poor prognosis often attributed to late diagnosis. Understanding the correlation between pathological type and imaging features is crucial for early detection and appropriate treatment planning.
Aim: To retrospectively analyze the relationship between different pathological types of pancreatic cancer and their corresponding imaging features.
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!