J Biomed Semantics
September 2024
Background: Natural language processing (NLP) is increasingly being used to extract structured information from unstructured text to assist clinical decision-making and aid healthcare research. The availability of expert-annotated documents for the development and validation of NLP applications is limited. We created synthetic clinical documents to address this, and to validate the Extraction of Epilepsy Clinical Text version 2 (ExECTv2) NLP pipeline.
View Article and Find Full Text PDFBackground: Cancer multidisciplinary team (MDT) meetings are under intense pressure to reform given the rapidly rising incidence of cancer and national mandates for protocolized streaming of cases. The aim of this study was to validate a natural language processing (NLP)-based web platform to automate evidence-based MDT decisions for skin cancer with basal cell carcinoma as a use case.
Methods: A novel and validated NLP information extraction model was used to extract perioperative tumour and surgical factors from histopathology reports.
Identifying genetic risk factors for highly heterogeneous disorders like epilepsy remains challenging. Here, we present the largest whole-exome sequencing study of epilepsy to date, with >54,000 human exomes, comprising 20,979 deeply phenotyped patients from multiple genetic ancestry groups with diverse epilepsy subtypes and 33,444 controls, to investigate rare variants that confer disease risk. These analyses implicate seven individual genes, three gene sets, and four copy number variants at exome-wide significance.
View Article and Find Full Text PDFIntroduction: Routinely collected healthcare data are a powerful research resource, but often lack detailed disease-specific information that is collected in clinical free text such as histopathology reports. We aim to use natural Language Processing (NLP) techniques to extract detailed clinical and pathological information from histopathology reports to enrich routinely collected data.
Methods: We used the general architecture for text engineering (GATE) framework to build an NLP information extraction system using rule-based techniques.
Purpose: The COVID-19 pandemic has increased mortality worldwide and those with chronic conditions may have been disproportionally affected. However, it is unknown whether the pandemic has changed mortality rates for people with epilepsy. We aimed to compare mortality rates in people with epilepsy in Wales during the pandemic with pre-pandemic rates.
View Article and Find Full Text PDFObjective: This study was undertaken to determine whether epilepsy and antiepileptic drugs (including enzyme-inducing and non-enzyme-inducing drugs) are associated with major cardiovascular events using population-level, routinely collected data.
Methods: Using anonymized, routinely collected, health care data in Wales, UK, we performed a retrospective matched cohort study (2003-2017) of adults with epilepsy prescribed an antiepileptic drug. Controls were matched with replacement on age, gender, deprivation quintile, and year of entry into the study.
Objective: To characterize trends in incidence, prevalence, and health care outcomes in the idiopathic intracranial hypertension (IIH) population in Wales using routinely collected health care data.
Methods: We used and validated primary and secondary care IIH diagnosis codes within the Secure Anonymised Information Linkage databank to ascertain IIH cases and controls in a retrospective cohort study between 2003 and 2017. We recorded body mass index (BMI), deprivation quintile, CSF diversion surgery, and unscheduled hospital admissions in case and control cohorts.
Objective: Routinely collected healthcare data are a powerful research resource but often lack detailed disease-specific information that is collected in clinical free text, for example, clinic letters. We aim to use natural language processing techniques to extract detailed clinical information from epilepsy clinic letters to enrich routinely collected data.
Design: We used the general architecture for text engineering (GATE) framework to build an information extraction system, ExECT (extraction of epilepsy clinical text), combining rule-based and statistical techniques.
Purpose: Anonymised, routinely-collected healthcare data is increasingly being used for epilepsy research. We validated algorithms using general practitioner (GP) primary healthcare records to identify people with epilepsy from anonymised healthcare data within the Secure Anonymised Information Linkage (SAIL) databank in Wales, UK.
Method: A reference population of 150 people with definite epilepsy and 150 people without epilepsy was ascertained from hospital records and linked to records contained within SAIL (containing GP records for 2.