Publications by authors named "Beata Fonferko-Shadrach"

Background: Natural language processing (NLP) is increasingly being used to extract structured information from unstructured text to assist clinical decision-making and aid healthcare research. The availability of expert-annotated documents for the development and validation of NLP applications is limited. We created synthetic clinical documents to address this, and to validate the Extraction of Epilepsy Clinical Text version 2 (ExECTv2) NLP pipeline.

View Article and Find Full Text PDF

Background: Cancer multidisciplinary team (MDT) meetings are under intense pressure to reform given the rapidly rising incidence of cancer and national mandates for protocolized streaming of cases. The aim of this study was to validate a natural language processing (NLP)-based web platform to automate evidence-based MDT decisions for skin cancer with basal cell carcinoma as a use case.

Methods: A novel and validated NLP information extraction model was used to extract perioperative tumour and surgical factors from histopathology reports.

View Article and Find Full Text PDF
Article Synopsis
  • This study aimed to connect genetic data with regular health information of epilepsy patients to see how rare genetic variants affect epilepsy outcomes.
  • Researchers linked whole-exome sequencing data with health records of adults from the Swansea Neurology Biobank and analyzed factors like hospital admissions and medication types.
  • The findings showed no significant genetic influence on epilepsy outcomes despite successfully merging genetic and health data, highlighting a need for larger studies for better insights.
View Article and Find Full Text PDF

Identifying genetic risk factors for highly heterogeneous disorders like epilepsy remains challenging. Here, we present the largest whole-exome sequencing study of epilepsy to date, with >54,000 human exomes, comprising 20,979 deeply phenotyped patients from multiple genetic ancestry groups with diverse epilepsy subtypes and 33,444 controls, to investigate rare variants that confer disease risk. These analyses implicate seven individual genes, three gene sets, and four copy number variants at exome-wide significance.

View Article and Find Full Text PDF

Introduction: Routinely collected healthcare data are a powerful research resource, but often lack detailed disease-specific information that is collected in clinical free text such as histopathology reports. We aim to use natural Language Processing (NLP) techniques to extract detailed clinical and pathological information from histopathology reports to enrich routinely collected data.

Methods: We used the general architecture for text engineering (GATE) framework to build an NLP information extraction system using rule-based techniques.

View Article and Find Full Text PDF

Purpose: The COVID-19 pandemic has increased mortality worldwide and those with chronic conditions may have been disproportionally affected. However, it is unknown whether the pandemic has changed mortality rates for people with epilepsy. We aimed to compare mortality rates in people with epilepsy in Wales during the pandemic with pre-pandemic rates.

View Article and Find Full Text PDF
Article Synopsis
  • There is a growing amount of unstructured text data generated in fields like health, law, and social media, which holds valuable information but is difficult to analyze without substantial expert input for labeling.
  • Annotation tools like Markup help convert this unstructured text into structured annotations, making it easier to utilize in research and analysis, especially in Natural Language Processing (NLP) applications.
  • Markup is a web-based, open-source tool that uses NLP and Active Learning technologies to streamline the annotation process, allowing for custom configurations and real-world applications, such as in healthcare, where it has been utilized to process clinic letters.
View Article and Find Full Text PDF

Objective: This study was undertaken to determine whether epilepsy and antiepileptic drugs (including enzyme-inducing and non-enzyme-inducing drugs) are associated with major cardiovascular events using population-level, routinely collected data.

Methods: Using anonymized, routinely collected, health care data in Wales, UK, we performed a retrospective matched cohort study (2003-2017) of adults with epilepsy prescribed an antiepileptic drug. Controls were matched with replacement on age, gender, deprivation quintile, and year of entry into the study.

View Article and Find Full Text PDF

Objective: To characterize trends in incidence, prevalence, and health care outcomes in the idiopathic intracranial hypertension (IIH) population in Wales using routinely collected health care data.

Methods: We used and validated primary and secondary care IIH diagnosis codes within the Secure Anonymised Information Linkage databank to ascertain IIH cases and controls in a retrospective cohort study between 2003 and 2017. We recorded body mass index (BMI), deprivation quintile, CSF diversion surgery, and unscheduled hospital admissions in case and control cohorts.

View Article and Find Full Text PDF

Objective: Routinely collected healthcare data are a powerful research resource but often lack detailed disease-specific information that is collected in clinical free text, for example, clinic letters. We aim to use natural language processing techniques to extract detailed clinical information from epilepsy clinic letters to enrich routinely collected data.

Design: We used the general architecture for text engineering (GATE) framework to build an information extraction system, ExECT (extraction of epilepsy clinical text), combining rule-based and statistical techniques.

View Article and Find Full Text PDF

Purpose: Anonymised, routinely-collected healthcare data is increasingly being used for epilepsy research. We validated algorithms using general practitioner (GP) primary healthcare records to identify people with epilepsy from anonymised healthcare data within the Secure Anonymised Information Linkage (SAIL) databank in Wales, UK.

Method: A reference population of 150 people with definite epilepsy and 150 people without epilepsy was ascertained from hospital records and linked to records contained within SAIL (containing GP records for 2.

View Article and Find Full Text PDF