AI Article Synopsis

  • The main challenge for clinical Named Entity Recognition (NER) is the lack of annotated training data due to technical and legal issues with electronic health records (EHRs).
  • This study explores how generating pseudo-data using gazetteering with a neural network model affects clinical NER performance.
  • Results indicate that using gazetteers with a higher number of relevant terms leads to better inclusion of proper terms while filtering out unnecessary words like determiners and pronouns.

Article Abstract

One of the primary challenges for clinical Named Entity Recognition (NER) is the availability of annotated training data. Technical and legal hurdles prevent the creation and release of corpora related to electronic health records (EHRs). In this work, we look at the impact of pseudo-data generation on clinical NER using gazetteering utilizing a neural network model. We report that gazetteers can result in the inclusion of proper terms with the exclusion of determiners and pronouns in preceding and middle positions. Gazetteers that had higher numbers of terms inclusive to the original dataset had a higher impact.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8378614PMC

Publication Analysis

Top Keywords

pseudo-data generation
8
generation extraction
4
extraction problems
4
problems treatments
4
treatments tests
4
tests primary
4
primary challenges
4
challenges clinical
4
clinical named
4
named entity
4

Similar Publications

International norms for adult handgrip strength: A systematic review of data on 2.4 million adults aged 20 to 100+ years from 69 countries and regions.

J Sport Health Sci

November 2024

Alliance for Research in Exercise, Nutrition and Activity (ARENA), Allied Health and Human Performance, University of South Australia, Adelaide, SA, 5000, Australia.

Article Synopsis
  • Muscular strength, particularly measured by handgrip strength (HGS), is an important indicator of health and a predictor of age-related diseases, but no international benchmarks exist for HGS across different ages and sexes.
  • The study systematically analyzed data from over 2.4 million adults across 69 countries to establish sex- and age-specific norms for HGS, identifying a peak in strength between ages 30 and 39 before a gradual decline.
  • Findings revealed that while absolute and body size-normalized HGS improves slightly in early adulthood, a more significant drop occurs from middle to late adulthood, with males generally experiencing a faster decline than females.
View Article and Find Full Text PDF
Article Synopsis
  • Continual learning (CL) focuses on learning new tasks while retaining knowledge from previous tasks, but traditional methods often struggle due to the lack of available raw data because of copyright and privacy issues.
  • This paper introduces two innovative settings for CL: data-efficient CL (DECL-APIs) and data-free CL (DFCL-APIs), which utilize machine learning APIs to train models with little or no raw data, facing unique challenges like incomplete data and model variability.
  • A new cooperative continual distillation learning framework is proposed, using adversarial training with generators to create synthetic data from APIs, enabling effective knowledge transfer to a CL model while implementing a regularization term to prevent forgetting previous knowledge; results show that this approach can perform compar
View Article and Find Full Text PDF
Article Synopsis
  • Computer-aided diagnosis (CAD) systems enhance pathologists' ability to accurately and consistently diagnose cancers, but models trained on data from a single center face challenges in generalizing to different medical environments.
  • The proposed solution, SSL-FT-BT, utilizes a self-supervised federated learning framework and generates pseudo-histopathological images from each center, allowing for enhanced model training without compromising patient privacy.
  • Experimental results demonstrate that this approach improves both diagnostic accuracy and the ability to generalize across different datasets, showing its effectiveness in real-world applications.
View Article and Find Full Text PDF

The automated extraction of critical information from electronic medical records, such as oncological medical events, has become increasingly important with the widespread use of electronic health records. However, extracting tumor-related medical events can be challenging due to their unique characteristics. To address this difficulty, we propose a novel approach that utilizes Generative Adversarial Networks (GANs) for data augmentation and pseudo-data generation algorithms to improve the model's transfer learning skills for various tumor-related medical events.

View Article and Find Full Text PDF

Application of machine learning techniques to the analysis and prediction of drug pharmacokinetics.

J Control Release

December 2022

Department of Drug Delivery Research, Graduate School of Pharmaceutical Sciences, Kyoto University, Sakyo-ku, Kyoto 606-8501, Japan; Department of Applied Pharmacy and Pharmacokinetics, Graduate School of Pharmaceutical Sciences, Kyoto University, Sakyo-ku, Kyoto 606-8501, Japan. Electronic address:

In this review, we describe the current status and challenges in applying machine-learning techniques to the analysis and prediction of pharmacokinetic data. The theory of pharmacokinetics has been developed over decades on the basis of physiology and reaction kinetics. Mathematical models allow the reduction of pharmacokinetic data to parameter values, giving insight and understanding into ADME processes and predicting the outcome of different dosing scenarios.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!