Background: Textual datasets (corpora) are crucial for the application of natural language processing (NLP) models. However, corpus creation in the medical field is challenging, primarily because of privacy issues with raw clinical data such as health records. Thus, the existing clinical corpora are generally small and scarce. Medical NLP (MedNLP) methodologies perform well with limited data availability.
Objectives: We present the outcomes of the Real-MedNLP workshop, which was conducted using limited and parallel medical corpora. Real-MedNLP exhibits three distinct characteristics: (1) limited annotated documents: the training data comprise only a small set (∼100) of case reports (CRs) and radiology reports (RRs) that have been annotated. (2) Bilingually parallel: the constructed corpora are parallel in Japanese and English. (3) Practical tasks: the workshop addresses fundamental tasks, such as named entity recognition (NER) and applied practical tasks.
Methods: We propose three tasks: NER of ∼100 available documents (Task 1), NER based only on annotation guidelines for humans (Task 2), and clinical applications (Task 3) consisting of adverse drug effect (ADE) detection for CRs and identical case identification (CI) for RRs.
Results: Nine teams participated in this study. The best systems achieved 0.65 and 0.89 F1-scores for CRs and RRs in Task 1, whereas the top scores in Task 2 decreased by 50 to 70%. In Task 3, ADE reports were detected by up to 0.64 F1-score, and CI scored up to 0.96 binary accuracy.
Conclusion: Most systems adopt medical-domain-specific pretrained language models using data augmentation methods. Despite the challenge of limited corpus size in Tasks 1 and 2, recent approaches are promising because the partial match scores reached ∼0.8-0.9 F1-scores. Task 3 applications revealed that the different availabilities of external language resources affected the performance per language.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1055/a-2405-2489 | DOI Listing |
PLoS One
January 2025
Department of Computer Science, GC Women University Sialkot, Sialkot, Pakistan.
Modern dialogue systems rely on emotion recognition in conversation (ERC) as a core element enabling empathetic and human-like interactions. However, the weak correlation between emotions and semantics poses significant challenges to emotion recognition in dialogue. Semantically similar utterances can express different types of emotions, depending on the context or speaker.
View Article and Find Full Text PDFPLoS One
January 2025
Fisher Center for the Study and Conservation of Apes, Lincoln Park Zoo, Chicago, Illinois, United States of America.
The nature of western lowland gorilla social relationships within and between groups is largely understudied, partly due to the challenges of monitoring associations between individuals who live in neighboring groups. In this study, we examined the social relationships of four western lowland gorilla groups in the Ndoki landscape of northern Republic of Congo. To do so, we compiled all-occurrence social interaction and silverback nearest neighbor social networks from data collected during daily group follows conducted over several years.
View Article and Find Full Text PDFAlzheimers Dement
January 2025
Institute of Neurological and Psychiatric Disorders, Shenzhen Bay Laboratory, Shenzhen, China.
Introduction: Alzheimer's disease (AD) patients with higher educational attainment (EA) often exhibit better cognitive function. However, the relationship among EA status, AD pathology, structural brain reserve, and cognitive decline requires further investigation.
Methods: We compared cognitive performance across different amyloid beta (Aβ) positron emission tomography (A ±) statuses and EA levels (High EA/Low EA).
Eur J Neurosci
January 2025
Institute of Neuroscience (IONS), UCLouvain, Brussels, Belgium.
Experiencing music often entails the perception of a periodic beat. Despite being a widespread phenomenon across cultures, the nature and neural underpinnings of beat perception remain largely unknown. In the last decade, there has been a growing interest in developing methods to probe these processes, particularly to measure the extent to which beat-related information is contained in behavioral and neural responses.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!