The wide adoption of electronic health records (EHRs) offers immense potential as a source of support for clinical research. However, previous studies focused on extracting only a limited set of medical concepts to support information extraction in the cancer domain for the Spanish language. Building on the success of deep learning for processing natural language texts, this paper proposes a transformer-based approach to extract named entities from breast cancer clinical notes written in Spanish and compares several language models. To facilitate this approach, a schema for annotating clinical notes with breast cancer concepts is presented, and a corpus for breast cancer is developed. Results indicate that both BERT-based and RoBERTa-based language models demonstrate competitive performance in clinical Named Entity Recognition (NER). Specifically, BETO and multilingual BERT achieve F-scores of 93.71% and 94.63%, respectively. Additionally, RoBERTa Biomedical attains an F-score of 95.01%, while RoBERTa BNE achieves an F-score of 94.54%. The findings suggest that transformers can feasibly extract information in the clinical domain in the Spanish language, with the use of models trained on biomedical texts contributing to enhanced results. The proposed approach takes advantage of transfer learning techniques by fine-tuning language models to automatically represent text features and avoiding the time-consuming feature engineering process.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.artmed.2023.102625DOI Listing

Publication Analysis

Top Keywords

breast cancer
16
language models
16
domain spanish
8
spanish language
8
clinical notes
8
clinical
6
language
6
cancer
5
transformers extracting
4
breast
4

Similar Publications

Aim: Breast cancer is the second most common cancer among women and the leading cause of cancer-related mortality in this population. Numerous factors have been identified as either risk factors or protective factors for breast cancer. However, the role of Vitamin D (Vit.

View Article and Find Full Text PDF

Purpose: To investigate the effects of compression therapy combined with exercise for cancer patients (EXCAP) in patients with peripheral neuropathy caused by breast cancer chemotherapy.

Methods: Overall, 108 patients with peripheral neuropathy after chemotherapy for breast cancer were randomly divided into the control group (routine nursing), experimental group 1 (compression therapy), and experimental group 2 (compression therapy and EXCAP). The National Institute of Cancer Drug Toxicity Rating Scale and the Chemotherapy-Induced Peripheral Neuropathy Assessment Tool were assessed and compared between groups.

View Article and Find Full Text PDF

Purpose: Over 50% of households in the United States have at least one musician-many musicians are also breast cancer survivors. This group has not been well studied, and given the level of fine sensory-motor skill required for musicianship, we hypothesized that musicians experience unique manifestations of breast cancer treatment toxicities.

Methods: A nine-item Musical Toxicity Questionnaire (MTQ) was distributed to patients who had consented to participate in the Mayo Clinic Breast Cancer Registry.

View Article and Find Full Text PDF

Rare Indocyanine-Induced Anaphylactic Shock During Deep Inferior Epigastric Artery Perforator Breast Reconstruction: A Case Report.

Ann Plast Surg

February 2025

From the Department of Plastic and Reconstructive Surgery, Ewha Womans University College of Medicine, Mokdong Hospital, Seoul, Republic of Korea.

Indocyanine green (ICG) is a water-soluble green substance that is detectable through infrared cameras and emits greenish light. Approved for medical use in the 1950s, ICG has gained prominence as a real-time visualization tool. Widely recognized as a generally safe substance, ICG is applied in diverse fields.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!