In health sciences, high-quality text embeddings may augment qualitative data analysis of large amounts of text by enabling, e.g., searching and clustering of health information. This study aimed to evaluate three different sentence-level embedding methods in clustering sentences in nursing narratives from individual patients' hospital care episodes. Two of these embeddings are generated from language models based on the BERT framework, and the third on the Sent2Vec method. These embedding methods were used to cluster sentences from 20 patient care episodes and the results were manually evaluated. Findings suggest that the best clusters were produced by the embeddings from a BERT model fine-tuned for the proxy task of predicting subject headings for nursing text.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.3233/SHTI220606 | DOI Listing |
Ther Adv Infect Dis
January 2025
Department of Biostatistics, College of Public Health, University of Kentucky, Lexington, KY, USA.
Background: Kentucky is one of seven states with high, sustained rural HIV transmission tied to injection drug use. Expanding access to pre-exposure prophylaxis (PrEP) has been endorsed as a key HIV prevention strategy; however, uptake among people who inject drugs (PWID) has been negligible in rural areas. Syringe services programs (SSPs) have been implemented throughout Kentucky's Appalachian region, providing an important opportunity to integrate PrEP services.
View Article and Find Full Text PDFAlzheimers Dement (Amst)
January 2025
Introduction: Brain age gap (BAG), defined as the difference between MRI-predicted 'brain age' and chronological age, can capture information underlying various neurological disorders. We investigated the pathophysiological significance of the BAG across neurodegenerative disorders.
Methods: We developed a brain age estimator using structural MRIs of healthy-aged individuals from one cohort study.
BMC Bioinformatics
January 2025
Department of Information Technology, Vardhaman College of Engineering, Shamshabad, Hyderabad, India.
Background: Biomedical text mining is a technique that extracts essential information from scientific articles using named entity recognition (NER). Traditional NER methods rely on dictionaries, rules, or curated corpora, which may not always be accessible. To overcome these challenges, deep learning (DL) methods have emerged.
View Article and Find Full Text PDFNat Commun
January 2025
School of Information Science and Technology, Fudan University, Shanghai, China.
Accelerating the discovery of novel crystal materials by machine learning is crucial for advancing various technologies from clean energy to information processing. The machine-learning models for prediction of materials properties require embedding atomic information, while traditional methods have limited effectiveness in enhancing prediction accuracy. Here, we proposed an atomic embedding strategy called universal atomic embeddings (UAEs) for their broad applicability as atomic fingerprints, and generated the UAE tensors based on the proposed CrystalTransformer model.
View Article and Find Full Text PDFMod Pathol
January 2025
Hematopathology Service, Department of Pathology and Laboratory Medicine, Memorial Sloan Kettering Cancer Center, New York, NY, USA. Electronic address:
T-cell clonality assessment constitutes an essential part of the diagnostic evaluation of suspected T-cell neoplasms. Recent advances in flow cytometry-based analysis of TCR β chain constant region 1 (TRBC1) have introduced an accurate method of assessment of T-cell clonality. Its broader applicability is constrained due to the requirement of viable cells.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!