Many clinical natural language processing methods rely on non-contextual word embedding (NCWE) or contextual word embedding (CWE) models. Yet few, if any, intrinsic evaluation benchmarks compare embedding representations against clinician judgment. We developed intrinsic evaluation tasks for embedding models using a corpus of radiology reports: term pair similarity for NCWEs and cloze task accuracy for CWEs. Using surveys, we quantified the agreement between clinician judgment and embedding model representations. We compared embedding models trained on a custom radiology report corpus (RRC), a general corpus, and PubMed and MIMIC-III corpora (P&MC). Cloze task accuracy was equivalent for RRC and P&MC models. For term pair similarity, P&MC-trained NCWEs outperformed all other NCWE models (ρ = 0.61 vs. 0.27-0.44). Among models trained on RRC, fastText models often outperformed other NCWE models, and spherical embeddings provided overly optimistic representations of term pair similarity.
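
The two intrinsic evaluation tasks named above can be illustrated with a minimal sketch. It is not the authors' code: it assumes that ρ denotes Spearman's rank correlation, that term pair similarity is the cosine similarity between embedding vectors, and that cloze accuracy is the fraction of masked tokens predicted exactly. The terms, vectors, and ratings are made-up placeholders, and all function names are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm


def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman's rho: Pearson correlation computed on the ranks."""
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = sum(rx) / len(rx), sum(ry) / len(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    sx = math.sqrt(sum((a - mx) ** 2 for a in rx))
    sy = math.sqrt(sum((b - my) ** 2 for b in ry))
    return cov / (sx * sy)


def cloze_accuracy(predictions, answers):
    """Fraction of masked tokens whose prediction matches the answer."""
    return sum(p == a for p, a in zip(predictions, answers)) / len(answers)


# Placeholder NCWE vectors (assumption: real models use hundreds of dimensions).
embeddings = {
    "pneumothorax": [0.9, 0.1, 0.0],
    "effusion": [0.7, 0.3, 0.1],
    "fracture": [0.1, 0.9, 0.2],
    "catheter": [0.0, 0.2, 0.9],
}

# Placeholder term pairs with invented clinician similarity ratings.
rated_pairs = [
    ("pneumothorax", "effusion", 4.0),
    ("pneumothorax", "fracture", 2.0),
    ("effusion", "catheter", 1.5),
    ("fracture", "catheter", 1.0),
]

model_scores = [cosine(embeddings[a], embeddings[b]) for a, b, _ in rated_pairs]
clinician_scores = [rating for _, _, rating in rated_pairs]
rho = spearman(model_scores, clinician_scores)
print(f"term pair similarity, Spearman rho = {rho:.2f}")
```

Because rank correlation only compares orderings, a model whose similarity scores are uniformly inflated can still achieve a high ρ, so absolute similarity values would need to be inspected separately from the correlation.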

Source: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8861761
