Dictionary Pruning with Visual Word Significance for Medical Image Retrieval.

Neurocomputing (Amst)

Dept of Informatics, University of Albany State University of New York, United States; Robotics Institute, Carnegie Mellon University, United States.

Published: February 2016

Content-based medical image retrieval (CBMIR) is an active research area for disease diagnosis and treatment but it can be problematic given the small visual variations between anatomical structures. We propose a retrieval method based on a bag-of-visual-words (BoVW) to identify discriminative characteristics between different medical images with Pruned Dictionary based on Latent Semantic Topic description. We refer to this as the PD-LST retrieval. Our method has two main components. First, we calculate a topic-word significance value for each visual word given a certain latent topic to evaluate how the word is connected to this latent topic. The latent topics are learnt, based on the relationship between the images and words, and are employed to bridge the gap between low-level visual features and high-level semantics. These latent topics describe the images and words semantically and can thus facilitate more meaningful comparisons between the words. Second, we compute an overall-word significance value to evaluate the significance of a visual word within the entire dictionary. We designed an iterative ranking method to measure overall-word significance by considering the relationship between all latent topics and words. The words with higher values are considered meaningful with more significant discriminative power in differentiating medical images. We evaluated our method on two public medical imaging datasets and it showed improved retrieval accuracy and efficiency.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5036942PMC
http://dx.doi.org/10.1016/j.neucom.2015.11.008DOI Listing

Publication Analysis

Top Keywords

visual word
12
latent topics
12
medical image
8
image retrieval
8
retrieval method
8
medical images
8
significance visual
8
latent topic
8
overall-word significance
8
latent
6

Similar Publications

The Role of Morphological Structure in Determining the Optimal Viewing Position During Visual Word Recognition in Beginning Readers.

Children (Basel)

November 2024

Univ. Lille, CNRS, UMR 9193-SCALab-Sciences Cognitives et Sciences Affectives, F-59000 Lille, France.

Background/objectives: The present study examines the role of morphemic units in the initial word recognition stage among beginning readers. We assess whether and to what extent sublexical units, such as morphemes, are used in processing French words and how their use varies with reading proficiency.

Methods: Two experiments were conducted to investigate the perceptual and morphological effects on the recognition of words presented in central vision, using a variable-viewing-position technique.

View Article and Find Full Text PDF

Native and Non-Native Speakers' Recognition of Chinese Two-Character Words in Audio Sentence Comprehension.

Behav Sci (Basel)

December 2024

College of Chinese Language and Literature, Qufu Normal University, No. 57, Jingxuan Road, Qufu 273165, China.

Two experiments were conducted to examine native and non-native speakers' recognition of Chinese two-character words (2C-words) in the context of audio sentence comprehension. The recording was played of a sentence, in which a collocation composed of a number word, a sortal classifier, and a noun (NCN) was embedded. When the participants were about to hear the noun of the NCN (Noun), the playing stopped, and a target was visually presented, which was the Noun, the character-transposed word of the Noun (NounT), or a control word (NounC), or was a homophone nonword for Noun, NounT, or NounC.

View Article and Find Full Text PDF

The visual word form area (VWFA) is a region in the left ventrotemporal cortex (VTC) whose specificity remains contentious. Using precision fMRI, we examine the VWFA's responses to numerous visual and nonvisual stimuli, comparing them to adjacent category-selective visual regions and regions involved in language and attentional demand. We find that VWFA responds moderately to non-word visual stimuli, but is unique within VTC in its pronounced selectivity for visual words.

View Article and Find Full Text PDF

Visual-linguistic Diagnostic Semantic Enhancement for medical report generation.

J Biomed Inform

December 2024

Department of Nasopharyngeal Carcinoma, Sun Yat-sen University Cancer Center, State Key Laboratory of Oncology in South China, Guangdong Key Laboratory of Nasopharyngeal Carcinoma Diagnosis and Therapy, Guangdong Provincial Clinical Research Center for Cancer, Guangzhou 510060, China. Electronic address:

Generative methods are currently popular for medical report generation, as they automatically generate professional reports from input images, assisting physicians in making faster and more accurate decisions. However, current methods face significant challenges: 1) Lesion areas in medical images are often difficult for models to capture accurately, and 2) even when captured, these areas are frequently not described using precise clinical diagnostic terms. To address these problems, we propose a Visual-Linguistic Diagnostic Semantic Enhancement model (VLDSE) to generate high-quality reports.

View Article and Find Full Text PDF

Introduction: Implicit statistical learning is, by definition, learning that occurs without conscious awareness. However, measures that putatively assess implicit statistical learning often require explicit reflection, for example, deciding if a sequence is 'grammatical' or 'ungrammatical'. By contrast, 'processing-based' tasks can measure learning without requiring conscious reflection, by measuring processes that are facilitated by implicit statistical learning.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!