Unsupervised Content Mining in CBIR: Harnessing Latent Diffusion for Complex Text-Based Query Interpretation.

J Imaging

Department of EECE, GITAM School of Technology, GITAM Deemed to be University, Rushikonda, Visakhapatnam 530045, India.

Published: June 2024

The paper demonstrates a novel methodology for Content-Based Image Retrieval (CBIR), which shifts the focus from conventional domain-specific image queries to more complex text-based query processing. Latent diffusion models are employed to interpret complex textual prompts and address the requirements of effectively interpreting the complex textual query. Latent Diffusion models successfully transform complex textual queries into visually engaging representations, establishing a seamless connection between textual descriptions and visual content. Custom triplet network design is at the heart of our retrieval method. When trained well, a triplet network will represent the generated query image and the different images in the database. The cosine similarity metric is used to assess the similarity between the feature representations in order to find and retrieve the relevant images. Our experiments results show that latent diffusion models can successfully bridge the gap between complex textual prompts for image retrieval without relying on labels or metadata that are attached to database images. This advancement sets the stage for future explorations in image retrieval, leveraging the generative AI capabilities to cater to the ever-evolving demands of big data and complex query interpretations.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11204759PMC
http://dx.doi.org/10.3390/jimaging10060139DOI Listing

Publication Analysis

Top Keywords

latent diffusion
16
complex textual
16
image retrieval
12
diffusion models
12
complex text-based
8
text-based query
8
textual prompts
8
triplet network
8
complex
7
query
5

Similar Publications

Manifold learning techniques have emerged as crucial tools for uncovering latent patterns in high-dimensional single-cell data. However, most existing dimensionality reduction methods primarily rely on 2D visualization, which can distort true data relationships and fail to extract reliable biological information. Here, we present DTNE (diffusive topology neighbor embedding), a dimensionality reduction framework that faithfully approximates manifold distance to enhance cellular relationships and dynamics.

View Article and Find Full Text PDF

The application of supervised models to clinical screening tasks is challenging due to the need for annotated data for each considered pathology. Unsupervised Anomaly Detection (UAD) is an alternative approach that aims to identify any anomaly as an outlier from a healthy training distribution. A prevalent strategy for UAD in brain MRI involves using generative models to learn the reconstruction of healthy brain anatomy for a given input image.

View Article and Find Full Text PDF

Postoperative delirium is the most common postsurgical complication in older adults and is associated with an increased risk of long-term cognitive decline and Alzheimer's disease (AD) and related dementias (ADRD). However, the neurological basis of this increased risk-whether postoperative delirium unmasks latent preoperative pathology or leads to AD-relevant pathology after perioperative brain injury-remains unclear. Recent advancements in neuroimaging techniques now enable the detection of subtle brain features or damage that may underlie clinical symptoms.

View Article and Find Full Text PDF

RetinaRegNet: A zero-shot approach for retinal image registration.

Comput Biol Med

January 2025

Department of Electrical and Computer Engineering, University of Florida, Gainesville, FL, 32610, United States; Department of Medicine, University of Florida, Gainesville, FL, 32610, United States; Department of Health Outcomes and Biomedical Informatics, University of Florida, Gainesville, FL, 32610, United States; Intelligent Clinical Care Center, University of Florida, Gainesville, FL, 32610, United States. Electronic address:

Retinal image registration is essential for monitoring eye diseases and planning treatments, yet it remains challenging due to large deformations, minimal overlap, and varying image quality. To address these challenges, we propose RetinaRegNet, a multi-stage image registration model with zero-shot generalizability across multiple retinal imaging modalities. RetinaRegNet begins by extracting image features using a pretrained latent diffusion model.

View Article and Find Full Text PDF

Distribution of Airway Findings in ANCA-Associated Vasculitis: A 20-Year Observational Analysis.

Diagnostics (Basel)

December 2024

Department of Internal Medicine, Division of Rheumatology, Mayo Clinic, Jacksonville, FL 32224, USA.

Pulmonary involvement is commonly observed in anti-neutrophil cytoplasmic antibody (ANCA)-associated vasculitis (AAV), presenting with manifestations such as diffuse alveolar hemorrhage, inflammatory infiltrates, pulmonary nodules, and tracheobronchial disease. We aimed to identify distinct subgroups of tracheobronchial disease patterns in patients with anti-neutrophil cytoplasmic antibody (ANCA)-associated vasculitis (AAV) using latent class analysis (LCA), and to evaluate their clinical characteristics and outcomes. We conducted a retrospective cohort study using electronic medical records of patients aged >18 years diagnosed with AAV and tracheobronchial disease between 1 January 2002 and 6 September 2022.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!