The paper demonstrates a novel methodology for Content-Based Image Retrieval (CBIR), which shifts the focus from conventional domain-specific image queries to more complex text-based query processing. Latent diffusion models are employed to interpret complex textual prompts and address the requirements of effectively interpreting the complex textual query. Latent Diffusion models successfully transform complex textual queries into visually engaging representations, establishing a seamless connection between textual descriptions and visual content. Custom triplet network design is at the heart of our retrieval method. When trained well, a triplet network will represent the generated query image and the different images in the database. The cosine similarity metric is used to assess the similarity between the feature representations in order to find and retrieve the relevant images. Our experiments results show that latent diffusion models can successfully bridge the gap between complex textual prompts for image retrieval without relying on labels or metadata that are attached to database images. This advancement sets the stage for future explorations in image retrieval, leveraging the generative AI capabilities to cater to the ever-evolving demands of big data and complex query interpretations.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11204759 | PMC |
http://dx.doi.org/10.3390/jimaging10060139 | DOI Listing |
Proc Natl Acad Sci U S A
January 2025
Guangdong Institute of Intelligence Science and Technology, 519031 Hengqin, Zhuhai, Guangdong, China.
Manifold learning techniques have emerged as crucial tools for uncovering latent patterns in high-dimensional single-cell data. However, most existing dimensionality reduction methods primarily rely on 2D visualization, which can distort true data relationships and fail to extract reliable biological information. Here, we present DTNE (diffusive topology neighbor embedding), a dimensionality reduction framework that faithfully approximates manifold distance to enhance cellular relationships and dynamics.
View Article and Find Full Text PDFComput Biol Med
January 2025
Hamburg University of Technology, Hamburg, Germany.
The application of supervised models to clinical screening tasks is challenging due to the need for annotated data for each considered pathology. Unsupervised Anomaly Detection (UAD) is an alternative approach that aims to identify any anomaly as an outlier from a healthy training distribution. A prevalent strategy for UAD in brain MRI involves using generative models to learn the reconstruction of healthy brain anatomy for a given input image.
View Article and Find Full Text PDFmedRxiv
January 2025
Department of Anesthesiology, Duke University Medical Center, Durham, NC.
Postoperative delirium is the most common postsurgical complication in older adults and is associated with an increased risk of long-term cognitive decline and Alzheimer's disease (AD) and related dementias (ADRD). However, the neurological basis of this increased risk-whether postoperative delirium unmasks latent preoperative pathology or leads to AD-relevant pathology after perioperative brain injury-remains unclear. Recent advancements in neuroimaging techniques now enable the detection of subtle brain features or damage that may underlie clinical symptoms.
View Article and Find Full Text PDFComput Biol Med
January 2025
Department of Electrical and Computer Engineering, University of Florida, Gainesville, FL, 32610, United States; Department of Medicine, University of Florida, Gainesville, FL, 32610, United States; Department of Health Outcomes and Biomedical Informatics, University of Florida, Gainesville, FL, 32610, United States; Intelligent Clinical Care Center, University of Florida, Gainesville, FL, 32610, United States. Electronic address:
Retinal image registration is essential for monitoring eye diseases and planning treatments, yet it remains challenging due to large deformations, minimal overlap, and varying image quality. To address these challenges, we propose RetinaRegNet, a multi-stage image registration model with zero-shot generalizability across multiple retinal imaging modalities. RetinaRegNet begins by extracting image features using a pretrained latent diffusion model.
View Article and Find Full Text PDFDiagnostics (Basel)
December 2024
Department of Internal Medicine, Division of Rheumatology, Mayo Clinic, Jacksonville, FL 32224, USA.
Pulmonary involvement is commonly observed in anti-neutrophil cytoplasmic antibody (ANCA)-associated vasculitis (AAV), presenting with manifestations such as diffuse alveolar hemorrhage, inflammatory infiltrates, pulmonary nodules, and tracheobronchial disease. We aimed to identify distinct subgroups of tracheobronchial disease patterns in patients with anti-neutrophil cytoplasmic antibody (ANCA)-associated vasculitis (AAV) using latent class analysis (LCA), and to evaluate their clinical characteristics and outcomes. We conducted a retrospective cohort study using electronic medical records of patients aged >18 years diagnosed with AAV and tracheobronchial disease between 1 January 2002 and 6 September 2022.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!