Article Abstract

Images from social media can reflect diverse viewpoints, heated arguments, and expressions of creativity, adding new complexity to retrieval tasks. Researchers working on Content-Based Image Retrieval (CBIR) have traditionally tuned their algorithms to match filtered results with user search intent. However, we are now bombarded with composite images of unknown origin, authenticity, and even meaning. With such uncertainty, users may not have an initial idea of what the search query results should look like. For instance, hidden people, spliced objects, and subtly altered scenes can be difficult for a user to detect initially in a meme image, but may contribute significantly to its composition. It is pertinent to design systems that retrieve images with these nuanced relationships in addition to providing more traditional results, such as duplicates and near-duplicates - and to do so with enough efficiency at large scale. We propose a new approach for spatial verification that aims at modeling object-level regions using image keypoints retrieved from an image index, which is then used to accurately weight small contributing objects within the results, without the need for costly object detection steps. We call this method the Objects in Scene to Objects in Scene (OS2OS) score, and it is optimized for fast matrix operations, which can run quickly on either CPUs or GPUs. It performs comparably to state-of-the-art methods on classic CBIR problems (Oxford 5K, Paris 6K, and Google-Landmarks), and outperforms them in emerging retrieval tasks such as image composite matching in the NIST MFC2018 dataset and meme-style imagery from Reddit.
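The abstract does not spell out how the OS2OS score is computed, so the following is not the authors' method. It is only a minimal sketch of the general idea of spatial verification with vectorized matrix operations: given keypoints already matched between a query image and a database image, it measures how many pairs of matches keep the same inter-point distance in both images. The function names, the tolerance parameter `tol`, and the choice of a rigid (translation and rotation only) consistency test are all assumptions made for illustration.

```python
import numpy as np


def pairwise_distances(points):
    """Return the (n, n) matrix of Euclidean distances between 2-D points."""
    diff = points[:, None, :] - points[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))


def consistency_score(query_pts, db_pts, tol=0.1):
    """Fraction of match pairs whose inter-point distance agrees in both images.

    query_pts[i] and db_pts[i] are the coordinates of the i-th matched
    keypoint in the query and database image (hypothetical inputs).
    Two distances agree when their relative difference is at most `tol`.
    """
    q = np.asarray(query_pts, dtype=float)
    d = np.asarray(db_pts, dtype=float)
    n = len(q)
    if n < 2:
        return 0.0  # no pairs to compare

    # Keep each unordered pair once (upper triangle, diagonal excluded).
    upper = np.triu_indices(n, k=1)
    dist_q = pairwise_distances(q)[upper]
    dist_d = pairwise_distances(d)[upper]

    # Relative difference, guarded against division by zero.
    denom = np.maximum(np.maximum(dist_q, dist_d), 1e-12)
    agree = np.abs(dist_q - dist_d) / denom <= tol
    return float(agree.mean())


if __name__ == "__main__":
    query = np.array([[0, 0], [10, 0], [0, 10], [10, 10]], dtype=float)
    shifted = query + 5.0           # same layout, translated
    print(consistency_score(query, shifted))
```

Because every step is an array operation, the same computation can be batched over many candidate images, which is the kind of workload that maps well onto CPUs or GPUs. A check like this would need extending to handle scale changes, and it does not by itself weight small objects within a scene.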

Source
http://dx.doi.org/10.1109/TIP.2021.3097175

Publication Analysis

Top Keywords

spatial verification: 8
image retrieval: 8
retrieval tasks: 8
objects scene: 8
image: 6
fast local: 4
local spatial: 4
verification feature-agnostic: 4
feature-agnostic large-scale: 4
large-scale image: 4

Similar Publications

This paper presents a Regeneration filter for reducing near Salt-and-Pepper (nS&P) noise in images, designed for selective noise removal while simultaneously preserving structural details. Unlike conventional methods, the proposed filter eliminates the need for median or other filters, focusing exclusively on restoring noise-affected pixels through localized contextual analysis in the immediate surroundings. Our approach employs an iterative processing method, where additional iterations do not degrade the image quality achieved after the first filtration, even with high noise densities up to 97% spatial distribution.

Remote sensing change detection (RSCD), which utilizes dual-temporal images to predict change locations, plays an essential role in long-term Earth observation missions. Although many deep learning based RSCD models perform well, challenges remain in effectively extracting change information between dual-temporal images and fully leveraging interactions between their feature maps. To address these challenges, a constraint- and interaction-based network (CINet) for RSCD is proposed.

Epilepsy Prediction and Detection Using Attention-CssCDBN with Dual-Task Learning.

Sensors (Basel)

December 2024

Laboratory of Ethnic Language Intelligent Analysis and Security Governance of MOE, Minzu University of China, Beijing 100081, China.

Epilepsy is a group of neurological disorders characterized by epileptic seizures, and it affects tens of millions of people worldwide. Currently, the most effective diagnostic method employs the monitoring of brain activity through electroencephalogram (EEG). However, it is critical to predict epileptic seizures in patients prior to their onset, allowing for the administration of preventive medications before the seizure occurs.

Background: High-dose-rate (HDR) brachytherapy using Iridium-192 as a radiation source is widely employed in cancer treatment to deliver concentrated radiation doses while minimizing normal tissue exposure. In this treatment, the precision with which the sealed radioisotope source is delivered significantly impacts clinical outcomes.

Purpose: This study aims to evaluate the feasibility of a new four-dimensional (4D) in vivo source tracking and treatment verification system for HDR brachytherapy using a patient-specific approach.

Observation-based verification of regional/national methane (CH₄) emission trends is crucial for transparent monitoring and mitigation strategy planning. Although surface observations track the global and sub-hemispheric emission trends well, their sparse spatial coverage limits our ability to assess regional trends. Dense satellite observations complement surface observations, offering a valuable means to validate emission trends, especially in regions where emissions changes are substantial but debated.
