Deep metric learning turns to be attractive in zero-shot image retrieval and clustering (ZSRC) task in which a good embedding/metric is requested such that the unseen classes can be distinguished well. Most existing works deem this "good" embedding just to be the discriminative one and race to devise the powerful metric objectives or the hard-sample mining strategies for learning discriminative deep metrics. However, in this article, we first emphasize that the generalization ability is also a core ingredient of this "good" metric and it largely affects the metric performance in zero-shot settings as a matter of fact. Then, we propose the confusion-based metric learning (CML) framework to explicitly optimize a robust metric. It is mainly achieved by introducing two interesting regularization terms, i.e., the energy confusion (EC) and diversity confusion (DC) terms. These terms daringly break away from the traditional deep metric learning idea of designing discriminative objectives and instead seek to "confuse" the learned model. These two confusion terms focus on local and global feature distribution confusions, respectively. We train these confusion terms together with the conventional deep metric objective in an adversarial manner. Although it seems weird to "confuse" the model learning, we show that our CML indeed serves as an efficient regularization framework for deep metric learning and it is applicable to various conventional metric methods. This article empirically and experimentally demonstrates the importance of learning an embedding/metric with good generalization, achieving the state-of-the-art performances on the popular CUB, CARS, Stanford Online Products, and In-Shop datasets for ZSRC tasks.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TNNLS.2022.3185668DOI Listing

Publication Analysis

Top Keywords

metric learning
20
deep metric
16
confusion terms
12
metric
10
confusion-based metric
8
learning
8
zero-shot image
8
image retrieval
8
retrieval clustering
8
learning cml
8

Similar Publications

Artificial intelligence-based quantitative coronary angiography (AI-QCA) was introduced to address manual QCA's limitations in reproducibility and correction process. The present study aimed to assess the performance of an updated AI-QCA solution (MPXA-2000) in lesion detection and quantification using manual QCA as the reference standard, and to demonstrate its superiority over visual estimation. This multi-center retrospective study analyzed 1,076 coronary angiography images obtained from 420 patients, comparing AI-QCA and visual estimation against manual QCA as the reference standard.

View Article and Find Full Text PDF

Dilated SE-DenseNet for brain tumor MRI classification.

Sci Rep

January 2025

Department of Applied Mathematics, University of Waterloo, Waterloo, ON, N2L 3G1, Canada.

In the field of medical imaging, particularly MRI-based brain tumor classification, we propose an advanced convolutional neural network (CNN) leveraging the DenseNet-121 architecture, enhanced with dilated convolutional layers and Squeeze-and-Excitation (SE) networks' attention mechanisms. This novel approach aims to improve upon state-of-the-art methods of tumor identification. Our model, trained and evaluated on a comprehensive Kaggle brain tumor dataset, demonstrated superior performance over established convolution-based and transformer-based models: ResNet-101, VGG-19, original DenseNet-121, MobileNet-V2, ViT-L/16, and Swin-B across key metrics: F1-score, accuracy, precision, and recall.

View Article and Find Full Text PDF

Background And Objective: Serum protein electrophoresis (SPEP) plays a critical role in diagnosing diseases associated with M-proteins. However, its clinical application is limited by a heavy reliance on experienced experts.

Methods: A dataset comprising 85,026 SPEP outcomes was utilized to develop artificial intelligence diagnostic models for the classification and localization of M-proteins.

View Article and Find Full Text PDF

Object detection models and active learning for improvement of e-waste collection management systems in Korea.

Waste Manag

January 2025

Department of Industrial and Systems Engineering, Kyonggi University, Suwon, Republic of Korea. Electronic address:

This study identifies and analyzes issues within the management system of the waste home appliances free pickup service and seeks to enhance the system by using an object detection model. To overcome the limitations of manually inspecting approximately 5,000 collections per day, the YOLOv8 model was implemented. Photos for proof of collection, which were difficult to verify visually, were excluded from the image data.

View Article and Find Full Text PDF

Bananas (Musa spp.) are a critical global food crop, providing a primary source of nutrition for millions of people. Traditional methods for disease monitoring and detection are often time-consuming, labor-intensive, and prone to inaccuracies.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!