Over the past decade, both multi-label learning and zero-shot learning have attracted considerable research attention, and significant progress has been made. Multi-label learning algorithms aim to predict multiple labels for a single instance, while most existing zero-shot learning approaches aim to predict a single testing label for each unseen class by transferring knowledge from auxiliary seen classes to target unseen classes. However, relatively little effort has been devoted to predicting multiple labels in the zero-shot setting, which remains a challenging task. In this work, we investigate and formalize a flexible framework consisting of two components: visual-semantic embedding and zero-shot multi-label prediction. First, we present a deep regression model that projects the visual features into the semantic space; it explicitly exploits the correlations in the intermediate semantic layer of word vectors and makes label prediction possible. Then, we formulate label prediction as a pairwise problem and employ Ranking SVM to seek the unique multi-label correlations in the embedding space. Furthermore, we provide a transductive multi-label zero-shot prediction approach that exploits the manifold structure of the testing data. We demonstrate the effectiveness of the proposed approach on three popular multi-label datasets, obtaining state-of-the-art performance in both the conventional and generalized zero-shot learning (ZSL) settings.
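
The two-stage idea in the abstract (embed visual features into the word-vector space, then rank candidate labels there) can be illustrated with a toy sketch. This is not the authors' implementation: a closed-form ridge regression stands in for the deep regression model, plain cosine-similarity ranking stands in for the Ranking SVM stage, and the transductive step is omitted. All names, dimensions, and the synthetic data generator are hypothetical.

```python
import numpy as np

def fit_ridge_embedding(X, S, lam=0.1):
    """Closed-form ridge regression: W maps visual features X (n, d)
    to semantic targets S (n, k). A linear stand-in for a deep regressor."""
    d = X.shape[1]
    return np.linalg.solve(X.T @ X + lam * np.eye(d), X.T @ S)

def rank_labels(x, W, label_vecs):
    """Embed one instance and return label indices sorted by
    descending cosine similarity to the embedded point."""
    z = x @ W
    z = z / (np.linalg.norm(z) + 1e-12)
    unit = label_vecs / (np.linalg.norm(label_vecs, axis=1, keepdims=True) + 1e-12)
    return np.argsort(-(unit @ z))

rng = np.random.default_rng(0)
n, d, k = 500, 128, 64          # samples, visual dim, semantic dim (all assumed)
n_seen, n_unseen = 100, 4

# Hypothetical "word vectors" for seen and unseen labels.
seen_vecs = rng.normal(size=(n_seen, k))
unseen_vecs = rng.normal(size=(n_unseen, k))

# Hypothetical generator turning a semantic vector into visual features.
A = rng.normal(size=(k, d))

# Each training image carries two seen labels; its regression target
# is the mean of the two label word vectors.
train_labels = np.array([rng.choice(n_seen, size=2, replace=False) for _ in range(n)])
S_train = seen_vecs[train_labels].mean(axis=1)
X_train = S_train @ A + 0.01 * rng.normal(size=(n, d))

W = fit_ridge_embedding(X_train, S_train)

# A test image whose true labels are the unseen labels 0 and 2.
x_test = unseen_vecs[[0, 2]].mean(axis=0) @ A
ranking = rank_labels(x_test, W, unseen_vecs)
print("top-2 predicted unseen labels:", sorted(ranking[:2].tolist()))
```

In this toy setup the seen label vectors span the whole semantic space, which is what lets the learned mapping transfer to unseen labels; real word vectors and image features offer no such guarantee.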


Source
http://dx.doi.org/10.1109/TIP.2020.2991527

Publication Analysis

Top Keywords

zero-shot multi-label: 8
multi-label learning: 8
zero-shot learning: 8
multiple labels: 8
label prediction: 8
multi-label: 7
zero-shot: 5
deep ranking: 4
ranking image: 4
image zero-shot: 4

Similar Publications

A zero-shot attribute-embedded model with a feature difference mapping sigmoid function for compound fault diagnosis of rotating machinery.

ISA Trans

December 2024

State Key Laboratory of Mechanical Transmission for Advanced Equipment, Chongqing University, Chongqing 400044, PR China.

Article Synopsis
  • Current methods for detecting machinery compound faults struggle due to the lack of available training data, as collecting sufficient compound fault samples is often impractical in engineering.
  • The paper introduces a zero-shot attribute-embedded model (ZSAECFD), which allows for diagnosing unseen compound faults using only single fault data by constructing attribute prototypes and utilizing a new activation function, F-sigmoid.
  • The model demonstrates high diagnostic accuracy—81.82% for bearing faults and 88.17% for gear faults—showing its effectiveness compared to traditional methods, even without training on compound fault data.

In fully supervised learning-based medical image classification, the robustness of a trained model is influenced by its exposure to the range of candidate disease classes. Generalized Zero Shot Learning (GZSL) aims to correctly predict seen and novel unseen classes. Current GZSL approaches have focused mostly on the single-label case.


Towards long-tailed, multi-label disease classification from chest X-ray: Overview of the CXR-LT challenge.

Med Image Anal

October 2024

Department of Population Health Sciences, Weill Cornell Medicine, 10065, New York, NY, USA.

Many real-world image recognition problems, such as diagnostic medical imaging exams, are "long-tailed" - there are a few common findings followed by many more relatively rare conditions. In chest radiography, diagnosis is both a long-tailed and multi-label problem, as patients often present with multiple findings simultaneously. While researchers have begun to study the problem of long-tailed learning in medical image recognition, few have studied the interaction of label imbalance and label co-occurrence posed by long-tailed, multi-label disease classification.


Towards long-tailed, multi-label disease classification from chest X-ray: Overview of the CXR-LT challenge.

ArXiv

April 2024

Department of Population Health Sciences, Weill Cornell Medicine, 10065, New York, NY USA.

Many real-world image recognition problems, such as diagnostic medical imaging exams, are "long-tailed" - there are a few common findings followed by many more relatively rare conditions. In chest radiography, diagnosis is both a long-tailed and multi-label problem, as patients often present with multiple findings simultaneously. While researchers have begun to study the problem of long-tailed learning in medical image recognition, few have studied the interaction of label imbalance and label co-occurrence posed by long-tailed, multi-label disease classification.


Multi-label Zero-shot Learning (ZSL) is more reasonable and realistic than standard single-label ZSL because several objects can co-exist in a natural image in real scenarios. Intra-class feature entanglement is a significant factor influencing the alignment of visual and semantic features, resulting in the model's inability to recognize unseen samples comprehensively and completely. We observe that existing multi-label ZSL methods place a greater emphasis on attention-based refinement and decoupling of visual features, while ignoring the relationship between label semantics.

