Supervised image classification in computer vision relies on large numbers of labeled visual samples. Because collecting and labeling images and constructing datasets by hand is labor-intensive, Zero-Shot Learning (ZSL) transfers knowledge from seen categories to unseen categories by mining auxiliary information, which reduces the dependence on labeled image samples and makes it an active research topic in computer vision. However, most ZSL methods either fail to measure the relationships between classes properly or ignore the differences and similarities between classes altogether. In this paper, we propose the Adaptive Relation-Aware Network (ARAN), a novel ZSL approach that incorporates an improved triplet loss from deep metric learning into a VAE-based generative model. This helps to model inter-class and intra-class relationships across the classes in ZSL datasets and to generate an arbitrary number of high-quality visual features that carry more discriminative information. We validate the effectiveness and superior performance of ARAN through experiments under both the ZSL setting and the more practical generalized ZSL (GZSL) setting on three popular datasets: AWA2, CUB, and SUN.
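
The abstract does not spell out ARAN's improved triplet loss or its network architecture, so the following is only a minimal sketch of the general idea: a VAE-style objective (reconstruction plus KL divergence) with a standard triplet margin term added. The function names, the squared Euclidean distance, and the weights `beta`, `lam`, and `margin` are all illustrative assumptions, not the authors' formulation.

```python
import math


def squared_distance(u, v):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((a - b) ** 2 for a, b in zip(u, v))


def triplet_margin_loss(anchor, positive, negative, margin=1.0):
    """Standard triplet loss (not the paper's improved variant).

    Pulls the anchor toward a same-class sample (positive) and pushes it
    away from a different-class sample (negative) by at least `margin`.
    """
    gap = squared_distance(anchor, positive) - squared_distance(anchor, negative)
    return max(0.0, gap + margin)


def gaussian_kl(mu, log_var):
    """KL divergence from N(mu, diag(exp(log_var))) to the standard normal."""
    return -0.5 * sum(1.0 + lv - m * m - math.exp(lv) for m, lv in zip(mu, log_var))


def vae_triplet_objective(x, x_recon, mu, log_var,
                          anchor, positive, negative,
                          beta=1.0, lam=0.5, margin=1.0):
    """Hypothetical combined objective: reconstruction + beta*KL + lam*triplet."""
    recon = squared_distance(x, x_recon)
    kl = gaussian_kl(mu, log_var)
    triplet = triplet_margin_loss(anchor, positive, negative, margin)
    return recon + beta * kl + lam * triplet
```

In this sketch the triplet term is zero whenever the negative sample is already farther from the anchor than the positive by at least the margin, so only triplets that violate the margin contribute to the objective.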

Source
http://dx.doi.org/10.1016/j.neunet.2024.106227

Similar Publications

Article Synopsis
  • Unsupervised domain adaptation (UDA) in remote sensing image classification allows for the use of well-labeled data from one scene to classify another scene without supervision, but it faces challenges in multisource data scenarios due to data heterogeneity and incomplete feature representation.
  • The GeIraA-Net model is proposed to overcome these challenges by enhancing knowledge transfer at the class level through a graph embedding approach that captures both local and global features of multisource data.
  • To improve alignment and classification accuracy, GeIraA-Net employs a joint de-scrambling strategy and an adaptive inter-class topology classifier, leading to significant improvements in classification performance compared to traditional methods.

Traditional pest recognition methods struggle with diverse pest species, varying sizes and morphologies, and complex field backgrounds, which lowers recognition accuracy. To overcome these limitations, this paper proposes a novel pest recognition method based on an attention mechanism and multi-scale feature fusion (AM-MSFF). By combining the advantages of the attention mechanism and multi-scale feature fusion, the method significantly improves pest recognition accuracy.

LigBind: Identifying Binding Residues for Over 1000 Ligands with Relation-Aware Graph Neural Networks.

J Mol Biol

July 2023

Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, and Key Laboratory of System Control and Information Processing, Ministry of Education of China, Shanghai 200240, China. Electronic address:

Article Synopsis
  • Identifying interactions between proteins and ligands is crucial for drug discovery, but many current methods overlook shared preferences among ligands and are limited in scope.
  • The study introduces LigBind, a novel framework that uses graph neural networks to improve predictions of binding residues for 1,159 ligands, including those with limited known binding data.
  • LigBind shows strong performance on large datasets and can accurately predict binding residues in key SARS-CoV-2 proteins, with resources available online for researchers.

To serve a user properly, an intelligent robot requires an episodic memory that can retrieve the sequence of events for a service task learned from past experience. Various episodic memories that learn new tasks incrementally without forgetting previously learned ones have been designed based on adaptive resonance theory (ART) networks. Conventional ART-based episodic memories, however, cannot adapt to changing environments.
