Deep learning-based models have been shown to outperform human beings in many computer vision tasks with massive available labeled training data in learning. However, humans have an amazing ability to easily recognize images of novel categories by browsing only a few examples of these categories. In this case, few-shot learning comes into being to make machines learn from extremely limited labeled examples. One possible reason why human beings can well learn novel concepts quickly and efficiently is that they have sufficient visual and semantic prior knowledge. Toward this end, this work proposes a novel knowledge-guided semantic transfer network (KSTNet) for few-shot image recognition from a supplementary perspective by introducing auxiliary prior knowledge. The proposed network jointly incorporates vision inferring, knowledge transferring, and classifier learning into one unified framework for optimal compatibility. A category-guided visual learning module is developed in which a visual classifier is learned based on the feature extractor along with the cosine similarity and contrastive loss optimization. To fully explore prior knowledge of category correlations, a knowledge transfer network is then developed to propagate knowledge information among all categories to learn the semantic-visual mapping, thus inferring a knowledge-based classifier for novel categories from base categories. Finally, we design an adaptive fusion scheme to infer the desired classifiers by effectively integrating the above knowledge and visual information. Extensive experiments are conducted on two widely used Mini-ImageNet and Tiered-ImageNet benchmarks to validate the effectiveness of KSTNet. Compared with the state of the art, the results show that the proposed method achieves favorable performance with minimal bells and whistles, especially in the case of one-shot learning.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TNNLS.2023.3240195DOI Listing

Publication Analysis

Top Keywords

transfer network
12
prior knowledge
12
knowledge-guided semantic
8
semantic transfer
8
few-shot image
8
image recognition
8
human beings
8
novel categories
8
knowledge
7
learning
5

Similar Publications

Background: 'Intellectual assets' generated in traditional university settings, that may not fit the interests of the standard 'valuation criteria' (i.e. commercially profitable), such as non-pharmacological dementia care research, often remain siloed within their respective research disciplines and originating institutions.

View Article and Find Full Text PDF

Photothermal therapy (PTT) demonstrates significant potential in cancer treatment, wound healing, and antibacterial therapy, with its efficacy largely depending on the performance of photothermal agents (PTAs). Metal-phenolic network (MPN) materials are ideal PTA candidates due to their low cost, good biocompatibility and excellent ligand-to-metal charge transfer properties. However, not all MPNs exhibit significant photothermal properties, and the vast chemical space of MPNs (over 700,000 potential combinations) complicates the screening of high-photothermal materials.

View Article and Find Full Text PDF

Objective: Extracorporeal membrane oxygenation (ECMO) is a high-risk procedure with significant morbidity and mortality and there is an uncertain volume-outcome relationship, especially regarding long-term functional outcomes. The aim of this study was to examine the association between ECMO centre volume and long-term death and disability outcomes.

Design Setting And Participants: This is a registry-embedded observational cohort study.

View Article and Find Full Text PDF

Driver fatigue recognition using limited amount of individual electroencephalogram.

Biomed Eng Lett

January 2025

Department of Biomedical Engineering, College of Health Science, Yonsei University, Wonju, Republic of Korea.

Unlabelled: This study aims to create a fatigue recognition system that utilizes electroencephalogram (EEG) signals to assess a driver's physiological and mental state, with the goal of minimizing the risk of road accidents by detecting driver fatigue regardless of physical cues or vehicle attributes. A fatigue state recognition system was developed using transfer learning applied to partial ensemble averaged EEG power spectral density (PSD). The study utilized layer-wise relevance propagation (LRP) analysis to identify critical cortical regions and frequency bands for effective fatigue discrimination.

View Article and Find Full Text PDF

Effective BCDNet-based breast cancer classification model using hybrid deep learning with VGG16-based optimal feature extraction.

BMC Med Imaging

January 2025

Department of Information Technology, Manipal Institute of Technology Bengaluru, Manipal Academy of Higher Education, Manipal, Karnataka, 576104, India.

Problem: Breast cancer is a leading cause of death among women, and early detection is crucial for improving survival rates. The manual breast cancer diagnosis utilizes more time and is subjective. Also, the previous CAD models mostly depend on manmade visual details that are complex to generalize across ultrasound images utilizing distinct techniques.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!