Recent advancements in food image recognition have underscored its importance in dietary monitoring, which promotes a healthy lifestyle and aids in the prevention of diseases such as diabetes and obesity. While mainstream food recognition methods excel in scenarios with large-scale annotated datasets, they falter in few-shot regimes where data is limited. This paper addresses this challenge by introducing a variational generative method, the Multivariate Knowledge-guided Variational AutoEncoder (MK-VAE), for few-shot food recognition. MK-VAE leverages handcrafted features and semantic embeddings as multivariate prior knowledge to strengthen feature learning and feature generation in different phases. Specifically, we design a lightweight and flexible feature distillation module that distills handcrafted features to enhance the feature learning network for capturing the salient visual information in few-shot samples. During the feature generation phase, we utilize a variational autoencoder to learn the difference distribution of food data and explicitly boost the latent representation with category-level semantic embeddings to pull homogeneous features closer together while pushing inhomogeneous features apart. Experimental results demonstrate that our proposed MK-VAE significantly outperforms state-of-the-art few-shot food recognition methods in both 5-way 1-shot and 5-way 5-shot settings on three widely-used benchmark datasets: Food-101, VIREO Food-172, and UECFood-256.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1109/JBHI.2025.3550347 | DOI Listing |
IEEE J Biomed Health Inform
March 2025
Recent advancements in food image recognition have underscored its importance in dietary monitoring, which promotes a healthy lifestyle and aids in the prevention of diseases such as diabetes and obesity. While mainstream food recognition methods excel in scenarios with large-scale annotated datasets, they falter in few-shot regimes where data is limited. This paper addresses this challenge by introducing a variational generative method, the Multivariate Knowledge-guided Variational AutoEncoder (MK-VAE), for few-shot food recognition.
View Article and Find Full Text PDFPlant Phenomics
November 2024
Nanjing Forestry University, Nanjing 210037, China.
Accurate counting of cereals crops, e.g., maize, rice, sorghum, and wheat, is crucial for estimating grain production and ensuring food security.
View Article and Find Full Text PDFSensors (Basel)
September 2024
School of Information Engineering, China University of Geosciences, Beijing 100083, China.
Accurate crop disease classification is crucial for ensuring food security and enhancing agricultural productivity. However, the existing crop disease classification algorithms primarily focus on a single image modality and typically require a large number of samples. Our research counters these issues by using pre-trained Vision-Language Models (VLMs), which enhance the multimodal synergy for better crop disease classification than the traditional unimodal approaches.
View Article and Find Full Text PDFJ Biopharm Stat
September 2024
Data & Statistical Sciences, AbbVie Inc, North Chicago, Illinois, USA.
Biosens Bioelectron
January 2025
Beijing Laboratory of Food Quality and Safety, College of Engineering, China Agricultural University, Beijing, 100083, PR China; Sanya Institute, China Agricultural University, Sanya, 572024, PR China. Electronic address:
Human sensory techniques are inadequate for automating fish quality monitoring and maintaining controlled storage conditions throughout the supply chain. The dynamic monitoring of a single quality index cannot anticipate explicit freshness losses, which remarkably drops consumer acceptability. For the first time, a complete artificial sensory system is designed for the early detection of fish quality prediction.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!