Deep neural networks have achieved state-of-the-art performance in image classification. Due to this success, deep learning is now also being applied to other data modalities such as multispectral images, lidar and radar data. However, successfully training a deep neural network requires a large reddataset. Therefore, transitioning to a new sensor modality (e.g., from regular camera images to multispectral camera images) might result in a drop in performance, due to the limited availability of data in the new modality. This might hinder the adoption rate and time to market for new sensor technologies. In this paper, we present an approach to leverage the knowledge of a teacher network, that was trained using the original data modality, to improve the performance of a student network on a new data modality: a technique known in literature as knowledge distillation. By applying knowledge distillation to the problem of sensor transition, we can greatly speed up this process. We validate this approach using a multimodal version of the MNIST dataset. Especially when little data is available in the new modality (i.e., 10 images), training with additional teacher supervision results in increased performance, with the student network scoring a test set accuracy of 0.77, compared to an accuracy of 0.37 for the baseline. We also explore two extensions to the default method of knowledge distillation, which we evaluate on a multimodal version of the CIFAR-10 dataset: an annealing scheme for the hyperparameter α and selective knowledge distillation. Of these two, the first yields the best results. Choosing the optimal annealing scheme results in an increase in test set accuracy of 6%. Finally, we apply our method to the real-world use case of skin lesion classification.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8512581PMC
http://dx.doi.org/10.3390/s21196523DOI Listing

Publication Analysis

Top Keywords

knowledge distillation
20
data modality
16
deep neural
8
camera images
8
performance student
8
student network
8
multimodal version
8
test set
8
set accuracy
8
annealing scheme
8

Similar Publications

First Report of Causing Root Rot of Incense Cedar in Tennessee and the United States.

Plant Dis

January 2025

Tennessee State University, Otis Floyd Nursery Research Center, 472 Cadillac Lane, McMinnville, Tennessee, United States, 37110;

Incense cedar [ (Torr.) Florin] is a coniferous evergreen tree, indigenous to western North America, that is being evaluated in Tennessee for its adaptability to eastern U.S.

View Article and Find Full Text PDF

Blue honeysuckle (Lonicera caerulea L.) has been widely used in food, medicine, health products, cosmetics, materials, and other products. Between September 2022 and September 2023, a leaf spot disease was observed on approximately 20% of blue honeysuckle plants of the 'Lanjingling' cultivar grown in a 0.

View Article and Find Full Text PDF

Tumor detection on bronchoscopic images by unsupervised learning.

Sci Rep

January 2025

Department of Pulmonary and Critical Care Medicine, The Second Xiangya Hospital, Central South University, Changsha, 410011, Hunan, China.

The diagnosis and early identification of intratracheal tumors relies on the experience of the operators and the specialists. Operations by physicians with insufficient experience may lead to misdiagnosis or misjudgment of tumors. To address this issue, a datasets for intratracheal tumor detection has been constructed to simulate the diagnostic level of experienced specialists, and a Knowledge Distillation-based Memory Feature Unsupervised Anomaly Detection (KD-MFAD) model was proposed to learn from this simulated experience.

View Article and Find Full Text PDF

Diabetes prediction is an important topic in the field of medical health. Accurate prediction can help early intervention and reduce patients' health risks and medical costs. This paper proposes a data preprocessing method, including removing outliers, filling missing values, and using sparse autoencoder (SAE) feature enhancement.

View Article and Find Full Text PDF

Inspect quantitative signals in placental histopathology: Computer-assisted multiple functional tissues identification through multi-model fusion and distillation framework.

Comput Med Imaging Graph

December 2024

Shanghai Key Laboratory of Multidimensional Information Processing, School of Communication and Electronic Engineering, East China Normal University, Shanghai 200241, China. Electronic address:

Pathological analysis of placenta is currently a valuable tool for gaining insights into pregnancy outcomes. In placental histopathology, multiple functional tissues can be inspected as potential signals reflecting the transfer functionality between fetal and maternal circulations. However, the identification of multiple functional tissues is challenging due to (1) severe heterogeneity in texture, size and shape, (2) distribution across different scales and (3) the need for comprehensive assessment at the whole slide image (WSI) level.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!