Minimizing prediction uncertainty on unlabeled data is a key factor in achieving good performance in semi-supervised learning (SSL). The prediction uncertainty is typically expressed as the entropy computed from the transformed probabilities in the output space. Most existing works distill low-entropy predictions by either accepting the class with the largest probability as the true label or suppressing subtle predictions (those with smaller probabilities). Arguably, these distillation strategies are heuristic and less informative for model training. Motivated by this observation, this article proposes a dual mechanism, named adaptive sharpening (ADS), which first applies a soft threshold to adaptively mask out determinate and negligible predictions, and then seamlessly sharpens the informed predictions, distilling certain predictions from the informed ones only. More importantly, we theoretically analyze the traits of ADS by comparing it with various distillation strategies. Extensive experiments verify that ADS significantly improves state-of-the-art SSL methods when used as a plug-in. Our proposed ADS forges a cornerstone for future distillation-based SSL research.
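The abstract describes ADS only at a high level (soft-threshold masking followed by sharpening). The snippet below is a minimal illustrative sketch, not the authors' implementation: it assumes a soft-thresholding step that shrinks class probabilities by a threshold `tau` (zeroing out negligible classes) followed by temperature sharpening with `T` and renormalization. The function name `adaptive_sharpening` and both parameters are hypothetical stand-ins for the paper's actual formulation.

```python
import numpy as np

def adaptive_sharpening(probs, tau=0.1, T=0.5, eps=1e-12):
    """Illustrative ADS-style distillation step (sketch, not the paper's exact method).

    probs: (N, C) array of predicted class probabilities on unlabeled data.
    tau:   soft-threshold; probabilities at or below tau are masked to zero.
    T:     sharpening temperature (T < 1 lowers the entropy of the target).
    """
    p = np.asarray(probs, dtype=np.float64)
    # Soft-threshold: shrink every probability toward zero by tau and clip at zero,
    # so negligible class predictions are masked out entirely.
    shrunk = np.maximum(p - tau, 0.0)
    # Temperature sharpening applied only to the surviving (informed) classes.
    sharpened = shrunk ** (1.0 / T)
    # Renormalize to obtain a valid, lower-entropy target distribution.
    return sharpened / (sharpened.sum(axis=1, keepdims=True) + eps)

# Example: a moderately confident prediction over 4 classes becomes a
# sharper target with the two smallest classes suppressed or removed.
p = np.array([[0.55, 0.25, 0.15, 0.05]])
print(adaptive_sharpening(p, tau=0.1, T=0.5))
```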

Source
DOI: http://dx.doi.org/10.1109/TNNLS.2023.3274845

