Binary Transformer Based on the Alignment and Correction of Distribution.

Sensors (Basel)

Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, China.

Published: December 2024

Transformer is a powerful model widely used in artificial intelligence applications. It contains complex structures and has extremely high computational requirements that are not suitable for embedded intelligent sensors with limited computational resources. The binary quantization technology takes up less memory space and has a faster calculation speed; however, it is seldom studied for the lightweight transformer. Compared with full-precision networks, the key bottleneck lies in the distribution shift problem caused by the existing binary quantization methods. To tackle this problem, the feature distribution alignment operation in binarization is investigated. The median shift and mean restore is designed to ensure consistency between the binary feature distribution and the full-precision transformer. Then, a knowledge distillation architecture for distribution correction is developed, which has a teacher-student structure comprising a full-precision and binary transformer, to further rectify the feature distribution of the binary student network to ensure the completeness and accuracy of the data. Experimental results on the CIFAR10, CIFAR100, ImageNet-1k, and TinyImageNet datasets show the effectiveness of the proposed binary optimization model, which outperforms the previous state-of-the-art binarization mechanisms while maintaining the same computational complexity.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11680024PMC
http://dx.doi.org/10.3390/s24248190DOI Listing

Publication Analysis

Top Keywords

feature distribution
12
binary transformer
8
binary quantization
8
binary
7
distribution
6
transformer based
4
based alignment
4
alignment correction
4
correction distribution
4
transformer
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!