An FPGA-Based YOLOv5 Accelerator for Real-Time Industrial Vision Applications.

Micromachines (Basel)

Institute of Information Science, Beijing Jiaotong University, Beijing 100044, China.

Published: September 2024

The You Only Look Once (YOLO) object detection network has garnered widespread adoption in various industries, owing to its superior inference speed and robust detection capabilities. This model has proven invaluable in automating production processes such as material processing, machining, and quality inspection. However, as market competition intensifies, there is a constant demand for higher detection speed and accuracy. Current FPGA accelerators based on 8-bit quantization have struggled to meet these increasingly stringent performance requirements. In response, we present a novel 4-bit quantization-based neural network accelerator for the YOLOv5 model, designed to enhance real-time processing capabilities while maintaining high detection accuracy. To achieve effective model compression, we introduce an optimized quantization scheme that reduces the bit-width of the entire YOLO network-including the first layer-to 4 bits, with only a 1.5% degradation in mean Average Precision (mAP). For the hardware implementation, we propose a unified Digital Signal Processor (DSP) packing scheme, coupled with a novel parity adder tree architecture that accommodates the proposed quantization strategies. This approach efficiently reduces on-chip DSP utilization by 50%, offering a significant improvement in performance and resource efficiency. Experimental results show that the industrial object detection system based on the proposed FPGA accelerator achieves a throughput of 808.6 GOPS and an efficiency of 0.49 GOPS/DSP for YOLOv5s on the ZCU102 board, which is 29% higher than a commercial FPGA accelerator design (Xilinx's Vitis AI).

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11434529PMC
http://dx.doi.org/10.3390/mi15091164DOI Listing

Publication Analysis

Top Keywords

object detection
8
fpga accelerator
8
detection
5
fpga-based yolov5
4
accelerator
4
yolov5 accelerator
4
accelerator real-time
4
real-time industrial
4
industrial vision
4
vision applications
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!