Visual attention advances object detection by attending neural networks to object representations. While existing methods incorporate empirical modules to empower network attention, we rethink attentive object detection from the network learning perspective in this work. We propose a NEural Attention Learning approach (NEAL) which consists of two parts. During the back-propagation of each training iteration, we first calculate the partial derivatives (a.k.a. the accumulated gradients) of the classification output with respect to the input features. We refine these partial derivatives to obtain attention response maps whose elements reflect the contributions to the final network predictions. Then, we formulate the attention response maps as extra objective functions, which are combined together with the original detection loss to train detectors in an end-to-end manner. In this way, we succeed in learning an attentive CNN model without introducing additional network structures. We apply NEAL to the two-stage object detection frameworks, which are usually composed of a CNN feature backbone, a region proposal network (RPN), and a classifier. We show that the proposed NEAL not only helps the RPN attend to objects but also enables the classifier to pay more attention to the premier positive samples. To this end, the localization (proposal generation) and classification mutually benefit from each other in our proposed method. Extensive experiments on large-scale benchmark datasets, including MS COCO 2017 and Pascal VOC 2012, demonstrate that the proposed NEAL algorithm advances the two-stage object detector over state-of-the-art approaches.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1109/TIP.2023.3251693 | DOI Listing |
Sci Rep
December 2024
College of Electrical Engineering, Anhui Polytechnic University, Wuhu, 241000, Anhui, China.
The quantity of cable conductors is a crucial parameter in cable manufacturing, and accurately detecting the number of conductors can effectively promote the digital transformation of the cable manufacturing industry. Challenges such as high density, adhesion, and knife mark interference in cable conductor images make intelligent detection of conductor quantity particularly difficult. To address these challenges, this study proposes the YOLO-cable model, which is an improvement made upon the YOLOv10 model.
View Article and Find Full Text PDFSci Rep
December 2024
School of Construction Machinery, Shandong Jiaotong University, Jinan, 250023, China.
Injection molded parts are increasingly utilized across various industries due to their cost-effectiveness, lightweight nature, and durability. However, traditional defect detection methods for these parts often rely on manual visual inspection, which is inefficient, expensive, and prone to errors. To enhance the accuracy of defect detection in injection molded parts, a new method called MRB-YOLO, based on the YOLOv8 model, has been proposed.
View Article and Find Full Text PDFSci Rep
December 2024
Physical Research Laboratory, Ahmedabad, Gujarat, 380009, India.
Talbot length, the distance between two consecutive self-image planes along the propagation axis for a periodic diffraction object (grating) illuminated by a plane wave, depends on the period of the object and the wavelength of illumination. This property makes the Talbot effect a straightforward technique for measuring the period of a periodic object (grating) by accurately determining the Talbot length for a given illumination wavelength. However, since the Talbot length scale is proportional to the square of the grating period, traditional Talbot techniques face challenges when dealing with smaller grating periods and minor changes in the grating period.
View Article and Find Full Text PDFJ Xray Sci Technol
December 2024
School of Electrical and Information Engineering, Tianjin University, Nankai District, Tianjin, China.
Background: Airport security is still a main concern for assuring passenger safety and stopping illegal activity. Dual-energy X-ray Imaging (DEXI) is one of the most important technologies for detecting hidden items in passenger luggage. However, noise in DEXI images, arising from various sources such as electronic interference and fluctuations in X-ray intensity, can compromise the effectiveness of object identification.
View Article and Find Full Text PDFNanomaterials (Basel)
December 2024
Department of Biomedical Engineering, City University of Hong Kong, 83 Tat Chee Avenue, Kowloon, Hong Kong 999077, China.
Multiplexed biosensing methods for simultaneously detecting multiple biomolecules are important for investigating biological mechanisms associated with physiological processes, developing applications in life sciences, and conducting medical tests. The development of biosensors, especially those advanced biosensors with multiplexing potentials, strongly depends on advancements in nanotechnologies, including the nano-coating of thin films, micro-nano 3D structures, and nanotags for signal generation. Surface functionalization is a critical process for biosensing applications, one which enables the immobilization of biological probes or other structures that assist in the capturing of biomolecules.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!