Pixel Self-Attention Guided Real-Time Instance Segmentation for Group Raised Pigs.

Zongwei Jia Zhichuan Wang Chenyu Zhao Ningning Zhang Xinyue Wen Zhiwei Hu

Animals (Basel)

College of Information Science and Engineering, Shanxi Agricultural University, Jinzhong 030801, China.

Published: November 2023

Instance segmentation is essential for managing pig farms, but challenges like occlusion and varying postures complicate accurate identification of multiple pigs.
Researchers conducted experiments with video data from 45 pigs, labeling 1917 images to improve segmentation using a new pixel self-attention (PSA) module integrated into the YOLACT model.
Results showed that the PSA module outperformed existing attention mechanisms, achieving significant improvements in performance metrics, especially with the ResNet101 backbone, and confirming its robustness through visualization and transfer performance tests.

Instance segmentation is crucial to modern agriculture and the management of pig farms. In practical farming environments, challenges arise due to the mutual adhesion, occlusion, and dynamic changes in body posture among pigs, making accurate segmentation of multiple target pigs complex. To address these challenges, we conducted experiments using video data captured from varying angles and non-fixed lenses. We selected 45 pigs aged between 20 and 105 days from eight pens as research subjects. Among these, 1917 images were meticulously labeled, with 959 images designated for the training set, 192 for validation, and 766 for testing. To enhance feature utilization and address limitations in the fusion process between bottom-up and top-down feature maps within the feature pyramid network (FPN) module of the YOLACT model, we propose a pixel self-attention (PSA) module, incorporating joint channel and spatial attention. The PSA module seamlessly integrates into multiple stages of the FPN feature extraction within the YOLACT model. We utilized ResNet50 and ResNet101 as backbone networks and compared performance metrics, including AP, AP, AP, and AR, between the YOLACT model with the PSA module and YOLACT models equipped with BAM, CBAM, and SCSE attention modules. Experimental results indicated that the PSA attention module outperforms BAM, CBAM, and SCSE, regardless of the selected backbone network. In particular, when employing ResNet101 as the backbone network, integrating the PSA module yields a 2.7% improvement over no attention, 2.3% over BAM, 2.4% over CBAM, and 2.1% over SCSE across the AP metric. We visualized prototype masks within YOLACT to elucidate the model's mechanism. Furthermore, we visualized the PSA attention to confirm its ability to capture valuable pig-related information. Additionally, we validated the transfer performance of our model on a top-down view dataset, affirming the robustness of the YOLACT model with the PSA module.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10705772	PMC
http://dx.doi.org/10.3390/ani13233591	DOI Listing

Publication Analysis

Top Keywords

psa module

yolact model

pixel self-attention

instance segmentation

module yolact

resnet101 backbone

model psa

bam cbam

cbam scse

psa attention

Similar Publications

Multi-Scale Pyramid Squeeze Attention Similarity Optimization Classification Neural Network for ERP Detection.

Neural Netw

January 2025

Key Laboratory of Smart Manufacturing in Energy Chemical Process, Ministry of Education, East China University of Science and Technology, Shanghai 200237, China; Center of Intelligent Computing, School of Mathematics, East China University of Science and Technology, Shanghai 200237, China. Electronic address:

Ruitian Xu Brendan Z Allison Xueqing Zhao Wei Liang Xingyu Wang

Event-related potentials (ERPs) can reveal brain activity elicited by external stimuli. Innovative methods to decode ERPs could enhance the accuracy of brain-computer interface (BCI) technology and promote the understanding of cognitive processes. This paper proposes a novel Multi-Scale Pyramid Squeeze Attention Similarity Optimization Classification Neural Network (MS-PSA-SOC) for ERP Detection.

View Article and Find Full Text PDF

Similar Publications

An Automatic Deep-Radiomics Framework for Prostate Cancer Diagnosis and Stratification in Patients with Serum Prostate-Specific Antigen of 4.0-10.0 ng/mL: A Multicenter Retrospective Study.

Acad Radiol

January 2025

Department of Urology, Nanfang Hospital, Southern Medical University, Guangzhou, Guangdong 510515, China (B.Z., F.M., X.S., S.L., Q.W.); Department of Urology, Guangdong Provincial People's Hospital, Southern Medical University, Guangzhou, Guangdong 510080, China (Q.W.). Electronic address:

Bowen Zheng Futian Mo Xiaoran Shi Wenhao Li Quanyou Shen

Rationale And Objectives: To develop an automatic deep-radiomics framework that diagnoses and stratifies prostate cancer in patients with prostate-specific antigen (PSA) levels between 4 and 10 ng/mL.

Materials And Methods: A total of 1124 patients with histological results and PSA levels between 4 and 10 ng/mL were enrolled from one public dataset and two local institutions. An nnUNet was trained for prostate masks, and a feature extraction module identified suspicious lesion masks.

View Article and Find Full Text PDF

Similar Publications

Attention-Based Lightweight YOLOv8 Underwater Target Recognition Algorithm.

Sensors (Basel)

November 2024

Changchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, Relative Pose Precision Measurement Laboratory, Jilin 130033, China.

Shun Cheng Zhiqian Wang Shaojin Liu Yan Han Pengtao Sun

Underwater object detection is highly complex and requires a high speed and accuracy. In this paper, an underwater target detection model based on YOLOv8 (SPSM-YOLOv8) is proposed. It solves the problems of high computational complexities, slow detection speeds and low accuracies.

View Article and Find Full Text PDF

Similar Publications

PSA-HWT: handwritten font generation based on pyramid squeeze attention.

PeerJ Comput Sci

August 2024

School of Computer and Communication, Lanzhou University of Technology, Lanzhou, Gansu, China.

Hong Zhao Jinhai Huang Wengai Li Zhaobin Chang Weijie Wang

The generator, which combines convolutional neural network (CNN) and Transformer as its core modules, serves as the primary model for the handwriting font generation network and demonstrates effective performance. However, there are still problems with insufficient feature extraction in the overall structure of the font, the thickness of strokes, and the curvature of strokes, resulting in subpar detail in the generated fonts. To solve the problems, we propose a method for constructing a handwritten font generation model based on Pyramid Squeeze Attention, called PSA-HWT.

View Article and Find Full Text PDF

Similar Publications

Future Health Today: A pragmatic cluster randomised trial of quality improvement activities in general practice for patients at risk of undiagnosed cancer.

Br J Gen Pract

November 2024

The University of Melbourne Centre for Cancer Research, Department of General Practice and Primary Care, Melbourne, Australia.

Sophie Chima Javiera Martinez Gutierrez Barbara Hunter Adrian Laughlin Patty Chondros

Unlabelled: Diagnosing cancer in general practice is complex, given the non-specific nature of many presenting symptoms and the overlap of potential diagnoses. This trial evaluated the effectiveness of a technology, Future Health Today (FHT), which provides clinical decision support, auditing, and quality improvement monitoring, on the appropriate follow-up of patients at risk of undiagnosed cancer.

Methods: Pragmatic, cluster randomised trial in Australian general practice.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!