Instance segmentation is crucial to modern agriculture and the management of pig farms. In practical farming environments, challenges arise due to the mutual adhesion, occlusion, and dynamic changes in body posture among pigs, making accurate segmentation of multiple target pigs complex. To address these challenges, we conducted experiments using video data captured from varying angles and non-fixed lenses. We selected 45 pigs aged between 20 and 105 days from eight pens as research subjects. Among these, 1917 images were meticulously labeled, with 959 images designated for the training set, 192 for validation, and 766 for testing. To enhance feature utilization and address limitations in the fusion process between bottom-up and top-down feature maps within the feature pyramid network (FPN) module of the YOLACT model, we propose a pixel self-attention (PSA) module, incorporating joint channel and spatial attention. The PSA module seamlessly integrates into multiple stages of the FPN feature extraction within the YOLACT model. We utilized ResNet50 and ResNet101 as backbone networks and compared performance metrics, including AP, AP, AP, and AR, between the YOLACT model with the PSA module and YOLACT models equipped with BAM, CBAM, and SCSE attention modules. Experimental results indicated that the PSA attention module outperforms BAM, CBAM, and SCSE, regardless of the selected backbone network. In particular, when employing ResNet101 as the backbone network, integrating the PSA module yields a 2.7% improvement over no attention, 2.3% over BAM, 2.4% over CBAM, and 2.1% over SCSE across the AP metric. We visualized prototype masks within YOLACT to elucidate the model's mechanism. Furthermore, we visualized the PSA attention to confirm its ability to capture valuable pig-related information. Additionally, we validated the transfer performance of our model on a top-down view dataset, affirming the robustness of the YOLACT model with the PSA module.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10705772 | PMC |
http://dx.doi.org/10.3390/ani13233591 | DOI Listing |
Neural Netw
January 2025
Key Laboratory of Smart Manufacturing in Energy Chemical Process, Ministry of Education, East China University of Science and Technology, Shanghai 200237, China; Center of Intelligent Computing, School of Mathematics, East China University of Science and Technology, Shanghai 200237, China. Electronic address:
Event-related potentials (ERPs) can reveal brain activity elicited by external stimuli. Innovative methods to decode ERPs could enhance the accuracy of brain-computer interface (BCI) technology and promote the understanding of cognitive processes. This paper proposes a novel Multi-Scale Pyramid Squeeze Attention Similarity Optimization Classification Neural Network (MS-PSA-SOC) for ERP Detection.
View Article and Find Full Text PDFAcad Radiol
January 2025
Department of Urology, Nanfang Hospital, Southern Medical University, Guangzhou, Guangdong 510515, China (B.Z., F.M., X.S., S.L., Q.W.); Department of Urology, Guangdong Provincial People's Hospital, Southern Medical University, Guangzhou, Guangdong 510080, China (Q.W.). Electronic address:
Rationale And Objectives: To develop an automatic deep-radiomics framework that diagnoses and stratifies prostate cancer in patients with prostate-specific antigen (PSA) levels between 4 and 10 ng/mL.
Materials And Methods: A total of 1124 patients with histological results and PSA levels between 4 and 10 ng/mL were enrolled from one public dataset and two local institutions. An nnUNet was trained for prostate masks, and a feature extraction module identified suspicious lesion masks.
Sensors (Basel)
November 2024
Changchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, Relative Pose Precision Measurement Laboratory, Jilin 130033, China.
Underwater object detection is highly complex and requires a high speed and accuracy. In this paper, an underwater target detection model based on YOLOv8 (SPSM-YOLOv8) is proposed. It solves the problems of high computational complexities, slow detection speeds and low accuracies.
View Article and Find Full Text PDFPeerJ Comput Sci
August 2024
School of Computer and Communication, Lanzhou University of Technology, Lanzhou, Gansu, China.
The generator, which combines convolutional neural network (CNN) and Transformer as its core modules, serves as the primary model for the handwriting font generation network and demonstrates effective performance. However, there are still problems with insufficient feature extraction in the overall structure of the font, the thickness of strokes, and the curvature of strokes, resulting in subpar detail in the generated fonts. To solve the problems, we propose a method for constructing a handwritten font generation model based on Pyramid Squeeze Attention, called PSA-HWT.
View Article and Find Full Text PDFBr J Gen Pract
November 2024
The University of Melbourne Centre for Cancer Research, Department of General Practice and Primary Care, Melbourne, Australia.
Unlabelled: Diagnosing cancer in general practice is complex, given the non-specific nature of many presenting symptoms and the overlap of potential diagnoses. This trial evaluated the effectiveness of a technology, Future Health Today (FHT), which provides clinical decision support, auditing, and quality improvement monitoring, on the appropriate follow-up of patients at risk of undiagnosed cancer.
Methods: Pragmatic, cluster randomised trial in Australian general practice.
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!