In order to solve the problem of weak single domain generalization ability in existing crowd counting methods, this study proposes a new crowd counting framework called Multi-scale Attention and Hierarchy level Enhancement (MAHE). Firstly, the model can focus on both the detailed features and the macro information of structural position changes through the fusion of channel attention and spatial attention. Secondly, the addition of multi-head attention feature module facilitates the model's capacity to effectively capture complex dependency relationships between sequence elements. In addition, the three-stage encoding and decoding processing mode enables the model to effectively represent crowd density information. Finally, the fusion of multi-scale features derived from different receptive fields is further enhanced through multi-scale hierarchy level feature fusion, thereby enabling the model to learn high-level semantic information and low-level multi-scale visual field feature information. This method enhances the model's capacity to capture key feature information, even in highly differentiated datasets, thereby improving the model's generalization ability on a single domain. The model has demonstrated strong generalization capabilities through extensive experiments on different datasets. This study not only improves the accuracy of crowd counting, but also introduces a new research approach for single domain generalization of crowd counting.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1038/s41598-024-83725-5 | DOI Listing |
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11696471 | PMC |
Sci Rep
January 2025
School of Computer and Communication Engineering, Changsha University of Science and Technology, Changsha, 410114, China.
In order to solve the problem of weak single domain generalization ability in existing crowd counting methods, this study proposes a new crowd counting framework called Multi-scale Attention and Hierarchy level Enhancement (MAHE). Firstly, the model can focus on both the detailed features and the macro information of structural position changes through the fusion of channel attention and spatial attention. Secondly, the addition of multi-head attention feature module facilitates the model's capacity to effectively capture complex dependency relationships between sequence elements.
View Article and Find Full Text PDFNeural Netw
December 2024
College of Computer Science and Engineering, Shandong University of Science and Technology, Qingdao 266590, China. Electronic address:
Currently, for obtaining more accurate counts, existing methods primarily utilize RGB images combined with features of complementary modality (X-modality) for counting. However, designing a model that can adapt to various sensors is still an unsolved issue due to the differences in features between different modalities. Therefore, this paper proposes a unified fusion framework called CMFX for RGB-X crowd counting.
View Article and Find Full Text PDFCPT Pharmacometrics Syst Pharmacol
December 2024
Clinical Pharmacology, AbbVie Inc., Ludwigshafen am Rhein, Germany.
Myelodysplastic syndromes (MDS) represent a group of bone marrow disorders involving cytopenias, hypercellular bone marrow, and dysplastic hematopoietic progenitors. MDS remains a challenge to treat due to the complex interplay between disease-induced and treatment-related cytopenias. Venetoclax, a selective BCL-2 inhibitor, in combination with azacitidine, a hypomethylating agent, is currently being investigated in patients with previously untreated higher-risk MDS.
View Article and Find Full Text PDFFront Psychol
November 2024
Department of Psychology, University of Bamberg, Bamberg, Germany.
In city centers worldwide, including the UNESCO World Heritage Site of Bamberg's old town in Germany, alleviating pedestrian overcrowding is a pressing concern. Leveraging crowd-counting technologies with real-time data collection offers promising solutions, yet poses challenges regarding data privacy and informed consent. This preregistered study examines public response to a Smart City Bamberg project aimed at addressing pedestrian congestion through crowd-counting methods.
View Article and Find Full Text PDFSci Rep
November 2024
College of Computer and Information Engineering, Henan Normal University, Xinxiang, 453007, China.
The automatic detection and counting of wheat spike images are of great significance for yield prediction and variety evaluation. Therefore, accurate and timely estimation of spike numbers is crucial for wheat production. However, in actual production, due to the susceptibility of wheat spike images to factors such as lighting conditions, shooting angles, occlusion, and overlap, the contour and features of wheat spike is unclear, which affects the accuracy of automatic detection and counting of wheat spike.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!