In this paper, we propose a novel two-stage transformer with GhostNet, which improves the performance of the small object detection task. Specifically, based on the original Deformable Transformers for End-to-End Object Detection (deformable DETR), we chose GhostNet as the backbone to extract features, since it is better suited for an efficient feature extraction. Furthermore, at the target detection stage, we selected the 300 best bounding box results as which were subsequently set as primary object queries of the decoder layer. Finally, in the decoder layer, we optimized and modified the queries to increase the target accuracy. In order to validate the performance of the proposed model, we adopted a widely used COCO 2017 dataset. Extensive experiments demonstrated that the proposed scheme yielded a higher average precision (AP) score in detecting small objects than the existing deformable DETR model.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9503248PMC
http://dx.doi.org/10.3390/s22186939DOI Listing

Publication Analysis

Top Keywords

object detection
12
two-stage transformer
8
small object
8
deformable detr
8
decoder layer
8
ghostformer ghostnet-based
4
ghostnet-based two-stage
4
transformer small
4
object
4
detection
4

Similar Publications

MEVDT: Multi-modal event-based vehicle detection and tracking dataset.

Data Brief

February 2025

Department of Electrical and Computer Engineering, University of Michigan-Dearborn, 4901 Evergreen Rd, Dearborn, 48128 MI, USA.

In this data article, we introduce the Multi-Modal Event-based Vehicle Detection and Tracking (MEVDT) dataset. This dataset provides a synchronized stream of event data and grayscale images of traffic scenes, captured using the Dynamic and Active-Pixel Vision Sensor (DAVIS) 240c hybrid event-based camera. MEVDT comprises 63 multi-modal sequences with approximately 13k images, 5M events, 10k object labels, and 85 unique object tracking trajectories.

View Article and Find Full Text PDF

Robust kernel extreme learning machines for postgraduate learning performance prediction.

Heliyon

January 2025

College of Computer Science and Artificial Intelligence, Wenzhou University, Wenzhou, 325035, China.

In the context of graduate learning in China, mentors are the teachers with the highest frequency of contact and the closest relationships with postgraduate students. Nevertheless, a number of issues pertaining to the relationship between mentors and postgraduate students have emerged with increasing frequency in recent years, resulting in a notable decline in the quality of graduate education. In this paper, we investigate the influence of the relationship between mentors and postgraduate students on the postgraduate learning performance, with postgraduate students' admission motivation and learning pressure acting as moderating variables.

View Article and Find Full Text PDF

Human microbiota-associated murine models, using fecal microbiota transplantation (FMT) from human donors, help explore the microbiome's role in diseases like Alzheimer's disease (AD). This study examines how gut bacteria from donors with protective factors against AD influence behavior and brain pathology in an AD mouse model. Female 3xTgAD mice received weekly FMT for 2 months from (i) an 80-year-old AD patient (AD-FMT), (ii) a cognitively healthy 73-year-old with the protective APOEe2 allele (APOEe2-FMT), (iii) a 22-year-old healthy donor (Young-FMT), and (iv) untreated mice (Mice-FMT).

View Article and Find Full Text PDF

In clearance measurements involving a single material type, a conversion factor was applied to convert measurement results to activity based on an assumed uniform density. However, this factor has been found to underestimate activity in material mixtures. In this study, we proposed a method to identify the location with the lowest detection sensitivity (minimum location) in a mixture and evaluated its applicability to the conversion factor.

View Article and Find Full Text PDF

Image-Based Shrimp Aquaculture Monitoring.

Sensors (Basel)

January 2025

Instituto de Telecomunicações (IT), Instituto Superior Técnico, Universidade de Lisboa, 1049-001 Lisbon, Portugal.

Shrimp farming is a growing industry, and automating certain processes within aquaculture tanks is becoming increasingly important to improve efficiency. This paper proposes an image-based system designed to address four key tasks in an aquaculture tank with : estimating shrimp length and weight, counting shrimps, and evaluating feed pellet food attractiveness. A setup was designed, including a camera connected to a Raspberry Pi computer, to capture high-quality images around a feeding plate during feeding moments.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!