Accurate fruit detection is crucial for automated fruit picking. However, real-world scenarios, influenced by complex environmental factors such as illumination variations, occlusion, and overlap, pose significant challenges to accurate fruit detection. These challenges subsequently impact the commercialization of fruit harvesting robots. A tomato detection model named YOLO-SwinTF, based on YOLOv7, is proposed to address these challenges. Integrating Swin Transformer (ST) blocks into the backbone network enables the model to capture global information by modeling long-range visual dependencies. Trident Pyramid Networks (TPN) are introduced to overcome the limitations of PANet's focus on communication-based processing. TPN incorporates multiple self-processing (SP) modules within existing top-down and bottom-up architectures, allowing feature maps to generate new findings for communication. In addition, Focaler-IoU is introduced to reconstruct the original intersection-over-union (IoU) loss to allow the loss function to adjust its focus based on the distribution of difficult and easy samples. The proposed model is evaluated on a tomato dataset, and the experimental results demonstrated that the proposed model's detection recall, precision, F score, and AP reach 96.27%, 96.17%, 96.22%, and 98.67%, respectively. These represent improvements of 1.64%, 0.92%, 1.28%, and 0.88% compared to the original YOLOv7 model. When compared to other state-of-the-art detection methods, this approach achieves superior performance in terms of accuracy while maintaining comparable detection speed. In addition, the proposed model exhibits strong robustness under various lighting and occlusion conditions, demonstrating its significant potential in tomato detection.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11464322PMC
http://dx.doi.org/10.3389/fpls.2024.1452821DOI Listing

Publication Analysis

Top Keywords

tomato detection
12
yolov7 model
8
swin transformer
8
trident pyramid
8
pyramid networks
8
detection
8
accurate fruit
8
fruit detection
8
proposed model
8
model
6

Similar Publications

What does cancer screening have to do with tomato growing?

Clin Chem Lab Med

January 2025

Lunenfeld-Tanenbaum Research Institute, Mount Sinai Hospital, Sinai Health System, Toronto, Canada.

Cancer screening is considered to be a major strategy for combatting cancer. The United States Preventive Services Task Force (USPSTF) recommends screening for five cancers, but the strength of evidence about the effectiveness of screening is limited. To gain insights into the efficacy of early detection requires prospective, blinded, placebo-controlled clinical trials with decades of follow-up and inclusion of millions of participants.

View Article and Find Full Text PDF

Tomato ripeness and stem recognition based on improved YOLOX.

Sci Rep

January 2025

College of Information Science and Engineering, Shanxi Agricultural University, Jinzhong, 030800, China.

To address the challenges of unbalanced class labels with varying maturity levels of tomato fruits and low recognition accuracy for both fruits and stems in intelligent harvesting, we propose the YOLOX-SE-GIoU model for identifying tomato fruit maturity and stems. The SE focus module was incorporated into YOLOX to improve the identification accuracy, addressing the imbalance in the number of tomato fruits and stems. Additionally, we optimized the loss function to GIoU loss to minimize discrepancies across different scales of fruits and stems.

View Article and Find Full Text PDF

Root Development of Tomato Plants Infected by the Cacao Pathogen Moniliophthora perniciosa Is Affected by Limited Sugar Availability.

Plant Cell Environ

January 2025

Centro de Energia Nuclear na Agricultura, Universidade de São Paulo (USP), Piracicaba, São Paulo, Brazil.

Moniliophthora perniciosa is the causal agent of the witches' broom disease of cacao (Theobroma cacao), and it can infect the tomato (Solanum lycopersicum) 'Micro-Tom' (MT) cultivar. Typical symptoms of infection are stem swelling and axillary shoot outgrowth, whereas reduction in root biomass is another side effect. Using infected MT, we investigated whether impaired root growth derives from hormonal imbalance or sink competition.

View Article and Find Full Text PDF

Identifying weed species at early-growth stages is critical for precision agriculture. Accurate classification at the species-level enables targeted control measures, significantly reducing pesticide use. This paper presents a dataset of RGB images captured with a Sony ILCE-6300L camera mounted on an unmanned aerial vehicle (UAV) flying at an altitude of 11 m above ground level.

View Article and Find Full Text PDF

Attention Score-Based Multi-Vision Transformer Technique for Plant Disease Classification.

Sensors (Basel)

January 2025

Department of AI & Big Data, Honam University, Gwangju 62399, Republic of Korea.

This study proposes an advanced plant disease classification framework leveraging the Attention Score-Based Multi-Vision Transformer (Multi-ViT) model. The framework introduces a novel attention mechanism to dynamically prioritize relevant features from multiple leaf images, overcoming the limitations of single-leaf-based diagnoses. Building on the Vision Transformer (ViT) architecture, the Multi-ViT model aggregates diverse feature representations by combining outputs from multiple ViTs, each capturing unique visual patterns.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!