Accurate fruit detection is crucial for automated fruit picking. However, real-world scenarios, influenced by complex environmental factors such as illumination variations, occlusion, and overlap, pose significant challenges to accurate fruit detection. These challenges subsequently impact the commercialization of fruit harvesting robots. A tomato detection model named YOLO-SwinTF, based on YOLOv7, is proposed to address these challenges. Integrating Swin Transformer (ST) blocks into the backbone network enables the model to capture global information by modeling long-range visual dependencies. Trident Pyramid Networks (TPN) are introduced to overcome the limitations of PANet's focus on communication-based processing. TPN incorporates multiple self-processing (SP) modules within existing top-down and bottom-up architectures, allowing feature maps to generate new findings for communication. In addition, Focaler-IoU is introduced to reconstruct the original intersection-over-union (IoU) loss to allow the loss function to adjust its focus based on the distribution of difficult and easy samples. The proposed model is evaluated on a tomato dataset, and the experimental results demonstrated that the proposed model's detection recall, precision, F score, and AP reach 96.27%, 96.17%, 96.22%, and 98.67%, respectively. These represent improvements of 1.64%, 0.92%, 1.28%, and 0.88% compared to the original YOLOv7 model. When compared to other state-of-the-art detection methods, this approach achieves superior performance in terms of accuracy while maintaining comparable detection speed. In addition, the proposed model exhibits strong robustness under various lighting and occlusion conditions, demonstrating its significant potential in tomato detection.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11464322 | PMC |
http://dx.doi.org/10.3389/fpls.2024.1452821 | DOI Listing |
Clin Chem Lab Med
January 2025
Lunenfeld-Tanenbaum Research Institute, Mount Sinai Hospital, Sinai Health System, Toronto, Canada.
Cancer screening is considered to be a major strategy for combatting cancer. The United States Preventive Services Task Force (USPSTF) recommends screening for five cancers, but the strength of evidence about the effectiveness of screening is limited. To gain insights into the efficacy of early detection requires prospective, blinded, placebo-controlled clinical trials with decades of follow-up and inclusion of millions of participants.
View Article and Find Full Text PDFSci Rep
January 2025
College of Information Science and Engineering, Shanxi Agricultural University, Jinzhong, 030800, China.
To address the challenges of unbalanced class labels with varying maturity levels of tomato fruits and low recognition accuracy for both fruits and stems in intelligent harvesting, we propose the YOLOX-SE-GIoU model for identifying tomato fruit maturity and stems. The SE focus module was incorporated into YOLOX to improve the identification accuracy, addressing the imbalance in the number of tomato fruits and stems. Additionally, we optimized the loss function to GIoU loss to minimize discrepancies across different scales of fruits and stems.
View Article and Find Full Text PDFPlant Cell Environ
January 2025
Centro de Energia Nuclear na Agricultura, Universidade de São Paulo (USP), Piracicaba, São Paulo, Brazil.
Moniliophthora perniciosa is the causal agent of the witches' broom disease of cacao (Theobroma cacao), and it can infect the tomato (Solanum lycopersicum) 'Micro-Tom' (MT) cultivar. Typical symptoms of infection are stem swelling and axillary shoot outgrowth, whereas reduction in root biomass is another side effect. Using infected MT, we investigated whether impaired root growth derives from hormonal imbalance or sink competition.
View Article and Find Full Text PDFData Brief
February 2025
Institute of Agricultural Sciences, Spanish National Research Council (ICA-CSIC), Serrano 115b, 28006 Madrid, Spain.
Identifying weed species at early-growth stages is critical for precision agriculture. Accurate classification at the species-level enables targeted control measures, significantly reducing pesticide use. This paper presents a dataset of RGB images captured with a Sony ILCE-6300L camera mounted on an unmanned aerial vehicle (UAV) flying at an altitude of 11 m above ground level.
View Article and Find Full Text PDFSensors (Basel)
January 2025
Department of AI & Big Data, Honam University, Gwangju 62399, Republic of Korea.
This study proposes an advanced plant disease classification framework leveraging the Attention Score-Based Multi-Vision Transformer (Multi-ViT) model. The framework introduces a novel attention mechanism to dynamically prioritize relevant features from multiple leaf images, overcoming the limitations of single-leaf-based diagnoses. Building on the Vision Transformer (ViT) architecture, the Multi-ViT model aggregates diverse feature representations by combining outputs from multiple ViTs, each capturing unique visual patterns.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!