Reducing complexity of the pipeline of instance segmentation is crucial for real-world applications. This work addresses this problem by introducing an anchor-box free and single-shot instance segmentation framework, termed PolarMask++, which reformulates the instance segmentation problem as predicting the contours of objects in the polar coordinate, leading to several appealing benefits. (1) The polar representation unifies instance segmentation (masks) and object detection (bounding boxes) into a single framework, reducing the design and computational complexity. (2) We carefully design two modules (soft polar centerness and polar IoU loss) to sample high-quality center examples and optimize polar contour regression, making the performance of PolarMask++ does not depend on the bounding box prediction and thus more efficient in training. (3) PolarMask++ is fully convolutional and can be easily embedded into most off-the-shelf detectors. To further improve the accuracy of the framework, a Refined Feature Pyramid is introduced to improve the feature representation at different scales. Extensive experiments demonstrate the effectiveness of PolarMask++, which achieves competitive results on COCO dataset, and new state-of-the-art results on text detection and cell segmentation datasets. We hope polar representation can provide a new perspective for designing algorithms to solve single-shot instance segmentation. Code is released at: github.com/xieenze/PolarMask.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1109/TPAMI.2021.3080324 | DOI Listing |
Sci Rep
January 2025
Department of Industrial Engineering and Management, Ming Chi University of Technology, New Taipei City, 243, Taiwan.
This study develops the you only look once segmentation (YOLOSeg), an end-to-end instance segmentation model, with applications to segment small particle defects embedded on a wafer die. YOLOSeg uses YOLOv5s as the basis and extends a UNet-like structure to form the segmentation head. YOLOSeg can predict not only bounding boxes of particle defects but also the corresponding bounding polygons.
View Article and Find Full Text PDFSci Data
January 2025
Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC, 3800, Australia.
Primary malignant bone tumors are the third highest cause of cancer-related mortality among patients under the age of 20. X-ray scan is the primary tool for detecting bone tumors. However, due to the varying morphologies of bone tumors, it is challenging for radiologists to make a definitive diagnosis based on radiographs.
View Article and Find Full Text PDFWorld J Gastrointest Oncol
January 2025
Department of Medical College, Jinan University, Guangzhou 510000, Guangdong Province, China.
Background: Gallbladder neuroendocrine carcinoma (NEC) represents a subtype of gallbladder malignancies characterized by a low incidence, aggressive nature, and poor prognosis. Despite its clinical severity, the genetic alterations, mechanisms, and signaling pathways underlying gallbladder NEC remain unclear.
Case Summary: This case study presents a rare instance of primary gallbladder NEC in a 73-year-old female patient, who underwent a radical cholecystectomy with hepatic hilar lymphadenectomy and resection of liver segments IV-B and V.
Ophthalmol Sci
November 2024
Casey Eye Institute, Oregon Health & Science University, Portland, Oregon.
Purpose: The diagnosis of fungal keratitis using potassium hydroxide (KOH) smears of corneal scrapings enables initiation of the correct antimicrobial therapy at the point-of-care but requires time-consuming manual examination and expertise. This study evaluates the efficacy of a deep learning framework, dual stream multiple instance learning (DSMIL), in automating the analysis of whole slide imaging (WSI) of KOH smears for rapid and accurate detection of fungal infections.
Design: Retrospective observational study.
Waste Manag
January 2025
ZheJiang University, Department of Mechanical Engineering, ZheJiang, 310000, China.
With the rapid increase in end-of-life smartphones, enhancing the automation and intelligence of their recycling processes has become an urgent challenge. At present, the disassembly of discarded smartphones predominantly relies on manual labor, which is not only inefficient but also associated with environmental pollution and high labor intensity. In the context of end-of-life smartphone recycling, complex situations such as stacking and occlusion are commonly encountered.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!