Recently, there has been growing interest in deep spectral methods for image localization and segmentation, influenced by traditional spectral segmentation approaches. These methods reframe the image decomposition process as a graph partitioning task by extracting features using self-supervised learning and utilizing the Laplacian of the affinity matrix to obtain eigensegments. However, instance segmentation has received less attention than other tasks within the context of deep spectral methods. This paper addresses that not all channels of the feature map extracted from a self-supervised backbone contain sufficient information for instance segmentation purposes. Some channels are noisy and hinder the accuracy of the task. To overcome this issue, this paper proposes two channel reduction modules, Noise Channel Reduction (NCR) and Deviation-based Channel Reduction (DCR). The NCR retains channels with lower entropy, as they are less likely to be noisy, while DCR prunes channels with low standard deviation, as they lack sufficient information for effective instance segmentation. Furthermore, the paper demonstrates that the dot product, commonly used in deep spectral methods, is not suitable for instance segmentation due to its sensitivity to feature map values, potentially leading to incorrect instance segments. A novel similarity metric called Bray-curtis over Chebyshev (BoC) is proposed to address this issue. This metric considers the distribution of features in addition to their values, providing a more robust similarity measure for instance segmentation. Quantitative and qualitative results on the Youtube-VIS 2019 and OVIS datasets highlight the improvements achieved by the proposed channel reduction methods and using BoC instead of the conventional dot product for creating the affinity matrix. These improvements regarding mean Intersection over Union (mIoU) and extracted instance segments are observed, demonstrating enhanced instance segmentation performance. The code is available on: https://github.com/farnooshar/SpecUnIIS.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11458003 | PMC |
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0307432 | PLOS |
J Imaging
January 2025
School of Computer Science and Artificial Intelligence, Changzhou University, Changzhou 213000, China.
Manual labeling of lesions in medical image analysis presents a significant challenge due to its labor-intensive and inefficient nature, which ultimately strains essential medical resources and impedes the advancement of computer-aided diagnosis. This paper introduces a novel medical image-segmentation framework named Efficient Generative-Adversarial U-Net (EGAUNet), designed to facilitate rapid and accurate multi-organ labeling. To enhance the model's capability to comprehend spatial information, we propose the Global Spatial-Channel Attention Mechanism (GSCA).
View Article and Find Full Text PDFBioengineering (Basel)
January 2025
Institute of Electronic Information Engineering, Beihang University, 37 Xueyuan Road, Haidian District, Beijing 100191, China.
Due to the labor-intensive manual annotations for nuclei segmentation, point-supervised segmentation based on nuclei coordinate supervision has gained recognition in recent years. Despite great progress, two challenges hinder the performance of weakly supervised nuclei segmentation methods: (1) The stable and effective segmentation of adjacent cell nuclei remains an unresolved challenge. (2) Existing approaches rely solely on initial pseudo-labels generated from point annotations for training, and inaccurate labels may lead the model to assimilate a considerable amount of noise information, thereby diminishing performance.
View Article and Find Full Text PDFBioengineering (Basel)
January 2025
Department of Applied Bioengineering, Graduate School of Convergence Science and Technology, Seoul National University, Seoul 08826, Republic of Korea.
Objective: This study evaluated the applicability of the classical method, height loss ratio (HLR), for identifying major acute compression fractures in clinical practice and compared its performance with deep learning (DL)-based VCF detection methods. Additionally, it examined whether combining the HLR with DL approaches could enhance performance, exploring the potential integration of classical and DL methodologies.
Methods: End-to-End VCF Detection (EEVD), Two-Stage VCF Detection with Segmentation and Detection (TSVD_SD), and Two-Stage VCF Detection with Detection and Classification (TSVD_DC).
Front Plant Sci
January 2025
Research Institute of Forest Policy and Information, Chinese Academy of Forestry, Beijing, China.
The processing of LiDAR point cloud data is of critical importance in the context of forest resource surveys, as well as representing a pivotal element in the realm of forest physiological and ecological studies.Nonetheless, conventional denoising algorithms frequently exhibit deficiencies with regard to adaptability and denoising efficacy, particularly when employed in relation to disparate datasets.To address these issues, this study introduces DEN4, an unsupervised, deep learning-based point cloud denoising algorithm designed to improve the accuracy of single tree segmentation in LiDAR point clouds.
View Article and Find Full Text PDFNeural Netw
January 2025
School of Mathematics and Statistics, Xi'an Jiaotong University, Xi'an 710049, China.
The discrete cosine transform (DCT) has been widely used in computer vision tasks due to its ability of high compression ratio and high-quality visual presentation. However, conventional DCT is usually affected by the size of transform region and results in blocking effect. Therefore, eliminating the blocking effects to efficiently serve for vision tasks is significant and challenging.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!