With its ability of joint representation learning and clustering via deep neural networks, the deep clustering have gained significant attention in recent years. Despite the considerable progress, most of the previous deep clustering methods still suffer from three critical limitations. First, they tend to associate some distribution-based clustering loss to the neural network, which often overlook the sample-wise contrastiveness for discriminative representation learning. Second, they generally utilize the features learned at a single layer for the clustering process, which, surprisingly, cannot go beyond a single layer to explore multiple layers for joint multi-layer (multi-stage) learning. Third, they typically use the convolutional neural network (CNN) for clustering images, which focus on local information yet cannot well capture the global dependencies. To tackle these issues, this paper presents a new deep clustering method called pyramid contrastive learning for clustering (PCLC), which is able to incorporate a pyramidal contrastive architecture to jointly enforce contrastive learning and clustering at multiple network layers (or stages). Particularly, for an input image, two types of augmentations are first performed to generate two paralleled augmented views. To bridge the gap between the CNN (for capturing local information) and the Transformer (for reflecting global dependencies), a mixed CNN-Transformer based encoder is utilized as the backbone, whose CNN-Transformer blocks are further divided into four stages, thus giving rise to a pyramid of multi-stage feature representations. Thereafter, multiple stages of twin contrastive learning are simultaneously conducted at both the instance-level and the cluster-level, through the optimization of which the final clustering can be achieved. Extensive experiments on multiple challenging image datasets demonstrate the superior clustering performance of PCLC over the state-of-the-art. The source code is available at https://github.com/Zachary-Chow/PCLC.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1016/j.neunet.2025.107217 | DOI Listing |
Jpn J Radiol
March 2025
Department of Radiology, Mie University Graduate School of Medicine, 2-174 Edobashi, Tsu, Mie, 514-8507, Japan.
Purpose: Myocardial computed tomography (CT) late enhancement (LE) allows assessment of myocardial scarring. Super-resolution deep learning image reconstruction (SR-DLR) trained on data acquired from ultra-high-resolution CT may improve image quality for CT-LE. Therefore, this study investigated image noise and image quality with SR-DLR compared with conventional DLR (C-DLR) and hybrid iterative reconstruction (hybrid IR).
View Article and Find Full Text PDFJ Cancer Educ
March 2025
Department of Neurosurgery, Cheeloo College of Medicine and Institute of Brain and Brain-Inspired Science, Qilu Hospital, Shandong University, Jinan, 250012, China.
This study aims to explore the effects of problem-based learning (PBL) and prescription-based preoperative talk (PPT) teaching methods in the teaching of tumors in cerebellopontine angle (CPA) of clinical neurosurgery residents.One hundred-thirty neurosurgery residents working in Qilu Hospital of Shandong University from September 2021 to June 2024 were randomly divided into two groups. The experimental group adopted the combination of PBL and PPT, referred to as PPP.
View Article and Find Full Text PDFPsychol Res
March 2025
School of Education, Guangzhou University, Guangzhou, 510006, People's Republic of China.
Cognitive offloading refers to the use of external tools to assist in memory processes.This study investigates the effects of item difficulty and value on cognitive offloading during a word-pair learning task, comparing children and young adults in a context where both cues coexist. In Experiment 1, we examined the impact of difficulty and value cues on cognitive offloading using a 2 (difficulty: easy vs.
View Article and Find Full Text PDFJ Biomol Struct Dyn
March 2025
School of Mechatronic Engineering and automation, Shanghai University, Shanghai, China.
Prediction of protein-ligand interactions is critical for drug discovery and repositioning. Traditional prediction methods are computationally intensive and limited in modeling structural changes. In contrast, data-driven deep learning methods significantly reduce computational costs and offer a more efficient approach for drug discovery.
View Article and Find Full Text PDFElife
March 2025
Department of Neuroscience, Georgetown University Medical Center, Washington DC, United States.
Research on brain plasticity, particularly in the context of deafness, consistently emphasizes the reorganization of the auditory cortex. But to what extent do all individuals with deafness show the same level of reorganization? To address this question, we examined the individual differences in functional connectivity (FC) from the deprived auditory cortex. Our findings demonstrate remarkable differentiation between individuals deriving from the absence of shared auditory experiences, resulting in heightened FC variability among deaf individuals, compared to more consistent FC in the hearing group.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!