Nuclear Norm Clustering: a promising alternative method for clustering tasks.

Sci Rep

State Key Laboratory of Genetic Engineering, Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai, China.

Published: July 2018

Clustering techniques are widely used in many applications. The goal of clustering is to identify patterns or groups of similar objects within a dataset of interest. However, many clustering methods are not robust to the noise and outliers found in real data. In this paper, we present Nuclear Norm Clustering (NNC, available at https://sourceforge.net/projects/nnc/), an algorithm that can be used in various fields as a promising alternative to the k-means clustering method. The NNC algorithm requires users to provide a data matrix M and a desired number of clusters K. We employed simulated annealing to choose an optimal label vector that minimizes the nuclear norm of the pooled within-cluster residual matrix. To evaluate the NNC algorithm, we compared it with other classic methods on 15 public datasets and 2 genome-wide association study (GWAS) datasets on psoriasis. The results indicate that NNC performs competitively in terms of F-score on the 15 benchmark public datasets and the 2 psoriasis GWAS datasets. NNC is therefore a promising alternative method for clustering tasks.
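The objective described in the abstract can be illustrated with a minimal sketch. This is not the authors' released implementation: the function names, the single-label move proposal, the geometric cooling schedule, and the default parameters are all assumptions made for illustration. It assumes the pooled within-cluster residual matrix is formed by subtracting each cluster's mean row from the rows of M assigned to that cluster.

```python
import numpy as np

def pooled_residual_nuclear_norm(M, labels, K):
    """Nuclear norm of M after subtracting each row's cluster mean (assumed objective)."""
    R = np.asarray(M, dtype=float).copy()
    for k in range(K):
        mask = labels == k
        if mask.any():
            R[mask] -= R[mask].mean(axis=0)
    # ord="nuc" gives the sum of singular values of a 2-D array.
    return np.linalg.norm(R, ord="nuc")

def nnc_sketch(M, K, n_iter=2000, t0=1.0, cooling=0.995, seed=0):
    """Simulated annealing over label vectors (hypothetical schedule and moves)."""
    rng = np.random.default_rng(seed)
    n = M.shape[0]
    labels = rng.integers(0, K, size=n)
    cost = pooled_residual_nuclear_norm(M, labels, K)
    best_labels, best_cost = labels.copy(), cost
    t = t0
    for _ in range(n_iter):
        # Propose moving one randomly chosen row to a randomly chosen cluster.
        i = rng.integers(n)
        candidate = labels.copy()
        candidate[i] = rng.integers(K)
        new_cost = pooled_residual_nuclear_norm(M, candidate, K)
        # Always accept improvements; accept worse moves with Metropolis probability.
        if new_cost <= cost or rng.random() < np.exp((cost - new_cost) / t):
            labels, cost = candidate, new_cost
            if cost < best_cost:
                best_labels, best_cost = labels.copy(), cost
        t *= cooling  # geometric cooling
    return best_labels, best_cost

if __name__ == "__main__":
    rng = np.random.default_rng(1)
    # Two synthetic, well-separated groups of 10 rows each.
    M = np.vstack([rng.normal(0, 0.1, (10, 3)), rng.normal(5, 0.1, (10, 3))])
    labels, cost = nnc_sketch(M, K=2)
    print(labels, cost)
```

Recomputing a full singular value decomposition for every proposal keeps the sketch short but is costly on large matrices; a practical implementation would likely need a cheaper update or fewer evaluations.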

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6052164
DOI: http://dx.doi.org/10.1038/s41598-018-29246-4

Similar Publications

Background: Photon-counting computed tomography (CT) is an advanced imaging technique that enables multi-energy imaging from a single scan. However, the limited photon count assigned to narrow energy bins leads to increased quantum noise in the reconstructed spectral images. To address this issue, leveraging the prior information in the spectral images is essential.

Predicting human miRNA disease association with minimize matrix nuclear norm.

Sci Rep

December 2024

Department of Electricity and Energy, Selcuk University, Konya, Turkey.

MicroRNAs (miRNAs) are non-coding RNA molecules that influence the development and progression of many diseases. Research has documented that miRNAs have a significant role in the prevention, diagnosis, and treatment of complex human diseases. Recently, scientists have devoted extensive resources to attempting to find the connections between miRNAs and diseases.

Aromatic-aromatic interactions drive fold switch of GA95 and GB95 with three residue difference.

Chem Sci

January 2025

Key Laboratory of Magnetic Resonance in Biological Systems, State Key Laboratory of Magnetic Resonance and Atomic and Molecular Physics, National Center for Magnetic Resonance in Wuhan, Wuhan National Laboratory for Optoelectronics, Wuhan Institute of Physics and Mathematics, Innovation Academy of Precision Measurement, Chinese Academy of Sciences Wuhan 430071 China

Proteins typically adopt a single fold to carry out their function, but metamorphic proteins, with multiple folding states, defy this norm. Deciphering the mechanism of conformational interconversion of metamorphic proteins is challenging. Herein, we employed nuclear magnetic resonance (NMR), circular dichroism (CD), and all-atom molecular dynamics (MD) simulations to elucidate the mechanism of fold switching in proteins GA95 and GB95, which share 95% sequence homology.

The translation of nucleotide sequences into amino acid sequences, governed by the genetic code, is one of the most conserved features of molecular biology. The standard genetic code, which uses 61 sense codons to encode one of the 20 standard amino acids and 3 stop codons (UAA, UAG, and UGA) to terminate translation, is used by most extant organisms. The protistan phylum Ciliophora (the 'ciliates') is the most prominent exception to this norm, exhibiting the greatest diversity of nuclear genetic code variants and evidence of repeated changes in the code.

Background: With increasing focus on patient-reported outcome measures (PROMs) in chronic rheumatic diseases, we aimed to evaluate the self-reported physical and psychosocial health in children with juvenile idiopathic arthritis (JIA) compared to matched population-based controls. Furthermore, we aimed to study the association of patient- and physician-reported outcome measures in JIA with patient-reported physical disability.

Methods: We used data from a Norwegian JIA cohort study (NorJIA), including clinical characteristics and outcome measures in participants with JIA and sex- and age-matched population-based controls.
