AI Article Synopsis

  • Advances in technology have generated a wealth of single cell RNA sequencing (scRNA-seq) data, leading to the development of various clustering approaches to identify cellular phenotypes.
  • Clustering methods are generally categorized into individual (normal) methods, which focus on single data aspects, and integrated (ensemble) methods, which combine multiple individual methods for improved accuracy but can be sensitive to their base results.
  • The proposed EC-PGMGR algorithm aims to address these challenges by automatically determining cluster numbers and incorporating regularization to enhance the effectiveness of active clustering while mitigating the influence of weaker results.

Article Abstract

Advances in technology have made it convenient to obtain a large amount of single cell RNA sequencing (scRNA-seq) data. Since that clustering is a very important step in identifying or defining cellular phenotypes, many clustering approaches have been developed recently for these applications. The general methods can be roughly divided into normal clustering methods and integrated (ensemble) clustering methods which combine more than two normal clustering methods aiming to get much more informative performance. In order to make a contrast with the integrated clustering algorithm, the normal clustering method is often called individual or base clustering method. Note that the results of many individual clustering methods are often developed to capture one aspect of the data, and the results depend on the initial parameter settings, such as cluster number, distance metric and so on. Compared with individual clustering, although integrative clustering method may get much more accurate performance, the results depend on the base clustering results and integrated systems are often not self-regulation. Therefore, how to design a robust unsupervised clustering method is still a challenge. In order to tackle above limitations, we propose a novel Ensemble Clustering algorithm based on Probability Graphical Model with Graph Regularization, which is called EC-PGMGR for short. On one hand, we use parameter controlling in Probability Graphical Model (PGM) to automatically determine the cluster number without prior knowledge. On the other hand, we add a regularization term to reduce the effect deriving from some weak base clustering results. Particularly, the integrative results collected from base clustering methods can be assembled in the form of combination with self-regulation weights through a pre-learning process, which can efficiently enhance the effect of active clustering methods while weaken the effect of inactive clustering methods. Experiments are carried out on 7 data sets generated by different platforms with the number of single cells from 822 to 5,132. Results show that EC-PGMGR performs better than 4 alternative individual clustering methods and 2 ensemble methods in terms of accuracy including Adjusted Rand Index (ARI) and Normalized Mutual Information (NMI), robustness, effectiveness and so on. EC-PGMGR provides an effective way to integrate different clustering results for more accurate and reliable results in further biological analysis as well. It may provide some new insights to the other applications of clustering.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7673820PMC
http://dx.doi.org/10.3389/fgene.2020.572242DOI Listing

Publication Analysis

Top Keywords

clustering methods
32
clustering
22
clustering method
16
base clustering
16
ensemble clustering
12
probability graphical
12
graphical model
12
normal clustering
12
individual clustering
12
methods
10

Similar Publications

Investigation and elimination of noncovalent artificial aggregates during non-reduced capillary electrophoresis-sodium dodecyl sulfate analysis of a multi-specific antibody.

J Pharm Biomed Anal

January 2025

State Key Laboratory of Neurology and Oncology Drug Development, Nanjing, China; Simcere Zaiming Pharmaceutical Co, Ltd., Nanjing, China. Electronic address:

Capillary electrophoresis-sodium dodecyl sulfate (CE-SDS) is widely used in the biopharmaceutical industry for monitoring purity and analyzing impurities. The accuracy of the method may be compromised by artificial species resulting from sample preparation or electrophoresis separation due to suboptimal conditions. During non-reduced CE-SDS analysis of a multispecific antibody (msAb), named as multispecific antibody C (msAb-C), a cluster of unexpected peaks was observed after the main peak.

View Article and Find Full Text PDF

Influence of Axial Rotation Between the Femoral Neck and Ankle Joint on Kinematics in Normal Knees: A Cross-Sectional Study.

J Am Acad Orthop Surg Glob Res Rev

January 2025

From the Department of Orthopedic Surgery, Faculty of Medicine, The University of Tokyo, Bunkyo, Tokyo (Dr. Kono, Dr. Taketomi, Dr. Kage, Dr. Inui, and Dr. Tanaka); the Department of Information Systems, Faculty of Engineering, Saitama Institute of Technology, Fukaya, Saitama (Dr. Yamazaki); the Department of Orthopedic Biomaterial Science, Osaka University Graduate School of Medicine, Suita, Osaka (Dr. Tamaki, and Dr. Tomita); the Department of Orthopedic Surgery, Saitama Medical University, Saitama Medical Center, Kawagoe, Saitama (Dr. Inui); and the Department of Health Science, Graduate School of Health Science, Morinomiya University of Medical Sciences, Suminoe, Osaka, Japan (Dr. Tomita).

Background: The effect of axial rotation between the femoral neck and ankle joint (total rotation [TR]) on normal knees is unknown. Therefore, this study aimed to investigate the TR effect on normal knee kinematics.

Methods: Volunteers were divided into groups large (L), intermediate (I), and small (S), using hierarchical cluster analysis based on TR in the standing position.

View Article and Find Full Text PDF

Background: Pakistani women are among the most affected groups by obesity and heart failure in Catalonia. Due to cultural and linguistic barriers, their participation in standard health promotion programs is limited. To address this issue, we implemented a culturally and linguistically appropriate food education program called the PakCat Program.

View Article and Find Full Text PDF

Objective: This study aimed to evaluate the occurrence of methicillin-resistant Staphylococcus aureus (MRSA) at the University Hospital Olomouc (UHO) over a 10-year period (2013-2022).

Material And Methods: Data was obtained from the ENVIS LIMS laboratory information system (DS Soft, Czech Republic, Olomouc) of the Department of Microbiology, UHO, for the period 1/1/2013-31/12/2022. Standard microbiological procedures using the MALDI-TOF MS system (Biotyper Microflex, Bruker Daltonics) were applied for the identification.

View Article and Find Full Text PDF

The current paper aimed to estimate the network structure of general psychopathology (internalizing and externalizing symptoms/disorders) among 239 gifted children in Jordan. This cross-sectional study with a convenience sampling method was conducted between September 2023 and October 2024 among gifted children aged 7-12. The Child Behavior Checklist (CBCL) was employed to assess six symptom clusters: conduct problems, attention-deficit/hyperactivity disorder (ADHD), and oppositional defiant problems as externalizing symptoms, and affective problems, anxiety issues, and somatic complaints as internalizing symptoms.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!