Background: Although many consensus clustering methods have been successfully used for combining multiple classifiers in many areas such as machine learning, applied statistics, pattern recognition and bioinformatics, few consensus clustering methods have been applied for combining multiple clusterings of chemical structures. It is known that any individual clustering method will not always give the best results for all types of applications. So, in this paper, three voting and graph-based consensus clusterings were used for combining multiple clusterings of chemical structures to enhance the ability of separating biologically active molecules from inactive ones in each cluster.
Results: The cumulative voting-based aggregation algorithm (CVAA), cluster-based similarity partitioning algorithm (CSPA) and hyper-graph partitioning algorithm (HGPA) were examined. The F-measure and Quality Partition Index method (QPI) were used to evaluate the clusterings and the results were compared to the Ward's clustering method. The MDL Drug Data Report (MDDR) dataset was used for experiments and was represented by two 2D fingerprints, ALOGP and ECFP_4. The performance of voting-based consensus clustering method outperformed the Ward's method using F-measure and QPI method for both ALOGP and ECFP_4 fingerprints, while the graph-based consensus clustering methods outperformed the Ward's method only for ALOGP using QPI. The Jaccard and Euclidean distance measures were the methods of choice to generate the ensembles, which give the highest values for both criteria.
Conclusions: The results of the experiments show that consensus clustering methods can improve the effectiveness of chemical structures clusterings. The cumulative voting-based aggregation algorithm (CVAA) was the method of choice among consensus clustering methods.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3541359 | PMC |
http://dx.doi.org/10.1186/1758-2946-4-37 | DOI Listing |
Breast Cancer Res Treat
January 2025
Department of Breast Surgery, Thyroid Surgery, Huangshi Central Hospital, Affiliated Hospital of Hubei Polytechnic University, No.141, Tianjin Road, Huangshi, 435000, Hubei, China.
Background: The heterogeneity of breast cancer (BC) necessitates the identification of novel subtypes and prognostic models to enhance patient stratification and treatment strategies. This study aims to identify novel BC subtypes based on PANoptosis-related genes (PRGs) and construct a robust prognostic model to guide individualized treatment strategies.
Methods: The transcriptome data along with clinical data of BC patients were sourced from the TCGA and GEO databases.
Schizophr Bull
January 2025
Psychotic Disorders Division, McLean Hospital, Belmont, MA, United States.
Background And Hypothesis: Convergent evidence shows the presence of brain metabolic abnormalities in psychotic disorders. This study examined brain reductive stress and energy metabolism in people with psychotic disorders with impaired or average range cognition. We hypothesized that global cognitive impairment would be associated with greater brain metabolic dysregulation.
View Article and Find Full Text PDFRapid technological advancements have made it possible to generate single-cell data at a large scale. Several laboratories around the world can now generate single-cell transcriptomic data from different tissues. Unsupervised clustering, followed by annotation of the cell type of the identified clusters, is a crucial step in single-cell analyses.
View Article and Find Full Text PDFOrthop J Sports Med
January 2025
Pan Am Clinic and University of Manitoba, Winnipeg, Manitoba, Canada.
Background: Inconsistencies in the workup of labral tears in the hip have been shown to result in a delay in treatment and an increased cost to the medical system.
Purpose: To establish consensus statements among Canadian nonoperative/operative sports medicine physicians via a modified Delphi process on the diagnosis, nonoperative and operative management, and rehabilitation and return to play (RTP) of those with labral tears in the hip.
Study Design: A consensus statement.
Int J Mol Sci
January 2025
School of Environmental Science and Engineering, Hainan University, Haikou 570228, China.
Hepatocellular carcinoma (HCC), a leading liver tumor globally, is influenced by diverse risk factors. Cellular senescence, marked by permanent cell cycle arrest, plays a crucial role in cancer biology, but its markers and roles in the HCC immune microenvironment remain unclear. Three machine learning methods, namely k nearest neighbor (KNN), support vector machine (SVM), and random forest (RF), are utilized to identify eight key HCC cell senescence markers (HCC-CSMs).
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!