The development of data sensing technology has generated a vast amount of high-dimensional data, posing great challenges for machine learning models. Over the past decades, despite demonstrating its effectiveness in data classification, genetic programming (GP) has still encountered three major challenges when dealing with high-dimensional data: 1) solution diversity; 2) multiclass imbalance; and 3) large feature space. In this article, we have developed a problem-specific multiobjective GP framework (PS-MOGP) for handling classification tasks with high-dimensional data. To reduce the large solution space caused by high dimensionality, we incorporate the recursive feature elimination strategy based on mining the archive of evolved GP solutions. A progressive domination Pareto archive evolution strategy (PD-PAES), which optimizes the objectives in a specific order according to their objectives, is proposed to evaluate the GP individuals and maintain a better diversity of solutions. Besides, to address the seriously imbalanced class issue caused by traditional binary decomposition (BD) one versus rest (OVR) for multiclass classification problems, we design a method named BD with a similar positive and negative class size (BD-SPNCS) to generate a set of auxiliary classifiers. Experimental results on benchmark and real-world datasets demonstrate that our proposed PS-MOGP outperforms state-of-the-art traditional and evolutionary classification methods in the context of high-dimensional data classification.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1109/TCYB.2024.3372070 | DOI Listing |
Nat Methods
January 2025
Broad Institute of MIT and Harvard, Cambridge, MA, USA.
A key challenge of the modern genomics era is developing empirical data-driven representations of gene function. Here we present the first unbiased morphology-based genome-wide perturbation atlas in human cells, containing three genome-wide genotype-phenotype maps comprising CRISPR-Cas9-based knockouts of >20,000 genes in >30 million cells. Our optical pooled cell profiling platform (PERISCOPE) combines a destainable high-dimensional phenotyping panel (based on Cell Painting) with optical sequencing of molecular barcodes and a scalable open-source analysis pipeline to facilitate massively parallel screening of pooled perturbation libraries.
View Article and Find Full Text PDFSci Rep
January 2025
Department of Artificial Intelligence and Data Science, College of Computer Science and Engineering, University of Hail, Hail, Saudi Arabia.
In the present digital scenario, the explosion of Internet of Things (IoT) devices makes massive volumes of high-dimensional data, presenting significant data and privacy security challenges. As IoT networks enlarge, certifying sensitive data privacy while still employing data analytics authority is vital. In the period of big data, statistical learning has seen fast progressions in methodological practical and innovation applications.
View Article and Find Full Text PDFSci Rep
January 2025
Department of Orthopedics, Shanghai Changhai Hospital, Shanghai, 200433, China.
With the emergence of numerous classifications, surgical treatment for adolescent idiopathic scoliosis (AIS) can be guided more effectively. However, surgical decision-making and optimal strategies still lack standardization and personalized customization. Our study aims to devise proper deep learning (DL) models that incorporate key factors influencing surgical outcomes on the coronal plane in AIS patients to facilitate surgical decision-making and predict surgical results for AIS patients.
View Article and Find Full Text PDFDrug Saf
January 2025
Clinical Pharmacology, Pharmacy and Environmental Medicine, Department of Public Health, University of Southern Denmark, 5000, Odense C, Denmark.
Introduction: Large administrative healthcare databases can be used for near real-time sequential safety surveillance of drugs as an alternative approach to traditional reporting-based pharmacovigilance. The study aims to build and empirically test a prospective drug safety monitoring setup and perform a sequential safety monitoring of rofecoxib use and risk of cardiovascular outcomes.
Methods: We used Danish population-based health registers and performed sequential analysis of rofecoxib use and cardiovascular outcomes using case-time-control and cohort study designs from January 2000 to September 2004.
J Cell Mol Med
January 2025
Department of Cardiology, Guizhou Provincial People's Hospital, Guiyang, Guizhou Province, China.
It is critical to appreciate the role of the tumour-associated microenvironment (TME) in developing strategies for the effective therapy of cancer, as it is an important factor that determines the evolution and treatment response of tumours. This work combines machine learning and single-cell RNA sequencing (scRNA-seq) to explore the glioma tumour microenvironment's TME. With the help of genome-wide association studies (GWAS) and Mendelian randomization (MR), we found genetic variants associated with TME elements that affect cancer and cardiovascular disease outcomes.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!