Machine learning and statistical model based classifiers have increasingly been used with more complex and high dimensional biological data obtained from high-throughput technologies. Understanding the impact of various factors associated with large and complex microarray datasets on the predictive performance of classifiers is computationally intensive, under investigated, yet vital in determining the optimal number of biomarkers for various classification purposes aimed towards improved detection, diagnosis, and therapeutic monitoring of diseases. We investigate the impact of microarray based data characteristics on the predictive performance for various classification rules using simulation studies. Our investigation using Random Forest, Support Vector Machines, Linear Discriminant Analysis and k-Nearest Neighbour shows that the predictive performance of classifiers is strongly influenced by training set size, biological and technical variability, replication, fold change and correlation between biomarkers. Optimal number of biomarkers for a classification problem should therefore be estimated taking account of the impact of all these factors. A database of average generalization errors is built for various combinations of these factors. The database of generalization errors can be used for estimating the optimal number of biomarkers for given levels of predictive accuracy as a function of these factors. Examples show that curves from actual biological data resemble that of simulated data with corresponding levels of data characteristics. An R package optBiomarker implementing the method is freely available for academic use from the Comprehensive R Archive Network (http://www.cran.r-project.org/web/packages/optBiomarker/).

Download full-text PDF

Source
http://dx.doi.org/10.1142/s0219720010005063DOI Listing

Publication Analysis

Top Keywords

optimal number
16
number biomarkers
16
biomarkers classification
12
predictive performance
12
estimating optimal
8
classification rules
8
biological data
8
impact factors
8
performance classifiers
8
data characteristics
8

Similar Publications

Objective: Understanding healthcare-seeking propensity is crucial for optimizing healthcare utilization, especially for patients with chronic conditions like hypertension or diabetes, given their substantial burden on healthcare systems globally. This study aims to evaluate hypertensive or diabetic patients' healthcare-seeking propensity based on the severity of symptoms, categorizing symptoms as either major or minor. It also explores factors influencing healthcare-seeking propensity and examines whether healthcare-seeking propensity affects healthcare utilization and preventable hospitalizations.

View Article and Find Full Text PDF

Optimizing skull base defect repair: leveraging the reused nasoseptal flap as a reliable material.

Eur Arch Otorhinolaryngol

January 2025

Department of Otolaryngology-Head and Neck Surgery, Taipei Veterans General Hospital, Taipei, Taiwan.

Purpose: The escalating number of endoscopic skull base procedures necessitates exploring additional materials to reduce postoperative cerebrospinal fluid (CSF) leaks in revision or staged surgeries. This study evaluates the effectiveness of reused nasoseptal flaps (NSFs) in such clinical scenarios.

Methods: A retrospective review was conducted on patients who previously underwent surgery involving NSFs and later had revision or secondary skull base surgeries via endoscopic endonasal approaches (EEAs) at a tertiary medical center.

View Article and Find Full Text PDF

Left atrial shunting devices: why, what, how, and… when?

Heart Fail Rev

January 2025

Department of Cardiology, San Luca Hospital, IRCCS Istituto Auxologico Italiano, Milan, Italy.

Left atrial (LA) hypertension is central in the pathophysiology of heart failure (HF) in general and of HF with preserved ejection fraction (HFpEF) in particular. Despite approved treatments, a number of HF patients continue experiencing disabling symptoms due to LA hypertension, causing pulmonary congestion, pulmonary hypertension, and right heart dysfunction, at rest and/or during exercise. LA decompression therapies, i.

View Article and Find Full Text PDF

Finger amputations following complex hand injuries (CHI) pose a significant challenge in hand surgery due to severe tissue trauma and neurovascular damage, necessitating precise arterial repair. While restoring arterial perfusion is critical, it remains unclear whether reconstructing both proper palmar digital arteries is required for optimal outcomes. This study evaluates whether restoring one or both arteries in finger replantation after complex injuries impacts perfusion and overall outcomes.

View Article and Find Full Text PDF

Epidemiological status, development trends, and risk factors of disability-adjusted life years due to diabetic kidney disease: A systematic analysis of Global Burden of Disease Study 2021.

Chin Med J (Engl)

January 2025

Department of Metabolism and Endocrinology, National Clinical Research Center for Metabolic Diseases, Key Laboratory of Diabetes Immunology (Central South University), Ministry of Education, The Second Xiangya Hospital of Central South University, Changsha, Hunan 410011, China.

Background: Approximately 40% of individuals with diabetes worldwide are at risk of developing diabetic kidney disease (DKD), which is not only the leading cause of kidney failure, but also significantly increases the risk of cardiovascular disease, causing significant societal health and financial burdens. This study aimed to describe the burden of DKD and explore its cross-country epidemiological status, predict development trends, and assess its risk factors and sociodemographic transitions.

Methods: Based on the Global Burden of Diseases (GBD) Study 2021, data on DKD due to type 1 diabetes (DKD-T1DM) and type 2 diabetes (DKD-T2DM) were analyzed by sex, age, year, and location.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!