Integrative missing value estimation for microarray data.

BMC Bioinformatics

Molecular and Computational Biology Section, Department of Biological Sciences, University of Southern California, Los Angeles, CA 900089, USA.

Published: October 2006

Background: Missing value estimation is an important preprocessing step in microarray analysis. Although several methods have been developed to solve this problem, their performance is unsatisfactory for datasets with high rates of missing data, high measurement noise, or limited numbers of samples. In fact, more than 80% of the time-series datasets in Stanford Microarray Database contain less than eight samples.

Results: We present the integrative Missing Value Estimation method (iMISS) by incorporating information from multiple reference microarray datasets to improve missing value estimation. For each gene with missing data, we derive a consistent neighbor-gene list by taking reference data sets into consideration. To determine whether the given reference data sets are sufficiently informative for integration, we use a submatrix imputation approach. Our experiments showed that iMISS can significantly and consistently improve the accuracy of the state-of-the-art Local Least Square (LLS) imputation algorithm by up to 15% improvement in our benchmark tests.

Conclusion: We demonstrated that the order-statistics-based integrative imputation algorithms can achieve significant improvements over the state-of-the-art missing value estimation approaches such as LLS and is especially good for imputing microarray datasets with a limited number of samples, high rates of missing data, or very noisy measurements. With the rapid accumulation of microarray datasets, the performance of our approach can be further improved by incorporating larger and more appropriate reference datasets.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1622759PMC
http://dx.doi.org/10.1186/1471-2105-7-449DOI Listing

Publication Analysis

Top Keywords

missing estimation
20
missing data
12
microarray datasets
12
integrative missing
8
high rates
8
rates missing
8
reference data
8
data sets
8
missing
7
microarray
6

Similar Publications

Socio-economic inequalities in second primary cancer incidence: A competing risks analysis of women with breast cancer in England between 2000 and 2018.

Int J Cancer

January 2025

Inequalities in Cancer Outcomes Network (ICON) group, Department of Health Services Research and Policy, Faculty of Public Health and Policy, London School of Hygiene & Tropical Medicine, London, UK.

We aimed to investigate socio-economic inequalities in second primary cancer (SPC) incidence among breast cancer survivors. Using Data from cancer registries in England, we included all women diagnosed with a first primary breast cancer (PBC) between 2000 and 2018 and aged between 18 and 99 years and followed them up from 6 months after the PBC diagnosis until a SPC event, death, or right censoring, whichever came first. We used flexible parametric survival models adjusting for age and year of PBC diagnosis, ethnicity, PBC tumour stage, comorbidity, and PBC treatments to model the cause-specific hazards of SPC incidence and death according to income deprivation, and then estimated standardised cumulative incidences of SPC by deprivation, taking death as the competing event.

View Article and Find Full Text PDF

Low-Complexity Timing Correction Methods for Heart Rate Estimation Using Remote Photoplethysmography.

Sensors (Basel)

January 2025

Department of Biomedical and Robotics Engineering, Incheon National University, Incheon 22012, Republic of Korea.

With the rise of modern healthcare monitoring, heart rate (HR) estimation using remote photoplethysmography (rPPG) has gained attention for its non-contact, continuous tracking capabilities. However, most HR estimation methods rely on stable, fixed sampling intervals, while practical image capture often involves irregular frame rates and missing data, leading to inaccuracies in HR measurements. This study addresses these issues by introducing low-complexity timing correction methods, including linear, cubic, and filter interpolation, to improve HR estimation from rPPG signals under conditions of irregular sampling and data loss.

View Article and Find Full Text PDF

In this paper, a novel particle filter based on one-step smoothing is proposed for nonlinear systems with random one-step delay and missing measurements. Such problems are commonly encountered in networked control systems, where random one-step delay and missing measurements significantly increase the difficulty of dynamic state estimation. The particle filter is a nonlinear filtering method based on sequential Monte Carlo sampling.

View Article and Find Full Text PDF

Purpose: The aim of this study was to compare estimates of adherence to oral endocrine therapy (OET) based on real-world data (RWD) and on clinical evaluation in people diagnosed with breast cancer in the public healthcare system in Catalonia (Spain).

Methods: We conducted two retrospective cohort studies. Cohort 1 (RWD) consisted of women diagnosed with breast cancer in 2021 in the public healthcare system of Catalonia (Spain).

View Article and Find Full Text PDF

International benchmarking of stage at diagnosis for six childhood solid tumours (the BENCHISTA project): a population-based, retrospective cohort study.

Lancet Child Adolesc Health

February 2025

Developmental Biology and Cancer Research & Teaching Department, UCL Great Ormond Street Institute of Child Health, University College London, London, UK. Electronic address:

Background: International variation in childhood cancer survival might be explained by differences in stage at diagnosis, among other factors. As part of the BENCHISTA project, we aimed to assess geographical variation in tumour stage at diagnosis through the application, by population-based cancer registries working with clinicians, of the international consensus Toronto Childhood Cancer Stage Guidelines.

Methods: This population-based, retrospective cohort study involved 67 cancer registries from 23 European countries, Australia, Brazil, Japan, and Canada.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!