Background: Missing value estimation is an important preprocessing step in microarray analysis. Although several methods have been developed to solve this problem, their performance is unsatisfactory for datasets with high rates of missing data, high measurement noise, or limited numbers of samples. In fact, more than 80% of the time-series datasets in Stanford Microarray Database contain less than eight samples.
Results: We present the integrative Missing Value Estimation method (iMISS) by incorporating information from multiple reference microarray datasets to improve missing value estimation. For each gene with missing data, we derive a consistent neighbor-gene list by taking reference data sets into consideration. To determine whether the given reference data sets are sufficiently informative for integration, we use a submatrix imputation approach. Our experiments showed that iMISS can significantly and consistently improve the accuracy of the state-of-the-art Local Least Square (LLS) imputation algorithm by up to 15% improvement in our benchmark tests.
Conclusion: We demonstrated that the order-statistics-based integrative imputation algorithms can achieve significant improvements over the state-of-the-art missing value estimation approaches such as LLS and is especially good for imputing microarray datasets with a limited number of samples, high rates of missing data, or very noisy measurements. With the rapid accumulation of microarray datasets, the performance of our approach can be further improved by incorporating larger and more appropriate reference datasets.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1622759 | PMC |
http://dx.doi.org/10.1186/1471-2105-7-449 | DOI Listing |
Int J Cancer
January 2025
Inequalities in Cancer Outcomes Network (ICON) group, Department of Health Services Research and Policy, Faculty of Public Health and Policy, London School of Hygiene & Tropical Medicine, London, UK.
We aimed to investigate socio-economic inequalities in second primary cancer (SPC) incidence among breast cancer survivors. Using Data from cancer registries in England, we included all women diagnosed with a first primary breast cancer (PBC) between 2000 and 2018 and aged between 18 and 99 years and followed them up from 6 months after the PBC diagnosis until a SPC event, death, or right censoring, whichever came first. We used flexible parametric survival models adjusting for age and year of PBC diagnosis, ethnicity, PBC tumour stage, comorbidity, and PBC treatments to model the cause-specific hazards of SPC incidence and death according to income deprivation, and then estimated standardised cumulative incidences of SPC by deprivation, taking death as the competing event.
View Article and Find Full Text PDFSensors (Basel)
January 2025
Department of Biomedical and Robotics Engineering, Incheon National University, Incheon 22012, Republic of Korea.
With the rise of modern healthcare monitoring, heart rate (HR) estimation using remote photoplethysmography (rPPG) has gained attention for its non-contact, continuous tracking capabilities. However, most HR estimation methods rely on stable, fixed sampling intervals, while practical image capture often involves irregular frame rates and missing data, leading to inaccuracies in HR measurements. This study addresses these issues by introducing low-complexity timing correction methods, including linear, cubic, and filter interpolation, to improve HR estimation from rPPG signals under conditions of irregular sampling and data loss.
View Article and Find Full Text PDFSensors (Basel)
January 2025
School of Mathematics and Information Science, Guangxi University, Nanning 530004, China.
In this paper, a novel particle filter based on one-step smoothing is proposed for nonlinear systems with random one-step delay and missing measurements. Such problems are commonly encountered in networked control systems, where random one-step delay and missing measurements significantly increase the difficulty of dynamic state estimation. The particle filter is a nonlinear filtering method based on sequential Monte Carlo sampling.
View Article and Find Full Text PDFCancers (Basel)
January 2025
Catalan Cancer Plan, Department of Health, L'Hospitalet de Llobregat, 08908 Barcelona, Spain.
Purpose: The aim of this study was to compare estimates of adherence to oral endocrine therapy (OET) based on real-world data (RWD) and on clinical evaluation in people diagnosed with breast cancer in the public healthcare system in Catalonia (Spain).
Methods: We conducted two retrospective cohort studies. Cohort 1 (RWD) consisted of women diagnosed with breast cancer in 2021 in the public healthcare system of Catalonia (Spain).
Lancet Child Adolesc Health
February 2025
Developmental Biology and Cancer Research & Teaching Department, UCL Great Ormond Street Institute of Child Health, University College London, London, UK. Electronic address:
Background: International variation in childhood cancer survival might be explained by differences in stage at diagnosis, among other factors. As part of the BENCHISTA project, we aimed to assess geographical variation in tumour stage at diagnosis through the application, by population-based cancer registries working with clinicians, of the international consensus Toronto Childhood Cancer Stage Guidelines.
Methods: This population-based, retrospective cohort study involved 67 cancer registries from 23 European countries, Australia, Brazil, Japan, and Canada.
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!