Background: Untargeted mass spectrometry (MS)-based metabolomics data often contain missing values that reduce statistical power and can introduce bias in biomedical studies. However, the various sources of missing values and the strategies for handling them have rarely been assessed systematically. Missing data can arise systematically, e.g., from run day-dependent effects due to limits of detection (LOD), or at random, for instance as a consequence of sample preparation.
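The two missingness mechanisms described above can be illustrated with a minimal sketch. The function name `simulate_missingness`, its parameters, and the LOD values are hypothetical and chosen for illustration only; this is not the simulation framework used in the study.

```python
import random


def simulate_missingness(values, run_days, lod_by_day, random_rate=0.0, seed=0):
    """Return a copy of `values` with None wherever a measurement is treated as missing.

    values      -- measured intensities for one metabolite, one per sample
    run_days    -- run day label for each sample (same length as values)
    lod_by_day  -- assumed limit of detection for each run day
    random_rate -- assumed probability that an observed value is lost at random
    """
    rng = random.Random(seed)
    masked = []
    for value, day in zip(values, run_days):
        if value < lod_by_day[day]:
            # Systematic missingness: below that run day's limit of detection.
            masked.append(None)
        elif rng.random() < random_rate:
            # Random missingness: e.g. a value lost during sample preparation.
            masked.append(None)
        else:
            masked.append(value)
    return masked


# Two run days with different (made-up) detection limits.
intensities = [0.5, 1.5, 2.5, 0.5, 1.5, 2.5]
days = [1, 1, 1, 2, 2, 2]
print(simulate_missingness(intensities, days, {1: 1.0, 2: 2.0}))
```

In this toy setup the same intensity (1.5) is observed on run day 1 but missing on run day 2, which is what makes LOD-based missingness run day-dependent.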

Methods: We investigated patterns of missing data in an MS-based metabolomics experiment of serum samples from the German KORA F4 cohort (n = 1750). We then evaluated 31 imputation methods in a simulation framework and biologically validated the results by applying all imputation approaches to real metabolomics data. We examined the ability of each method to reconstruct biochemical pathways from data-driven correlation networks, and the ability of the method to increase statistical power while preserving the strength of established metabolic quantitative trait loci.

Results: Run day-dependent LOD-based missing data account for most missing values in the metabolomics dataset. Although multiple imputation by chained equations performed well in many scenarios, it is computationally and statistically challenging. K-nearest neighbors (KNN) imputation on observations with variable pre-selection showed robust performance across all evaluation schemes and is computationally more tractable.

Conclusion: Missing data in untargeted MS-based metabolomics data occur for various reasons. Based on our results, we recommend performing KNN-based imputation on observations with variable pre-selection, since it showed robust results in all evaluation schemes.
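As an illustration of the general idea behind KNN imputation on observations with variable pre-selection, the following is a minimal sketch using only the Python standard library. The abstract does not specify the details, so the correlation threshold `min_abs_corr`, the distance measure, the default `k`, and the function names are all assumptions and not the authors' implementation.

```python
import math


def pearson(x, y):
    """Pearson correlation over the positions where both values are observed."""
    pairs = [(a, b) for a, b in zip(x, y) if a is not None and b is not None]
    if len(pairs) < 3:
        return 0.0
    mx = sum(a for a, _ in pairs) / len(pairs)
    my = sum(b for _, b in pairs) / len(pairs)
    sxy = sum((a - mx) * (b - my) for a, b in pairs)
    sxx = sum((a - mx) ** 2 for a, _ in pairs)
    syy = sum((b - my) ** 2 for _, b in pairs)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / math.sqrt(sxx * syy)


def knn_impute(data, k=3, min_abs_corr=0.2):
    """Impute None entries in `data` (rows = samples, columns = variables).

    Assumes variables are on comparable scales (e.g. z-scored beforehand).
    Entries for which no suitable neighbor exists are left as None.
    """
    n_vars = len(data[0])
    columns = [[row[j] for row in data] for j in range(n_vars)]
    result = [list(row) for row in data]
    for j in range(n_vars):
        if all(v is not None for v in columns[j]):
            continue
        # Variable pre-selection: keep only variables correlated with variable j.
        selected = [m for m in range(n_vars) if m != j
                    and abs(pearson(columns[j], columns[m])) >= min_abs_corr]
        donors = [row for row in data if row[j] is not None]
        for i, row in enumerate(data):
            if row[j] is not None:
                continue
            scored = []
            for donor in donors:
                shared = [m for m in selected
                          if row[m] is not None and donor[m] is not None]
                if not shared:
                    continue
                # Root mean squared difference over the shared selected variables.
                dist = math.sqrt(sum((row[m] - donor[m]) ** 2 for m in shared)
                                 / len(shared))
                scored.append((dist, donor[j]))
            scored.sort(key=lambda pair: pair[0])
            nearest = [value for _, value in scored[:k]]
            if nearest:
                # Impute with the mean of the k nearest samples' observed values.
                result[i][j] = sum(nearest) / len(nearest)
    return result


# Toy data: variable 1 tracks variable 0, variable 2 is unrelated to it.
samples = [
    [1.0, 1.1, 2.0],
    [2.0, 2.1, 5.0],
    [3.0, 3.0, 1.0],
    [4.0, 4.2, 5.0],
    [5.0, 5.1, 2.0],
    [None, 3.1, 100.0],
]
print(knn_impute(samples, k=2)[5])
```

In the toy data, the extreme value of the unrelated third variable in the last sample does not affect the neighbor search, because pre-selection restricts the distance computation to variables correlated with the one being imputed.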

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6153696
http://dx.doi.org/10.1007/s11306-018-1420-2
