In the field of data analysis, it is often faced with a large number of missing values, especially in metabolomics data, this problem is more prominent. Data imputation is a common method to deal with missing metabolomics data, while traditional data imputation methods usually ignore the differences in missing types, and thus the results of data imputation are not satisfactory. In order to discriminate the missing types of metabolomics data, a missing data classification model (PX-MDC) based on particle swarm algorithm and XGBoost is proposed in this paper. First, the missing values in a given missing data set are obtained by panning the missing values to obtain the largest subset of complete data, and then the particle swarm algorithm is used to search for the concentration threshold of missing data and the proportion of low concentration deletions as a percentage of overall deletions. Next, the missing data are simulated based on the search results. Finally, the training data are trained using the XGBoost model using the feature set proposed in this paper in order to build a classifier for the missing data. The experimental results show that the particle swarm algorithm is able to match the traditional enumeration method in terms of accuracy and significantly reduce the search time in concentration threshold search. Compared with the current mainstream methods, the PX-MDC model designed in this paper exhibits higher accuracy and is able to distinguish different deletion types for the same metabolite. This study is expected to make an important breakthrough in metabolomics data imputation and provide strong support for research in related fields.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10762217PMC
http://dx.doi.org/10.1038/s41598-023-50646-8DOI Listing

Publication Analysis

Top Keywords

missing data
24
metabolomics data
20
data
17
particle swarm
16
data imputation
16
missing values
12
swarm algorithm
12
missing
11
types metabolomics
8
based particle
8

Similar Publications

Objectives: Factors affecting denosumab-induced hypocalcemia in male patients with osteoporosis remain unclear because of the small patient population. Nevertheless, it is important to explore male-specific risk factors. This study aimed to identify the factors affecting the development of denosumab-induced hypocalcemia in male patients with osteoporosis and compare them with those in female patients with osteoporosis.

View Article and Find Full Text PDF

Objective: To gain a more holistic understanding of oral health in the past by producing an 'Index of Oro-dental Disease' (IOD), incorporating multiple oro-dental diseases and accounting for differences in antemortem/postmortem alveolar bone and tooth loss.

Materials: UK Adult Dental Health Survey, 2009 anonymised dataset (N = 6206). Archaeological dental data from skeletal individuals from medieval and post-medieval Barton-upon-Humber, North Lincolnshire (N = 214, 1150-1855) and St James's Gardens Burial Ground, London (N = 281, 1789-1853).

View Article and Find Full Text PDF

Aim: Dynamic cancer control is a current health system priority, yet methods for achieving it are lacking. This study aims to review the application of system dynamics modeling (SDM) on cancer control and evaluate the research quality.

Methods: Articles were searched in PubMed, Web of Science, and Scopus from the inception of the study to November 15th, 2023.

View Article and Find Full Text PDF

Background: Psilocybin therapy (PT) produces rapid and persistent antidepressant effects in major depressive disorder (MDD). However, the long-term effects of PT have never been compared with gold-standard treatments for MDD such as pharmacotherapy or psychotherapy alone or in combination.

Methods: This is a 6-month follow-up study of a phase 2, double-blind, randomised, controlled trial involving patients with moderate-to-severe MDD.

View Article and Find Full Text PDF

The early symptoms of hepatocellular carcinoma patients are often subtle and easily overlooked. By the time patients exhibit noticeable symptoms, the disease has typically progressed to middle or late stages, missing optimal treatment opportunities. Therefore, discovering biomarkers is essential for elucidating their functions for the early diagnosis and prevention.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!