Background: About 90% of patients who have diabetes suffer from Type 2 DM (T2DM). Many studies suggest using the significant role of lncRNAs to improve the diagnosis of T2DM. Machine learning and Data Mining techniques are tools that can improve the analysis and interpretation or extraction of knowledge from the data. These techniques may enhance the prognosis and diagnosis associated with reducing diseases such as T2DM. We applied four classification models, including K-nearest neighbor (KNN), support vector machine (SVM), logistic regression, and artificial neural networks (ANN) for diagnosing T2DM, and we compared the diagnostic power of these algorithms with each other. We performed the algorithms on six LncRNA variables (LINC00523, LINC00995, HCG27_201, TPT1-AS1, LY86-AS1, DKFZP) and demographic data.

Results: To select the best performance, we considered the AUC, sensitivity, specificity, plotted the ROC curve, and showed the average curve and range. The mean AUC for the KNN algorithm was 91% with 0.09 standard deviation (SD); the mean sensitivity and specificity were 96 and 85%, respectively. After applying the SVM algorithm, the mean AUC obtained 95% after stratified 10-fold cross-validation, and the SD obtained 0.05. The mean sensitivity and specificity were 95 and 86%, respectively. The mean AUC for ANN and the SD were 93% and 0.03, also the mean sensitivity and specificity were 78 and 85%. At last, for the logistic regression algorithm, our results showed 95% of mean AUC, and the SD of 0.05, the mean sensitivity and specificity were 92 and 85%, respectively. According to the ROCs, the Logistic Regression and SVM had a better area under the curve compared to the others.

Conclusion: We aimed to find the best data mining approach for the prediction of T2DM using six lncRNA expression. According to the finding, the maximum AUC dedicated to SVM and logistic regression, among others, KNN and ANN also had the high mean AUC and small standard deviations of AUC scores among the approaches, KNN had the highest mean sensitivity and the highest specificity belonged to SVM. This study's result could improve our knowledge about the early detection and diagnosis of T2DM using the lncRNAs as biomarkers.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7451240PMC
http://dx.doi.org/10.1186/s12859-020-03719-8DOI Listing

Publication Analysis

Top Keywords

sensitivity specificity
20
data mining
16
logistic regression
16
specificity 85%
12
diagnosis t2dm
8
svm logistic
8
auc
8
005 sensitivity
8
t2dm
6
sensitivity
6

Similar Publications

Background: Reduced well-being and depressive episodes frequently complicate pregnancy and can result in serious adverse outcomes for both mother and infant if left untreated. This study aimed to assess the psychometric validity of the 5-item World Health Organization index (WHO-5), and to evaluate if the WHO-5 index can serve as a proxy for two items of core depressive symptoms from the Major Depression Inventory (MDI), identified as MDI-2. Additionally, the paper aimed to assess well-being and detect risk factors of reduced well-being using the WHO-5 index.

View Article and Find Full Text PDF

Metabolomic in severe traumatic brain injury: exploring primary, secondary injuries, diagnosis, and severity.

Crit Care

January 2025

Department of Critical Care Medicine, Cumming School of Medicine, Health Research Innovation Center (HRIC), University of Calgary, Room 4C64, 3280 Hospital Drive N.W., Calgary, AB, T2N 4Z6, Canada.

Background: Traumatic brain injury (TBI) is a major public health concern worldwide, contributing to high rates of injury-related death and disability. Severe traumatic brain injury (sTBI), although it accounts for only 10% of all TBI cases, results in a mortality rate of 30-40% and a significant burden of disability in those that survive. This study explored the potential of metabolomics in the diagnosis of sTBI and explored the potential of metabolomics to examine probable primary and secondary brain injury in sTBI.

View Article and Find Full Text PDF

Background: Chronic obstructive pulmonary disease (COPD) and asthma are the two most prevalent chronic respiratory diseases, significantly impacting public health. Utilizing clinical questionnaires to identify and differentiate patients with COPD and asthma for further diagnostic procedures has emerged as an effective strategy to address this issue. We developed a new diagnostic tool, the COPD-Asthma Differentiation Questionnaire (CAD-Q), to differentiate between COPD and asthma in adults.

View Article and Find Full Text PDF

Background: To assess the value of combined Monocyte Distribution Width (MDW) and Procalcitonin (PCT) detection in diagnosing and predicting neonatal sepsis outcomes.

Methods: This retrospective study, conducted from January 2022 to December 2023.A retrospective analysis of 39 neonatal sepsis and 30 non-infectious systemic inflammatory response syndrome (SIRS) cases was conducted.

View Article and Find Full Text PDF

Background: Numerous noncontrast computed tomography (NCCT) markers have been reported and validated as effective predictors of hematoma expansion (HE). Our objective was to develop and validate a score based on NCCT markers and clinical characteristics to predict risk of HE in acute intracerebral hemorrhage (ICH) patients.

Methods: We prospectively collected spontaneous ICH patients at the First Affiliated Hospital of Chongqing Medical University to form the development cohort (n = 395) and at the Third Affiliated Hospital of Chongqing Medical University to establish the validation cohort (n = 139).

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!