Essential thrombocythemia (ET) is a type of myeloproliferative neoplasm that increases the risk of thrombosis. To diagnose this disease, the analysis of mutations in the Janus Kinase 2 (JAK2), thrombopoietin receptor (MPL), or calreticulin (CALR) gene is recommended. Disease poses diagnostic challenges due to overlapping mutations with other neoplasms and the presence of triple-negative cases. This study explores the potential of Raman spectroscopy combined with machine learning for ET diagnosis. We assessed two laser wavelengths (785, 1064 nm) to differentiate between ET patients and healthy controls. The PCR results indicate that approximately 50% of patients in our group have a mutation in the JAK2 gene, while only 5% of patients harbor a mutation in the ASXL1 gene. Additionally, only one patient had a mutation in the IDH1 and one had a mutation in IDH2 gene. Consequently, patients having no mutations were also observed in our group, making diagnosis challenging. Raman spectra at 1064 nm showed lower amide, polysaccharide, and lipid vibrations in ET patients, while 785 nm spectra indicated significant decreases in amide II and C-H lipid vibrations. Principal Component Analysis (PCA) confirmed that both wavelengths could distinguish ET from healthy subjects. Support Vector Machine (SVM) analysis revealed that the 800-1800 cm range provided the highest diagnostic accuracy, with 89% for 785 nm and 72% for 1064 nm. These findings suggest that FT-Raman spectroscopy, paired with multivariate and machine learning analyses, offers a promising method for diagnosing ET with high accuracy by detecting specific molecular changes in serum. Principal Component Analysis (PCA) confirmed that both wavelengths could distinguish ET from healthy subjects. Support Vector Machine (SVM) analysis revealed that the 800-1800 cm range provided the highest diagnostic accuracy, with 89% for 785 nm and 72% for 1064 nm. These findings suggest that FT-Raman spectroscopy, paired with multivariate and machine learning analyses, offers a promising method for diagnosing ET with high accuracy by detecting specific molecular changes in serum.

Download full-text PDF

Source
http://dx.doi.org/10.1007/s12013-024-01333-6DOI Listing

Publication Analysis

Top Keywords

machine learning
12
raman spectroscopy
8
essential thrombocythemia
8
laser wavelengths
8
lipid vibrations
8
principal component
8
component analysis
8
analysis pca
8
pca confirmed
8
confirmed wavelengths
8

Similar Publications

Estimation of Diagnostic Test Accuracy Without Gold Standards.

Stat Med

February 2025

Department of Biostatistics and Beijing International Center for Mathematical Research, Peking University, Beijing, China.

The ideal evaluation of diagnostic test performance requires a reference test that is free of errors. However, for many diseases, obtaining such a "gold standard" reference is either impossible or prohibitively expensive. Estimating test accuracy in the absence of a gold standard is therefore a significant challenge.

View Article and Find Full Text PDF

The rapid growth of Internet of Things (IoT) devices necessitates efficient data compression techniques to manage the vast amounts of data they generate. Chemiresistive sensor arrays (CSAs), a simple yet essential component in IoT systems, produce large datasets due to their simultaneous multi-sensor operations. Classical principal component analysis (cPCA), a widely used solution for dimensionality reduction, often struggles to preserve critical information in complex datasets.

View Article and Find Full Text PDF

Raven's Coloured Progressive Matrices (CPM) is a widely used assessment tool for measuring general cognitive ability in developmental and educational research, particularly in studies involving young children. However, administering the full set of the 36-item CPM can be burdensome for young participants, hindering its practicality in large-scale studies and reducing research efficiency. In the current study, a short form of the CPM was developed based on a sample of preschoolers (n = 336, mean age = 5.

View Article and Find Full Text PDF

Efficiently extracting data from tables in the scientific literature is pivotal for building large-scale databases. However, the tables reported in materials science papers exist in highly diverse forms; thus, rule-based extractions are an ineffective approach. To overcome this challenge, the study presents MaTableGPT, which is a GPT-based table data extractor from the materials science literature.

View Article and Find Full Text PDF

Background: Long COVID, a heterogeneous condition characterized by a range of physical and neuropsychiatric presentations, can be presented with a proportion of COVID-19-infected individuals.

Methods: Transcriptomic data sets of those within gene expression profiles of COVID-19, long COVID, and healthy controls were retrieved from the GEO database. Differentially expressed genes (DEGs) falling under COVID-19 and long COVID were identified with R packages, and contemporaneously conducted module detection was performed with the Modular Pharmacology Platform (http://112.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!