Identifying β-thalassemia carriers using a data mining approach: The case of the Gaza Strip, Palestine.

Artif Intell Med

King Abdullah II School for Information Technology, The University of Jordan, Amman, Jordan. Electronic address:

Published: June 2018

Thalassemia is considered one of the most common genetic blood disorders that has received excessive attention in the medical research fields worldwide. Under this context, one of the greatest challenges for healthcare professionals is to correctly differentiate normal individuals from asymptomatic thalassemia carriers. Usually, thalassemia diagnosis is based on certain measurable characteristic changes to blood cell counts and related indices. These characteristic changes can be derived easily when performing a complete blood count test (CBC) using a special fully automated blood analyzer or counter. However, the reliability of the CBC test alone is questionable with possible candidate characteristics that could be seen in other disorders, leading to misdiagnosis of thalassemia. Therefore, other costly and time-consuming tests should be performed that may cause serious consequences due to the delay in the correct diagnosis. To help overcoming these challenging diagnostic issues, this work presents a new novel dataset collected from Palestine Avenir Foundation for persons tested for thalassemia. We aim to compile a gold standard dataset for thalassemia and make it available for researchers in this field. Moreover, we use this dataset to predict the specific type of thalassemia known as beta thalassemia (β-thalassemia) based on hybrid data mining model. The proposed model consists of two main steps. First, to overcome the problem of the highly imbalanced class distribution in the dataset, a balancing technique called SMOTE is proposed and applied to handle this problem. In the second step, four classification models, namely k-nearest neighbors (k-NN), naïve Bayesian (NB), decision tree (DT) and the multilayer perceptron (MLP) neural network are used to differentiate between normal persons and those patients carrying β-thalassemia. Different evaluation metrics are used to assess the performance of the proposed model. The experimental results show that the SMOTE oversampling method can effectively improve the identification ratio of β-thalassemia carriers in a highly imbalanced class distribution. The results reveal also that the NB classifier achieved the best performance in differentiating between normal and β-thalassemia carriers at oversampling SMOTE ratio of 400%. This combination shows a specificity of 99.47% and a sensitivity of 98.81%.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.artmed.2018.04.009DOI Listing

Publication Analysis

Top Keywords

β-thalassemia carriers
12
data mining
8
thalassemia
8
differentiate normal
8
characteristic changes
8
proposed model
8
highly imbalanced
8
imbalanced class
8
class distribution
8
identifying β-thalassemia
4

Similar Publications

Importance: Disease characteristics of genetically mediated coronary artery disease (CAD) on coronary angiography and the association of genomic risk with outcomes after coronary angiography are not well understood.

Objective: To assess the angiographic characteristics and risk of post-coronary angiography outcomes of patients with genomic drivers of CAD: familial hypercholesterolemia (FH), high polygenic risk score (PRS), and clonal hematopoiesis of indeterminate potential (CHIP).

Design, Setting, And Participants: A retrospective cohort study of 3518 Mass General Brigham Biobank participants with genomic information who underwent coronary angiography was conducted between July 18, 2000, and August 1, 2023.

View Article and Find Full Text PDF

Silymarin: a promising modulator of apoptosis and survival signaling in cancer.

Discov Oncol

January 2025

Department of Biotechnology, School of Bio Sciences and Technology (SBST), Vellore Institute of Technology (VIT), Vellore, 632014, India.

Cancer, one of the deadliest diseases, has remained the epicenter of biological research for more than seven decades. Yet all the efforts for a perfect therapeutic cure come with certain limitations. The use of medicinal plants and their phytochemicals as therapeutics has received much attention in recent years.

View Article and Find Full Text PDF

γ-Glutamylcysteine (γ-EC) can increase intracellular glutathione (GSH) levels, which may prevent and alleviate age-related disorders and chronic diseases caused by oxidative damage. However, the commercial availability of γ-EC remains limited owing to its complex chemical synthesis from glutamate and cysteine. In this study, we have developed the method of the effective conversion of GSH to γ-EC to achieve the optimal reaction conditions for repeated batch production and potential application in industrial γ-EC production using the phytochelatin synthase-like enzyme NsPCS.

View Article and Find Full Text PDF

Extracellular vesicles (EVs) have been demonstrated to own the advantages in evading phagocytosis, crossing biological barriers, and possessing excellent biocompatibility and intrinsic stability. Based on these characteristics, EVs have been used as effective therapeutic carriers for drug delivery, but the low drug loading capacity greatly limits further applications. Herein, we developed a drug loading method based on cell-penetrating peptide (CPP) to enhance the encapsulation of therapeutic reagents in EVs, and EVs-based drug delivery system achieved higher killing efficacy to tumor cells.

View Article and Find Full Text PDF

Pinch-off dynamics of emulsion filaments before and after polymerization of the internal phase.

Soft Matter

January 2025

Department of Mechanical and Aerospace Engineering, Princeton University, Princeton, NJ 08544, USA.

The capillary break-up of complex fluid filaments occurs in many scientific and industrial applications, particularly in bio-printing where both liquid and polymerized droplets exist in the fluid. The simultaneous presence of fluid and solid particles within a carrier fluid and their interactions lead to deviations in the filament break-up from the well-established capillary breakup dynamics of single-phase liquids. To examine the significance of the dispersed phase and the internal interactions between liquid droplets and solid particles, we prepare emulsions through photopolymerization and conduct experimental investigations into the pinch-off dynamics of fluid filaments, focusing on the impact of varying concentrations of liquid droplets (before polymerization) and polymerized droplets.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!