AI Article Synopsis

  • - The study addresses the issue of sample imbalance in medical datasets, which hinders the accuracy of machine learning models used for clinical diagnosis.
  • - A new hybrid sampling algorithm, combining SMOTE (to oversample minority classes) and ENN (to remove noise from majority classes), is proposed to balance datasets for missed abortion and diabetes.
  • - The results demonstrate that using this SMOTE-ENN method significantly enhances the classification performance of the Random Forest model, achieving high MCC indices of 95.6% for missed abortion and 90.0% for diabetes.

Article Abstract

Background: Clinical diagnosis based on machine learning usually uses case samples as training samples, and uses machine learning to construct disease prediction models characterized by descriptive texts of clinical manifestations. However, the problem of sample imbalance often exists in the medical field, which leads to a decrease in classification performance of the machine learning.

Methods: To solve the problem of sample imbalance in medical dataset, we propose a hybrid sampling algorithm combining synthetic minority over-sampling technique (SMOTE) and edited nearest neighbor (ENN). Firstly, the SMOTE is used to over-sampling missed abortion and diabetes datasets, so that the number of samples of the two classes is balanced. Then, ENN is used to under-sampling the over-sampled dataset to delete the "noisy sample" in the majority. Finally, Random forest is used to model and predict the sampled missed abortion and diabetes datasets to achieve an accurate clinical diagnosis.

Results: Experimental results show that Random forest has the best classification performance on missed abortion and diabetes datasets after SMOTE-ENN sampled, and the MCC index is 95.6% and 90.0%, respectively. In addition, the results of pairwise comparison and multiple comparisons show that the SMOTE-ENN is significantly better than other sampling algorithms.

Conclusion: Random forest has significantly improved all indexes on the missed abortion dataset after SMOTE-ENN sampled.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9801640PMC
http://dx.doi.org/10.1186/s12911-022-02075-2DOI Listing

Publication Analysis

Top Keywords

missed abortion
20
abortion diabetes
12
diabetes datasets
12
random forest
12
hybrid sampling
8
sampling algorithm
8
algorithm combining
8
combining synthetic
8
synthetic minority
8
minority over-sampling
8

Similar Publications

Background: Miscarriage is a common complication of pregnancy, and its underlying pathophysiologic mechanisms remains unclear. The platelet-to-lymphocyte ratio (PLR), a prothrombotic and inflammatory marker, has been controversially discussed as a potential predictor of miscarriage. This systematic review and meta-analysis aimed to assess the predictive significance of the PLR in women with miscarriage compared to healthy pregnancies.

View Article and Find Full Text PDF

Early missed abortion is defined as a pregnancy of ≤ 12 weeks in which there is a cessation of life in the developing embryo or fetus, leading to its retention within the uterine cavity without being spontaneously expelled promptly. This condition is commonly observed and significantly impacts human reproductive health. This study aimed to identify key genes related to ferroptosis that could serve as novel biomarkers for early missed abortion.

View Article and Find Full Text PDF

Case report: A rare but fatal complication of hysteroscopy-air embolism during treatment for missed abortion.

Front Med (Lausanne)

December 2024

Department of Obstetrics and Gynecology, Affiliated Hospital of Southwest Jiaotong University, The Third People's Hospital of Chengdu, Chengdu, China.

Hysteroscopic procedures complicated by air embolism (AE) are exceptionally rare occurrences in clinical practice, and there have been no previously reported cases of AE associated with hysteroscopic dilation and curettage. While the overall incidence of this complication is low, the consequences can be devastating. During early pregnancy, the unique physiological changes, such as elevated hormonal levels and increased uterine blood supply, significantly heighten the risk of AE development.

View Article and Find Full Text PDF

Society of Family Planning Clinical Recommendation: Medication management for early pregnancy loss.

Contraception

December 2024

Planned Parenthood South Atlantic, Raleigh, NC, USA and McLeod Regional Medical Center, Florence, SC, USA; University of Washington Department of Obstetrics and Gynecology, 1959 NE Pacific St, Box 356460, Seattle, WA 98005, USA; Pegasus Health Justice Center, Dallas, TX, 75207, USA; Washington University, St. Louis, MO, USA.

Early pregnancy loss (EPL), also known as miscarriage or spontaneous abortion, makes up 15-20% of all clinically recognized pregnancies. EPL is a broad term that includes intrauterine pregnancies (IUPs) with findings that suggest the pregnancy may not progress or definitely will not progress; pregnancies with a gestational sac (GS) in the lower endometrial cavity or endocervical canal in the process of expulsion; residual pregnancy tissue or persistent GS; and complete passage of the GS without residual tissue. This document addresses medication management of EPL in which the complete passage of the GS has not yet occurred, including pregnancies concerning for and diagnostic of EPL (sometimes called "missed abortion") and EPL in progress.

View Article and Find Full Text PDF

Ectopic pregnancies are uncommon among women presenting for abortion. However, where suspected, rapid referral for definitive diagnosis is essential to prevent harm. We assessed whether implementation of a standardised decision-making tool, an algorithmic representation of the clinical decisions and associated actions defined in policy, in a British abortion service was associated with a reduction in missed opportunities to escalate care where indicated.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!