AI Article Synopsis

  • The review assesses the use of machine learning models for diagnostic purposes using text data, emphasizing the importance of diverse study populations in medical informatics.
  • Out of 2,260 papers reviewed, 78 were included; the most common model used was neural networks, and the majority of studies were conducted on predominantly White patient populations.
  • The discussion highlights the need for comprehensive demographic data to avoid potential biases in machine learning algorithms as the reliance on these technologies in clinical settings increases.

Article Abstract

Objective: As the storage of clinical data has transitioned into electronic formats, medical informatics has become increasingly relevant in providing diagnostic aid. The purpose of this review is to evaluate machine learning models that use text data for diagnosis and to assess the diversity of the included study populations.

Methods: We conducted a systematic literature review on three public databases. Two authors reviewed every abstract for inclusion. Articles were included if they used or developed machine learning algorithms to aid in diagnosis. Articles focusing on imaging informatics were excluded.

Results: From 2,260 identified papers, we included 78. Of the machine learning models used, neural networks were relied upon most frequently (44.9%). Studies had a median population of 661.5 patients, and diseases and disorders of 10 different body systems were studied. Of the 35.9% ( = 28) of papers that included race data, 57.1% ( = 16) of study populations were majority White, 14.3% were majority Asian, and 7.1% were majority Black. In 75% ( = 21) of papers, White was the largest racial group represented. Of the papers included, 43.6% ( = 34) included the sex ratio of the patient population.

Discussion: With the power to build robust algorithms supported by massive quantities of clinical data, machine learning is shaping the future of diagnostics. Limitations of the underlying data create potential biases, especially if patient demographics are unknown or not included in the training.

Conclusion: As the movement toward clinical reliance on machine learning accelerates, both recording demographic information and using diverse training sets should be emphasized. Extrapolating algorithms to demographics beyond the original study population leaves large gaps for potential biases.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9132735PMC
http://dx.doi.org/10.1055/s-0042-1749119DOI Listing

Publication Analysis

Top Keywords

machine learning
24
papers included
12
clinical data
8
learning models
8
potential biases
8
included
7
learning
6
data
5
machine
5
diversity machine
4

Similar Publications

Triple-negative breast cancer (TNBC) remains a significant global health challenge, emphasizing the need for precise identification of patients with specific therapeutic targets and those at high risk of metastasis. This study aimed to identify novel therapeutic targets for personalized treatment of TNBC patients by elucidating their roles in cell cycle regulation. Using weighted gene co-expression network analysis (WGCNA), we identified 83 hub genes by integrating gene expression profiles with clinical pathological grades.

View Article and Find Full Text PDF

The rising incidence of pancreatic diseases, including acute and chronic pancreatitis and various pancreatic neoplasms, poses a significant global health challenge. Pancreatic ductal adenocarcinoma (PDAC) for example, has a high mortality rate due to late-stage diagnosis and its inaccessible location. Advances in imaging technologies, though improving diagnostic capabilities, still necessitate biopsy confirmation.

View Article and Find Full Text PDF

Proteins' flexibility is a feature in communicating changes in cell signaling instigated by binding with secondary messengers, such as calcium ions, associated with the coordination of muscle contraction, neurotransmitter release, and gene expression. When binding with the disordered parts of a protein, calcium ions must balance their charge states with the shape of calcium-binding proteins and their versatile pool of partners depending on the circumstances they transmit. Accurately determining the ionic charges of those ions is essential for understanding their role in such processes.

View Article and Find Full Text PDF

Background: Alzheimer's disease (AD) is a progressive neurodegenerative disorder affecting millions worldwide, leading to cognitive and functional decline. Early detection and intervention are crucial for enhancing the quality of life of patients and their families. Remote Monitoring Technologies (RMTs) offer a promising solution for early detection by tracking changes in behavioral and cognitive functions, such as memory, language, and problem-solving skills.

View Article and Find Full Text PDF

Purpose: To develop and validate a prostate-specific membrane antigen (PSMA) PET/CT based multimodal deep learning model for predicting pathological lymph node invasion (LNI) in prostate cancer (PCa) patients identified as candidates for extended pelvic lymph node dissection (ePLND) by preoperative nomograms.

Methods: [Ga]Ga-PSMA-617 PET/CT scan of 116 eligible PCa patients (82 in the training cohort and 34 in the test cohort) who underwent radical prostatectomy with ePLND were analyzed in our study. The Med3D deep learning network was utilized to extract discriminative features from the entire prostate volume of interest on the PET/CT images.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!