Publications by authors named "Yilin Ning"

Background: Traditional clustering techniques are typically restricted to either continuous or categorical variables. However, most real-world clinical data are mixed type. This study aims to introduce a clustering technique specifically designed for datasets containing both continuous and categorical variables to offer better clustering compatibility, adaptability, and interpretability than other mixed type techniques.

View Article and Find Full Text PDF
Article Synopsis
  • - Federated learning (FL) is a method that enhances data privacy in healthcare collaborations, with roots in both engineering and statistics, and a need for better recognition of statistical privacy-preserving algorithms.
  • - The study compared seven FL frameworks from both domains, evaluating their performance using logistic regression and Lasso modeling on simulated and real-world data, revealing statistical FL algorithms yield less biased estimates while engineering methods can offer better predictions.
  • - The research highlights strengths and weaknesses of both FL methods, suggesting their selection based on specific study needs, and calls for increased awareness and integration of these techniques in future healthcare applications.
View Article and Find Full Text PDF

The escalating integration of machine learning in high-stakes fields such as healthcare raises substantial concerns about model fairness. We propose an interpretable framework, fairness-aware interpretable modeling (FAIM), to improve model fairness without compromising performance, featuring an interactive interface to identify a "fairer" model from a set of high-performing models and promoting the integration of data-driven evidence and clinical expertise to enhance contextualized fairness. We demonstrate FAIM's value in reducing intersectional biases arising from race and sex by predicting hospital admission with two real-world databases, the Medical Information Mart for Intensive Care IV Emergency Department (MIMIC-IV-ED) and the database collected from Singapore General Hospital Emergency Department (SGH-ED).

View Article and Find Full Text PDF
Article Synopsis
  • Cognitive biases in clinical decision-making lead to misdiagnoses and hurt patient outcomes, making it essential to find solutions in the medical field.
  • The study explores how large language models (like GPT-4) can help reduce these biases through a multi-agent framework that simulates clinical team interactions to enhance diagnostic accuracy.
  • Results showed that while initial diagnoses were completely inaccurate, using the multi-agent framework improved the accuracy of final differential diagnoses to 76%, indicating its potential as a valuable tool in medical decision-making.
View Article and Find Full Text PDF

Artificial intelligence (AI) has been extensively researched in medicine, but its practical application remains limited. Meanwhile, there are various disparities in existing AI-enabled clinical studies, which pose a challenge to global health equity. In this study, we conducted an in-depth analysis of the geo-economic distribution of 159 AI-enabled clinical studies, as well as the gender disparities among these studies.

View Article and Find Full Text PDF

Machine learning (ML) methods are increasingly used to assess variable importance, but such black box models lack stability when limited in sample sizes, and do not formally indicate non-important factors. The Shapley variable importance cloud (ShapleyVIC) addresses these limitations by assessing variable importance from an ensemble of regression models, which enhances robustness while maintaining interpretability, and estimates uncertainty of overall importance to formally test its significance. In a clinical study, ShapleyVIC reasonably identified important variables when the random forest and XGBoost failed to, and generally reproduced the findings from smaller subsamples (n = 2500 and 500) when statistical power of the logistic regression became attenuated.

View Article and Find Full Text PDF

Importance: Clinical trial results of topical atropine eye drops for childhood myopia control have shown inconsistent outcomes across short-term studies, with little long-term safety or other outcomes reported.

Objective: To report the long-term safety and outcomes of topical atropine for childhood myopia control.

Design, Setting, And Participants: This prospective, double-masked observational study of the Atropine for the Treatment of Myopia (ATOM) 1 and ATOM2 randomized clinical trials took place at 2 single centers and included adults reviewed in 2021 through 2022 from the ATOM1 study (atropine 1% vs placebo; 1999 through 2003) and the ATOM2 study (atropine 0.

View Article and Find Full Text PDF

Artificial intelligence (AI) and machine learning (ML) have potential to revolutionize emergency medical care by enhancing triage systems, improving diagnostic accuracy, refining prognostication, and optimizing various aspects of clinical care. However, as clinicians often lack AI expertise, they might perceive AI as a "black box," leading to trust issues. To address this, "explainable AI," which teaches AI functionalities to end-users, is important.

View Article and Find Full Text PDF

Artificial intelligence (AI) has demonstrated the ability to extract insights from data, but the fairness of such data-driven insights remains a concern in high-stakes fields. Despite extensive developments, issues of AI fairness in clinical contexts have not been adequately addressed. A fair model is normally expected to perform equally across subgroups defined by sensitive variables (e.

View Article and Find Full Text PDF

Objective: We propose FedScore, a privacy-preserving federated learning framework for scoring system generation across multiple sites to facilitate cross-institutional collaborations.

Materials And Methods: The FedScore framework includes five modules: federated variable ranking, federated variable transformation, federated score derivation, federated model selection and federated model evaluation. To illustrate usage and assess FedScore's performance, we built a hypothetical global scoring system for mortality prediction within 30 days after a visit to an emergency department using 10 simulated sites divided from a tertiary hospital in Singapore.

View Article and Find Full Text PDF

Objectives: Federated learning (FL) has gained popularity in clinical research in recent years to facilitate privacy-preserving collaboration. Structured data, one of the most prevalent forms of clinical data, has experienced significant growth in volume concurrently, notably with the widespread adoption of electronic health records in clinical practice. This review examines FL applications on structured medical data, identifies contemporary limitations, and discusses potential innovations.

View Article and Find Full Text PDF

Objective: The proper handling of missing values is critical to delivering reliable estimates and decisions, especially in high-stakes fields such as clinical research. In response to the increasing diversity and complexity of data, many researchers have developed deep learning (DL)-based imputation techniques. We conducted a systematic review to evaluate the use of these techniques, with a particular focus on the types of data, intending to assist healthcare researchers from various disciplines in dealing with missing data.

View Article and Find Full Text PDF
Article Synopsis
  • The AutoScore framework helps create clinical scores using data so doctors can make better decisions.
  • The guide shows how to set up the AutoScore package and explains important steps like processing data and choosing the right information.
  • It also covers how to improve and test these scores, making sure they are clear and easy to understand for everyone.
View Article and Find Full Text PDF
Article Synopsis
  • This study investigates various factors linked to mild cognitive impairment (MCI) in Parkinson's disease, using a method called Shapley variable importance cloud (ShapleyVIC) to analyze their significance collectively.
  • Out of 41 examined variables, 22 were found to be important, and 8 were selected as key risk factors for PD-MCI, including education level, hypertension history, motor scores, triglyceride levels, and a specific genetic marker (SNCA rs6826785).
  • The findings suggest that managing triglyceride and apolipoprotein A1 levels may help prevent cognitive impairment in early Parkinson’s patients, while the genetic marker could serve as a potential target for new treatments.
View Article and Find Full Text PDF

Risk scores are widely used for clinical decision making and commonly generated from logistic regression models. Machine-learning-based methods may work well for identifying important predictors to create parsimonious scores, but such 'black box' variable selection limits interpretability, and variable importance evaluated from a single model can be biased. We propose a robust and interpretable variable selection approach using the recently developed Shapley variable importance cloud (ShapleyVIC) that accounts for variability in variable importance across models.

View Article and Find Full Text PDF

Background: Risk prediction models are useful tools in clinical decision-making which help with risk stratification and resource allocations and may lead to a better health care for patients. AutoScore is a machine learning-based automatic clinical score generator for binary outcomes. This study aims to expand the AutoScore framework to provide a tool for interpretable risk prediction for ordinal outcomes.

View Article and Find Full Text PDF

Background: Despite the ease of interpretation and communication of a risk ratio (RR), and several other advantages in specific settings, the odds ratio (OR) is more commonly reported in epidemiological and clinical research. This is due to the familiarity of the logistic regression model for estimating adjusted ORs from data gathered in a cross-sectional, cohort or case-control design. The preservation of the OR (but not RR) in case-control samples has contributed to the perception that it is the only valid measure of relative risk from case-control samples.

View Article and Find Full Text PDF

Background: Survival with favorable neurological outcomes is an important indicator of successful resuscitation in out-of-hospital cardiac arrest (OHCA). We sought to validate the CaRdiac Arrest Survival Score (CRASS), derived using data from the German Resuscitation Registry, in predicting the likelihood of good neurological outcomes after OHCA in Singapore.

Methods: We conducted a retrospective population-based validation study among EMS-attended OHCA patients (≥18 years) in Singapore, using data from the prospective Pan-Asian Resuscitation Outcomes Study registry.

View Article and Find Full Text PDF

Interpretable machine learning has been focusing on explaining final models that optimize performance. The state-of-the-art Shapley additive explanations (SHAP) locally explains the variable impact on individual predictions and has recently been extended to provide global assessments across the dataset. Our work further extends "global" assessments to a set of models that are "good enough" and are practically as relevant as the final model to a prediction task.

View Article and Find Full Text PDF

Background: Medical decision-making impacts both individual and public health. Clinical scores are commonly used among various decision-making models to determine the degree of disease deterioration at the bedside. AutoScore was proposed as a useful clinical score generator based on machine learning and a generalized linear model.

View Article and Find Full Text PDF

Background: Emergency readmission poses an additional burden on both patients and healthcare systems. Risk stratification is the first step of transitional care interventions targeted at reducing readmission. To accurately predict the short- and intermediate-term risks of readmission and provide information for further temporal risk stratification, we developed and validated an interpretable machine learning risk scoring system.

View Article and Find Full Text PDF

Background: Bystander cardiopulmonary resuscitation (BCPR) is a critical component of the 'chain of survival' in reducing mortality among out-of-hospital cardiac arrest (OHCA) victims. Inconsistent findings on gender disparities among adult recipients of layperson BCPR have been reported in the literature. We aimed to fill this knowledge gap by investigating the extent of gender disparities in a cross-national setting within Pan-Asian communities.

View Article and Find Full Text PDF