Publications by authors named "Hunyong Cho"

Machine learning (ML) has seen impressive growth in health science research due to its capacity for handling complex data to perform a range of tasks, including unsupervised learning, supervised learning, and reinforcement learning. To aid health science researchers in understanding the strengths and limitations of ML and to facilitate its integration into their studies, we present here a guideline for integrating ML into an analysis through a structured framework, covering steps from framing a research question to study design and analysis techniques for specialized data types.

View Article and Find Full Text PDF

Understanding the function of the human microbiome is important but the development of statistical methods specifically for the microbial gene expression (i.e. metatranscriptomics) is in its infancy.

View Article and Find Full Text PDF
Article Synopsis
  • Streptococcus mutans is identified as the main culprit behind childhood tooth decay, but its interactions with other bacteria in dental plaque are not fully understood.
  • Researchers examined biofilms from 416 preschoolers and found 16 types of bacteria linked to caries, highlighting Selenomonas sputigena's role in forming complex structures with S. mutans.
  • S. sputigena cooperates with S. mutans to worsen tooth decay, showing that while it cannot cause decay alone, it significantly enhances the virulence of S. mutans when they coexist.
View Article and Find Full Text PDF
Article Synopsis
  • The text discusses a new reinforcement learning method designed to optimize dynamic treatment plans for survival outcomes while handling dependent censoring.
  • This method allows for flexibility in treatment options and stages, and aims to improve either average survival time or the likelihood of survival at a specific moment.
  • Based on simulations and real data from a health study, the new approach shows better expected results compared to previous methods.
View Article and Find Full Text PDF

The zero-inflated negative binomial distribution has been widely used for count data analyses in various biomedical settings due to its capacity of modeling excess zeros and overdispersion. When there are correlated count variables, a bivariate model is essential for understanding their full distributional features. Examples include measuring correlation of two genes in sparse single-cell RNA sequencing data and modeling dental caries count indices on two different tooth surface types.

View Article and Find Full Text PDF

Integration of multi-omics data is a challenging but necessary step to advance our understanding of the biology underlying human health and disease processes. To date, investigations seeking to integrate multi-omics (e.g.

View Article and Find Full Text PDF

Integration of multi-omics data is a challenging but necessary step to advance our understanding of the biology underlying human health and disease processes. To date, investigations seeking to integrate multi-omics (e.g.

View Article and Find Full Text PDF

Background: Precision medicine is an emerging field that involves the selection of treatments based on patients' individual prognostic data. It is formalized through the identification of individualized treatment rules (ITRs) that maximize a clinical outcome. When the type of outcome is time-to-event, the correct handling of censoring is crucial for estimating reliable optimal ITRs.

View Article and Find Full Text PDF

We propose interval censored recursive forests (ICRF), an iterative tree ensemble method for interval censored survival data. This nonparametric regression estimator addresses the splitting bias problem of existing tree-based methods and iteratively updates survival estimates in a self-consistent manner. Consistent splitting rules are developed for interval censored data, convergence is monitored using out-of-bag samples, and kernel-smoothing is applied.

View Article and Find Full Text PDF

Microbiome data are becoming increasingly available in large health cohorts, yet metabolomics data are still scant. While many studies generate microbiome data, they lack matched metabolomics data or have considerable missing proportions of metabolites. Since metabolomics is key to understanding microbial and general biological activities, the possibility of imputing individual metabolites or inferring metabolomics pathways from microbial taxonomy or metagenomics is intriguing.

View Article and Find Full Text PDF

: This experimental study investigated bacterial microbiome and metabolome longitudinal changes associated with enamel caries lesion progression and arrest. : We induced natural caries activity in three caries-free volunteers prior to four premolar extractions for orthodontic reasons. The experimental model included placement of a modified orthodontic band on smooth surfaces and a mesh on occlusal surfaces.

View Article and Find Full Text PDF

The purpose of the study was to develop and evaluate an automated machine learning algorithm (AutoML) for children's classification according to early childhood caries (ECC) status. Clinical, demographic, behavioral, and parent-reported oral health status information for a sample of 6,404 three- to five-year-old children (mean age equals 54 months) participating in an epidemiologic study of early childhood oral health in North Carolina was used. ECC prevalence (decayed, missing, and filled primary teeth surfaces [dmfs] score greater than zero, using an International Caries Detection and Assessment System score greater than or equal to three caries lesion detection threshold) was 54 percent.

View Article and Find Full Text PDF

Early childhood caries (ECC) is an aggressive form of dental caries occurring in the first five years of life. Despite its prevalence and consequences, little progress has been made in its prevention and even less is known about individuals' susceptibility or genomic risk factors. The genome-wide association study (GWAS) of ECC ("ZOE 2.

View Article and Find Full Text PDF

Receiver operating characteristic curves are widely used as a measure of accuracy of diagnostic tests and can be summarised using the area under the receiver operating characteristic curve (AUC). Often, it is useful to construct a confidence interval for the AUC; however, because there are a number of different proposed methods to measure variance of the AUC, there are thus many different resulting methods for constructing these intervals. In this article, we compare different methods of constructing Wald-type confidence interval in the presence of missing data where the missingness mechanism is ignorable.

View Article and Find Full Text PDF

Early childhood caries (ECC) is a biofilm-mediated disease. Social, environmental, and behavioral determinants as well as innate susceptibility are major influences on its incidence; however, from a pathogenetic standpoint, the disease is defined and driven by oral dysbiosis. In other words, the disease occurs when the natural equilibrium between the host and its oral microbiome shifts toward states that promote demineralization at the biofilm-tooth surface interface.

View Article and Find Full Text PDF