Int J Environ Res Public Health
May 2022
The early prediction and identification of risk factors for diabetes may prevent or delay diabetes progression. In this study, we developed an interactive online application that provides the predictive probabilities of prediabetes and diabetes in 4 years based on a Bayesian network (BN) classifier, which is an interpretable machine learning technique. The BN was trained using a dataset from the Ansung cohort of the Korean Genome and Epidemiological Study (KoGES) in 2008, with a follow-up in 2012.
View Article and Find Full Text PDFSeoul, the capital city of Korea with over 10 million residents, has been experiencing serious air pollution problems. Previous studies on source apportionment of PM2.5 in Seoul are based on measurements of chemical compositions of PM2.
View Article and Find Full Text PDFExposure to environment-polluting chemicals (EPC) is associated with the development of diabetes. Many EPCs exert toxic effects via aryl hydrocarbon receptor (AhR) and/or mitochondrial inhibition. Here we investigated if the levels of human exposure to a mixture of EPC and/or mitochondrial inhibitors could predict the development of diabetes in a prospective study, the Korean Genome and Epidemiological Study (KoGES).
View Article and Find Full Text PDFThere has been growing interest in jointly modeling correlated multivariate crash counts in road safety research over the past decade. To assess the effects of roadway characteristics or environmental factors on crash counts by severity level or by collision type, various models including multivariate Poisson regression models, multivariate negative binomial regression models, and multivariate Poisson-Lognormal regression models have been suggested. We introduce more general copula-based multivariate count regression models with correlated random effects within a Bayesian framework.
View Article and Find Full Text PDFThere has been increasing interest in assessing health effects associated with multiple air pollutants emitted by specific sources. A major difficulty with achieving this goal is that the pollution source profiles are unknown and source-specific exposures cannot be measured directly; rather, they need to be estimated by decomposing ambient measurements of multiple air pollutants. This estimation process, called multivariate receptor modeling, is challenging because of the unknown number of sources and unknown identifiability conditions (model uncertainty).
View Article and Find Full Text PDFA Bayesian model-based clustering approach is proposed for identifying differentially expressed genes in meta-analysis. A Bayesian hierarchical model is used as a scientific tool for combining information from different studies, and a mixture prior is used to separate differentially expressed genes from non-differentially expressed genes. Posterior estimation of the parameters and missing observations are done by using a simple Markov chain Monte Carlo method.
View Article and Find Full Text PDF