A typical challenge in air pollution epidemiology is to perform detailed exposure assessment for individuals for which health data are available. To address this problem, in the last few years, substantial research efforts have been placed in developing statistical methods or machine learning techniques to generate estimates of air pollution at fine spatial and temporal scales (daily, usually) with complete coverage. However, it is not clear how much the predicted exposures yielded by the various methods differ, and which method generates more reliable estimates. In this paper, we aim to address this gap by evaluating a variety of exposure modeling approaches, comparing their predictive performance. Using PM in year 2011 over the continental U.S. as a case study, we generate national maps of ambient PM concentration using: (i) ordinary least squares and inverse distance weighting; (ii) kriging; (iii) statistical downscaling models, that is, spatial statistical models that use the information contained in air quality model outputs; (iv) land use regression, that is, linear regression modeling approaches that leverage the information in Geographical Information System (GIS) covariates; and (v) machine learning methods, such as neural networks, random forests and support vector regression. We examine the various methods' predictive performance via cross-validation using Root Mean Squared Error, Mean Absolute Deviation, Pearson correlation, and Mean Spatial Pearson Correlation. Additionally, we evaluated whether factors such as, season, urbanicty, and levels of PM concentration (low, medium or high) affected the performance of the different methods. Overall, statistical methods that explicitly modeled the spatial correlation, e.g. universal kriging and the downscaler model, outperform all the other exposure assessment approaches regardless of season, urbanicity and PM concentration level. We posit that the better predictive performance of spatial statistical models over machine learning methods is due to the fact that they explicitly account for spatial dependence, thus borrowing information from neighboring observations. In light of our findings, we suggest that future exposure assessment methods for regional PM2.5 incorporate information from neighboring sites when deriving predictions at unsampled locations or attempt to account for spatial dependence.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7451200PMC
http://dx.doi.org/10.1016/j.atmosenv.2019.117130DOI Listing

Publication Analysis

Top Keywords

machine learning
16
learning methods
12
exposure assessment
12
predictive performance
12
methods
8
maps ambient
8
ambient concentration
8
air pollution
8
statistical methods
8
modeling approaches
8

Similar Publications

BMT: A Cross-Validated ThinPrep Pap Cervical Cytology Dataset for Machine Learning Model Training and Validation.

Sci Data

December 2024

Department of Pathology and Laboratory Medicine, Alpert Medical School, Brown University, Providence, RI, 02912, USA.

In the past several years, a few cervical Pap smear datasets have been published for use in clinical training. However, most publicly available datasets consist of pre-segmented single cell images, contain on-image annotations that must be manually edited out, or are prepared using the conventional Pap smear method. Multicellular liquid Pap image datasets are a more accurate reflection of current cervical screening techniques.

View Article and Find Full Text PDF

Background: High triglyceride (TG) affects and is affected of other hematological factors. The determination of serum fasted triglycerides concentrations, as part of a lipid profile, is crucial key point in hematological factors and significantly affect various systemic diseases. This study was carried out to assess the potential relation between the concentration of TG and hematological factors.

View Article and Find Full Text PDF

Generative Artificial Intelligence (AI), characterized by its ability to generate diverse forms of content including text, images, video and audio, has revolutionized many fields, including medical education. Generative AI leverages machine learning to create diverse content, enabling personalized learning, enhancing resource accessibility, and facilitating interactive case studies. This narrative review explores the integration of generative artificial intelligence (AI) into orthopedic education and training, highlighting its potential, current challenges, and future trajectory.

View Article and Find Full Text PDF

Bias in machine learning applications to address non-communicable diseases at a population-level: a scoping review.

BMC Public Health

December 2024

Upstream Lab, MAP Centre for Urban Health Solutions, Li Ka Shing Knowledge Institute, Unity Health Toronto, 30 Bond Street, Toronto, ON, M5B 1W8, Canada.

Background: Machine learning (ML) is increasingly used in population and public health to support epidemiological studies, surveillance, and evaluation. Our objective was to conduct a scoping review to identify studies that use ML in population health, with a focus on its use in non-communicable diseases (NCDs). We also examine potential algorithmic biases in model design, training, and implementation, as well as efforts to mitigate these biases.

View Article and Find Full Text PDF

Development and Validation of a Nomogram Based on Multiparametric MRI for Predicting Lymph Node Metastasis in Endometrial Cancer: A Retrospective Cohort Study.

Acad Radiol

December 2024

Department of Radiology, The First Affiliated Hospital of Guangxi Medical University, Nanning, China (Y.T., Y.W., Y.Y., X.Q., Y.H., J.L.); Key Laboratory of Early Prevention and Treatment for Regional High Frequency Tumor (Guangxi Medical University), Ministry of Education, Nanning 530021, Guangxi Zhuang Autonomous Region, PR China (J.L.). Electronic address:

Rationale And Objectives: To develop a radiomics nomogram based on clinical and magnetic resonance features to predict lymph node metastasis (LNM) in endometrial cancer (EC).

Materials And Methods: We retrospectively collected 308 patients with endometrial cancer (EC) from two centers. These patients were divided into a training set (n=155), a test set (n=67), and an external validation set (n=86).

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!