Variable selection is an important statistical problem. This problem becomes more challenging when the candidate predictors are of mixed type (e.g. continuous and binary) and impact the response variable in nonlinear and/or non-additive ways. In this paper, we review existing variable selection approaches for the Bayesian additive regression trees (BART) model, a nonparametric regression model, which is flexible enough to capture the interactions between predictors and nonlinear relationships with the response. An emphasis of this review is on the ability to identify relevant predictors. We also propose two variable importance measures which can be used in a permutation-based variable selection approach, and a backward variable selection procedure for BART. We introduce these variations as a way of illustrating limitations and opportunities for improving current approaches and assess these via simulations.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11395240 | PMC |
http://dx.doi.org/10.1214/23-sts900 | DOI Listing |
JMIR Med Inform
January 2025
Department of Endocrinology and Metabolism, The First Affiliated Hospital, Jiangxi Medical College, Nanchang University, Nanchang, China.
Background: Many tools have been developed to predict the risk of diabetes in a population without diabetes; however, these tools have shortcomings that include the omission of race, inclusion of variables that are not readily available to patients, and low sensitivity or specificity.
Objective: We aimed to develop and validate an easy, systematic index for predicting diabetes risk in the Asian population.
Methods: We collected the data from the NAGALA (NAfld [nonalcoholic fatty liver disease] in the Gifu Area, Longitudinal Analysis) database.
Parasit Vectors
January 2025
Faculty of Information Technology, Mutah University, Mutah, Jordan.
Background: Amebiasis represents a significant global health concern. This is especially evident in developing countries, where infections are more common. The primary diagnostic method in laboratories involves the microscopy of stool samples.
View Article and Find Full Text PDFBMC Pregnancy Childbirth
January 2025
Department of Obstetrics and Gynecology, Division of Maternal-Fetal Medicine, University of Utah Health, 30 N. Mario Capecchi Dr., Level 5 South, Salt Lake City, UT, 84132, USA.
Background: Fetal growth restriction (FGR) is a leading risk factor for stillbirth, yet the diagnosis of FGR confers considerable prognostic uncertainty, as most infants with FGR do not experience any morbidity. Our objective was to use data from a large, deeply phenotyped observational obstetric cohort to develop a probabilistic graphical model (PGM), a type of "explainable artificial intelligence (AI)", as a potential framework to better understand how interrelated variables contribute to perinatal morbidity risk in FGR.
Methods: Using data from 9,558 pregnancies delivered at ≥ 20 weeks with available outcome data, we derived and validated a PGM using randomly selected sub-cohorts of 80% (n = 7645) and 20% (n = 1,912), respectively, to discriminate cases of FGR resulting in composite perinatal morbidity from those that did not.
BMC Pediatr
January 2025
Institute of Pediatric Endocrinology, Dana Dwek Children's Hospital, Tel Aviv Sourasky Medical Center, 6 Weizmann Street, 6423906, Tel Aviv, Israel.
Background: The diagnosis of depression or anxiety treated by SSRIs has become relatively common in women of childbearing age. However, the impact of gestational SSRI treatment on newborn thyroid function is lacking. We explored the impact of gestational SSRI treatment on newborn thyroid function as measured by the National Newborn Screening (NBS) Program and identified contributory factors.
View Article and Find Full Text PDFSci Rep
January 2025
Department of Archaeology, Faculty of History, Vilnius University, Universiteto St. 7, Vilnius, 01513, Lithuania.
This study explores how major climatic shifts, together with socioeconomic factors over the past two millennia, influenced buffer crop selection, focusing on five crops: rye, millet, buckwheat, oat, and hemp. For this study, we analyzed archaeobotanical data from 135 archaeological contexts and historical data from 242 manor inventories across the northeastern Baltic region, spanning the period from 100 to 1800 AD. Our findings revealed that rye remained a main staple crop throughout the studied periods reflecting environmental adaptation to northern latitudes.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!