Tree-Values: Selective Inference for Regression Trees.

J Mach Learn Res

Departments of Statistics and Biostatistics, University of Washington, Seattle, WA 98195, USA.

Published: January 2022

We consider conducting inference on the output of the Classification and Regression Tree (CART) (Breiman et al., 1984) algorithm. A naive approach to inference that does not account for the fact that the tree was estimated from the data will not achieve standard guarantees, such as Type 1 error rate control and nominal coverage. Thus, we propose a selective inference framework for conducting inference on a fitted CART tree. In a nutshell, we condition on the fact that the tree was estimated from the data. We propose a test for the difference in the mean response between a pair of terminal nodes that controls the selective Type 1 error rate, and a confidence interval for the mean response within a single terminal node that attains the nominal selective coverage. Efficient algorithms for computing the necessary conditioning sets are provided. We apply these methods in simulation and to a dataset involving the association between portion control interventions and caloric intake.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10933572PMC

Publication Analysis

Top Keywords

selective inference
8
conducting inference
8
fact tree
8
tree estimated
8
estimated data
8
type error
8
error rate
8
inference
5
tree-values selective
4
inference regression
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!