Learning dependence relationships among variables of mixed types provides insights in a variety of scientific settings and is a well-studied problem in statistics. Existing methods, however, typically rely on copious, high-quality data to learn associations accurately. In this paper, we develop a method for scientific settings where learning dependence structure is essential, but data are sparse and have a high fraction of missing values. Specifically, our work is motivated by survey-based cause-of-death assessments known as verbal autopsies (VAs). We propose a Bayesian approach to characterize dependence relationships using a latent Gaussian graphical model that incorporates informative priors on the marginal distributions of the variables. We demonstrate that such prior information can improve estimation of the dependence structure, especially in settings with little training data. We show that our method can be integrated into existing probabilistic cause-of-death assignment algorithms, where it improves model performance while recovering dependence patterns between symptoms that can inform efficient questionnaire design in future data collection.
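
As a rough illustration of the latent Gaussian idea mentioned in the abstract, the sketch below generates binary indicators by thresholding a latent Gaussian vector whose precision matrix encodes a graph. It is not the authors' implementation: the function name `simulate_binary_symptoms`, the three-variable chain graph, and the marginal probabilities are all hypothetical, and fixed marginal probabilities stand in for the informative priors on the marginals described above.

```python
import numpy as np
from statistics import NormalDist


def simulate_binary_symptoms(precision, marginal_probs, n, seed=0):
    """Draw binary indicators by thresholding a latent Gaussian vector.

    Zeros in `precision` correspond to missing edges in the latent graph.
    `marginal_probs[j]` is the assumed marginal probability that
    indicator j equals 1 (a stand-in for prior marginal information).
    """
    rng = np.random.default_rng(seed)
    cov = np.linalg.inv(precision)
    # Rescale to a correlation matrix so each latent variable has unit variance.
    scale = np.sqrt(np.diag(cov))
    corr = cov / np.outer(scale, scale)
    z = rng.multivariate_normal(np.zeros(len(scale)), corr, size=n)
    # Choose thresholds t_j so that P(z_j > t_j) = marginal_probs[j].
    thresholds = np.array(
        [NormalDist().inv_cdf(1.0 - p) for p in marginal_probs]
    )
    return (z > thresholds).astype(int)


# Hypothetical chain graph 1 - 2 - 3: no direct edge between variables 1 and 3.
precision = np.array([
    [1.0, -0.4, 0.0],
    [-0.4, 1.0, -0.4],
    [0.0, -0.4, 1.0],
])
marginal_probs = [0.2, 0.5, 0.1]
x = simulate_binary_symptoms(precision, marginal_probs, n=20000)
```

In this sketch the marginal probabilities fix only the thresholds, while the dependence between indicators comes entirely from the latent correlation matrix. That separation is what allows outside information about the marginals to be supplied independently of the graph structure.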

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7709479
DOI: http://dx.doi.org/10.1214/19-ba1172

