In this paper, we illustrate that, when a linear model is appropriate, combining ecological data with subsample data provides three main benefits. First, including the individual-level subsample data can eliminate the biases associated with linear ecological inference. Second, supplementing the subsample data with ecological data increases the information about the parameters. Third, readily available ecological data can be used to design optimal subsampling schemes that further increase the information about the parameters. We apply this methodology to the classic problem of estimating the effect of a college degree on wages. We show that combining ecological data with subsample data provides precise estimates of this effect, and that optimal subsampling schemes (conditional on the ecological data) can provide good precision with only a fraction of the observations.
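
The first benefit can be made concrete with a small simulation. The sketch below is not the estimator developed in the paper; it only illustrates, under an assumed data-generating process, how a regression on group-level (ecological) averages can be biased while a within-group regression on individual-level subsamples is not. The names `ols_slope` and `simulate`, the contextual-effect parameter `gamma`, and all numerical settings are hypothetical choices made for illustration.

```python
import random


def ols_slope(xs, ys):
    """Slope of the simple least-squares regression of ys on xs."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


def simulate(num_groups=200, group_size=500, subsample_size=50,
             beta=1.0, gamma=2.0, seed=0):
    """Return (ecological slope, within-group subsample slope).

    Assumed model (hypothetical): individual i in group g has a binary
    covariate x (e.g. holds a degree) with group-specific probability p_g,
    and outcome y = beta * x + gamma * p_g + noise. The gamma * p_g term is
    a contextual effect that confounds the group-level regression.
    """
    rng = random.Random(seed)
    eco_x, eco_y = [], []        # group means: the ecological data
    within_x, within_y = [], []  # group-demeaned subsample data

    for _ in range(num_groups):
        p = rng.uniform(0.1, 0.9)
        xs = [1.0 if rng.random() < p else 0.0 for _ in range(group_size)]
        ys = [beta * x + gamma * p + rng.gauss(0.0, 1.0) for x in xs]

        # Ecological data: only the group averages are observed.
        eco_x.append(sum(xs) / group_size)
        eco_y.append(sum(ys) / group_size)

        # Subsample data: a simple random sample of individuals in the group.
        idx = rng.sample(range(group_size), subsample_size)
        sub_x = [xs[i] for i in idx]
        sub_y = [ys[i] for i in idx]
        mean_sx = sum(sub_x) / subsample_size
        mean_sy = sum(sub_y) / subsample_size

        # Demeaning within each group removes the group-level term gamma * p_g.
        within_x.extend(x - mean_sx for x in sub_x)
        within_y.extend(y - mean_sy for y in sub_y)

    return ols_slope(eco_x, eco_y), ols_slope(within_x, within_y)


if __name__ == "__main__":
    eco, within = simulate()
    print(f"ecological slope:        {eco:.3f}")
    print(f"within-subsample slope:  {within:.3f}")
```

Under these assumptions the ecological slope lands near `beta + gamma` rather than `beta`, whereas the within-group slope from the subsample recovers a value close to `beta`. The sketch uses only one tenth of each group as the subsample, and it does not attempt to combine the two data sources or to optimise the subsampling design.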

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2801082
http://dx.doi.org/10.1111/j.1467-985X.2007.00511.x
