Data that include fine geographic information, such as census tract or street block identifiers, can be difficult to release as public use files. Fine geography provides information that ill-intentioned data users can use to identify individuals. We propose to release data with simulated geographies, so as to enable spatial analyses while reducing disclosure risks. We fit disease mapping models that predict areal-level counts from attributes in the file and sample new locations based on the estimated models. We illustrate this approach using data on causes of death in North Carolina, including evaluations of the disclosure risks and analytic validity that can result from releasing synthetic geographies.
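The two-step idea in the abstract (estimate how areal counts depend on record attributes, then draw new locations from the estimate) can be sketched in a few lines. This is only a toy illustration and not the authors' model: it swaps the disease mapping model, which would typically include spatial random effects, for additively smoothed counts within each attribute cell. The function names, the `alpha` smoothing parameter, the tract labels, and the records are all hypothetical.

```python
import random
from collections import Counter, defaultdict

def fit_area_probabilities(records, areas, alpha=1.0):
    """Estimate P(area | attributes) from smoothed counts per attribute cell.

    Assumption: additive smoothing stands in for a fitted disease mapping model.
    """
    counts = defaultdict(Counter)
    for attrs, area in records:
        counts[attrs][area] += 1
    probs = {}
    for attrs, cell in counts.items():
        total = sum(cell.values()) + alpha * len(areas)
        # Every area gets nonzero probability, so a synthetic record can land
        # in an area where no real record with these attributes was observed.
        probs[attrs] = [(cell[a] + alpha) / total for a in areas]
    return probs

def synthesize_areas(records, areas, probs, rng):
    """Keep each record's attributes; replace its area with a sampled one."""
    return [
        (attrs, rng.choices(areas, weights=probs[attrs], k=1)[0])
        for attrs, _ in records
    ]

# Hypothetical toy data: (attributes, area) pairs.
areas = ["tract_A", "tract_B", "tract_C"]
records = [
    (("65+", "heart disease"), "tract_A"),
    (("65+", "heart disease"), "tract_A"),
    (("65+", "heart disease"), "tract_B"),
    (("under 65", "cancer"), "tract_C"),
]

probs = fit_area_probabilities(records, areas)
synthetic = synthesize_areas(records, areas, probs, random.Random(0))
```

The synthetic file keeps every non-geographic attribute unchanged and replaces only the location. How much disclosure protection and analytic validity results depends on the model used, which is what the paper evaluates.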

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4008679
DOI: http://dx.doi.org/10.1002/sim.6078
