'Unmasking' masked address data: A medoid geocoding solution.

MethodsX

Center for Geospatial Sciences, School of Public Policy, University of California Riverside.

Published: February 2023

In recent years, there has been a consistent push for more open data initiatives, particularly for datasets collected by public agencies or groups that receive public funding. However, there is a tension between the release of open data and the preservation of individual and household privacy, whose balance shifts due to increased data availability, the sophistication of analysis techniques, and the computational power available to users. As a result, data masking is a standard tool used to preserve privacy. This is a process in which the data publishers obfuscate some identifying features in the dataset while attempting to maintain as much accuracy and precision as possible. For spatial datasets, the geocoding of administratively-masked data has been a consistent problem. Here, we present a medoid-based technique that geocodes masked data while minimizing the spatial uncertainty associated with the masking approach. Unfortunately, many commercial geocoding software packages either fail to geocode administratively-masked data or provide false positives by assigning points to city or street centroids. We demonstrate the results of our medoid-based geocoding approach by comparing it to commercial geocoding software. The results suggest that a medoid geocoding approach is mechanically simple to deploy and maximizes the spatial accuracy of the resulting geocodes.•Administratively-masked data are difficult to geocode•A medoid geocoding method maximizes geocoding accuracy•This method outperforms commercial geocoding software.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10006849PMC
http://dx.doi.org/10.1016/j.mex.2023.102090DOI Listing

Publication Analysis

Top Keywords

medoid geocoding
12
commercial geocoding
12
geocoding software
12
data
10
geocoding
9
open data
8
administratively-masked data
8
geocoding approach
8
'unmasking' masked
4
masked address
4

Similar Publications

In recent years, there has been a consistent push for more open data initiatives, particularly for datasets collected by public agencies or groups that receive public funding. However, there is a tension between the release of open data and the preservation of individual and household privacy, whose balance shifts due to increased data availability, the sophistication of analysis techniques, and the computational power available to users. As a result, data masking is a standard tool used to preserve privacy.

View Article and Find Full Text PDF

Ecoregionalization is the process by which a territory is classified in similar areas according to specific environmental and climatic factors. The climate and the environment strongly influence the presence and distribution of vectors responsible for significant human and animal diseases worldwide. In this paper, we developed a map of the eco-climatic regions of Italy adopting a data-driven spatial clustering approach using recent and detailed spatial data on climatic and environmental factors.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!