In recent years, there has been a consistent push for more open data initiatives, particularly for datasets collected by public agencies or groups that receive public funding. However, there is a tension between the release of open data and the preservation of individual and household privacy, whose balance shifts due to increased data availability, the sophistication of analysis techniques, and the computational power available to users. As a result, data masking is a standard tool used to preserve privacy. This is a process in which the data publishers obfuscate some identifying features in the dataset while attempting to maintain as much accuracy and precision as possible. For spatial datasets, the geocoding of administratively-masked data has been a consistent problem. Here, we present a medoid-based technique that geocodes masked data while minimizing the spatial uncertainty associated with the masking approach. Unfortunately, many commercial geocoding software packages either fail to geocode administratively-masked data or provide false positives by assigning points to city or street centroids. We demonstrate the results of our medoid-based geocoding approach by comparing it to commercial geocoding software. The results suggest that a medoid geocoding approach is mechanically simple to deploy and maximizes the spatial accuracy of the resulting geocodes.•Administratively-masked data are difficult to geocode•A medoid geocoding method maximizes geocoding accuracy•This method outperforms commercial geocoding software.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10006849 | PMC |
http://dx.doi.org/10.1016/j.mex.2023.102090 | DOI Listing |
MethodsX
February 2023
Center for Geospatial Sciences, School of Public Policy, University of California Riverside.
In recent years, there has been a consistent push for more open data initiatives, particularly for datasets collected by public agencies or groups that receive public funding. However, there is a tension between the release of open data and the preservation of individual and household privacy, whose balance shifts due to increased data availability, the sophistication of analysis techniques, and the computational power available to users. As a result, data masking is a standard tool used to preserve privacy.
View Article and Find Full Text PDFPLoS One
March 2020
Istituto Zooprofilattico Sperimentale dell'Abruzzo e del Molise G. Caporale, Campo Boario, Teramo, Italy.
Ecoregionalization is the process by which a territory is classified in similar areas according to specific environmental and climatic factors. The climate and the environment strongly influence the presence and distribution of vectors responsible for significant human and animal diseases worldwide. In this paper, we developed a map of the eco-climatic regions of Italy adopting a data-driven spatial clustering approach using recent and detailed spatial data on climatic and environmental factors.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!