A semantic framework to protect the privacy of electronic health records with non-numerical attributes.

J Biomed Inform

Department of Computer Science and Mathematics, Universitat Rovira i Virgili, Av. Països Catalans, 26, 43007 Tarragona, Catalonia, Spain.

Published: April 2013

Structured patient data like Electronic Health Records (EHRs) are a valuable source for clinical research. However, the sensitive nature of such information requires some anonymisation procedure to be applied before releasing the data to third parties. Several studies have shown that the removal of identifying attributes, like the Social Security Number, is not enough to obtain an anonymous data file, since unique combinations of other attributes as for example, rare diagnoses and personalised treatments, may lead to patient's identity disclosure. To tackle this problem, Statistical Disclosure Control (SDC) methods have been proposed to mask sensitive attributes while preserving, up to a certain degree, the utility of anonymised data. Most of these methods focus on continuous-scale numerical data. Considering that part of the clinical data found in EHRs is expressed with non-numerical attributes as for example, diagnoses, symptoms, procedures, etc., their application to EHRs produces far from optimal results. In this paper, we propose a general framework to enable the accurate application of SDC methods to non-numerical clinical data, with a focus on the preservation of semantics. To do so, we exploit structured medical knowledge bases like SNOMED CT to propose semantically-grounded operators to compare, aggregate and sort non-numerical terms. Our framework has been applied to several well-known SDC methods and evaluated using a real clinical dataset with non-numerical attributes. Results show that the exploitation of medical semantics produces anonymised datasets that better preserve the utility of EHRs.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.jbi.2012.11.005DOI Listing

Publication Analysis

Top Keywords

non-numerical attributes
12
sdc methods
12
electronic health
8
health records
8
attributes example
8
clinical data
8
data
7
attributes
6
non-numerical
5
semantic framework
4

Similar Publications

The studies of number sense in different species are severely hampered by the inevitable entanglement of non-numerical attributes inherent in nonsymbolic stimuli representing numerosity, resulting in contrasting theories of numerosity processing. Here, we developed an algorithm and associated analytical methods to generate stimuli that not only minimized the impact of non-numerical magnitudes in numerosity perception but also allowed their quantification. We trained number-naïve rats with these stimuli as sound pulses representing two or three numbers and demonstrated that their numerical discrimination ability mainly relied on numerosity.

View Article and Find Full Text PDF

Numerical cognition provides an opportunity to study the underlying processes of selective attention to numerical information in the face of conflicting, non-numerical, information of different magnitudes. For instance, in the numerical Stroop paradigm, participants are asked to judge pairs of Arabic digits whose physical size can either be congruent (e.g.

View Article and Find Full Text PDF

Numerosity perception is a key ability to guide behavior. However, current models propose that number units encode an abstract representation of numerosity regardless of the non-numerical attributes of the stimuli, suggesting rather coarse environmental tuning. Here we investigated whether numerosity systems spontaneously adapt to all visible items, or to subsets segregated by salient attributes such as color or pitch.

View Article and Find Full Text PDF

The representation of numbers in human adults is linked to space. In Western cultures, small and large numbers are associated respectively with the left and right sides of space. An influential framework attributes the emergence of these spatial-numerical associations (SNAs) to cultural factors such as the direction of reading and writing, because SNAs were found to be reduced or inverted in right-to-left readers/writers (e.

View Article and Find Full Text PDF

Both humans and non-human animals exhibit sensitivity to the approximate number of items in a visual array, as indexed by their performance in numerosity discrimination tasks, and even neonates can detect changes in numerosity. These findings are often interpreted as evidence for an innate 'number sense'. However, recent simulation work has challenged this view by showing that human-like sensitivity to numerosity can emerge in deep neural networks that build an internal model of the sensory data.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!