As a linkage using less informative identifiers could lead to linkage errors, it is essential to quantify the information associated to each identifier. The aim of this study was to estimate the discriminating power of different identifiers susceptible to be used in a record linkage process. This work showed the interest of three identifiers when linking data concerning a same patient using an automatic procedure based on the method proposed by Jaro; the date of birth, the first and the last names seemed to be the more appropriate identifiers. Including a poorly discriminating identifier like gender did not improve the results. Moreover, adding a second christian name, often missing, increased linkage errors. On the contrary, it seemed that using a phonetic treatment adapted to the French language could improve the results of linkage in comparison to the Soundex. However, whatever, the method used it seems necessary to improve the quality of identifier collection as it could greatly influence linkage results.

Download full-text PDF

Source
http://dx.doi.org/10.1080/14639230400005974DOI Listing

Publication Analysis

Top Keywords

linkage errors
8
linkage
6
best identifiers
4
identifiers record
4
record linkage?
4
linkage? linkage
4
linkage informative
4
identifiers
4
informative identifiers
4
identifiers lead
4

Similar Publications

The effect of HLA genotype on disease onset and severity in CTLA-4 insufficiency.

Front Immunol

January 2025

Institute for Immunodeficiency, Center for Chronic Immunodeficiency (CCI), Medical Center, Faculty of Medicine, University of Freiburg, Freiburg, Germany.

Introduction: Human Cytotoxic-T-lymphocyte-antigen-4 (CTLA-4) insufficiency caused by heterozygous germline mutations in is a complex immune dysregulation and immunodeficiency syndrome presenting with reduced penetrance and variable disease expressivity, suggesting the presence of disease modifiers that trigger the disease onset and severity. Various genetic and non-genetic potential triggers have been analyzed in CTLA-4 insufficiency cohorts, however, none of them have revealed a clear association to the disease. Multiple HLA haplotypes have been positively or negatively associated with various autoimmune diseases and inborn errors of immunity (IEI) due to the relevance of MHC in the strength of the T cell responses.

View Article and Find Full Text PDF

Introduction: In Germany, there has been no population-level pharmacoepidemiological study on the safety of the COVID-19 vaccines. One factor preventing such a study so far relates to challenges combining the different relevant data bodies on vaccination with suitable outcome data, specifically statutory health insurance claims data. Individual identifiers used across these data bodies are of unknown quality and reliability for data linkage.

View Article and Find Full Text PDF

Background: The rapid proliferation of artificial intelligence (AI) requires new approaches for human-AI interfaces that are different from classic human-computer interfaces. In developing a system that is conducive to the analysis and use of health big data (HBD), reflecting the empirical characteristics of users who have performed HBD analysis is the most crucial aspect to consider. Recently, human-centered design methodology, a field of user-centered design, has been expanded and is used not only to develop types of products but also technologies and services.

View Article and Find Full Text PDF

SEMdag: Fast learning of Directed Acyclic Graphs via node or layer ordering.

PLoS One

January 2025

Department of Brain and Behavioral Sciences, University of Pavia, Pavia, Italy.

A Directed Acyclic Graph (DAG) offers an easy approach to define causal structures among gathered nodes: causal linkages are represented by arrows between the variables, leading from cause to effect. Recently, industry and academics have paid close attention to DAG structure learning from observable data, and many techniques have been put out to address the problem. We provide a two-step approach, named SEMdag(), that can be used to quickly learn high-dimensional linear SEMs.

View Article and Find Full Text PDF

Background: There is no standardization within hand and upper-extremity surgery regarding which patient-reported outcome measures (PROMs) are collected and reported. This limits the ability to compare or combine cohorts that utilize different PROMs. The aim of this study was to develop a linkage model for the QuickDASH (shortened version of the Disabilities of the Arm, Shoulder and Hand) and PROMIS PF CAT (Patient-Reported Outcomes Measurement Information System Physical Function computerized adaptive testing) instruments to allow interconversion between these PROMs in a hand surgery population.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!