Objectives: According to European legislation, we must develop computer software allowing the linkage of medical records previously rendered anonymous. Some of them, like AUTOMATCH, are used in daily practice either to gather medical files in epidemiologic studies or for clinical purpose. In the first situation, the aim is to avoid homonymous errors, and in the second one, synonymous errors. The objective of this work is to study the effect of different parameters (number of identification variables, phonetic treatments of names, direct or probabilistic linkage procedure) on the reliability of the linkage in order to determine which strategy is the best according to the purpose of the linkage.

Methods: The assessment of the Burgundy Perinatal Network requires the linking of discharge abstracts of mothers and neonates, collected in all the hospitals of the region. Those data are used to compare direct and probabilistic linkage, using different parameterization strategies.

Results: If the linkage has to be performed in real time, so that no validation of indecisions generated by probabilistic linkage is possible, probabilistic linkage using three variables without any phonetic treatment seems to be the most appropriate approach, combined with a direct linkage using four variables applied to non-conclusive links. If a validation of indecisions is possible in an epidemiological study, probabilistic linkage using five variables, with a phonetic treatment adapted to the local language has to be preferred. For medical purpose, it should be combined with a direct linkage with four or five variables.

Conclusion: This paper reveals that the time and money available to manage indecision as well as the purpose of the linkage are of paramount importance for choosing a linkage strategy.

Download full-text PDF

Source

Publication Analysis

Top Keywords

probabilistic linkage
20
linkage
13
variables phonetic
12
linkage procedure
8
perinatal network
8
direct probabilistic
8
validation indecisions
8
phonetic treatment
8
combined direct
8
direct linkage
8

Similar Publications

The presence in ecological communities of unfeasible species interactions, termed forbidden links, due to physiological or morphological exploitation barriers has been long debated, but little direct evidence has been found. Forbidden links are likely to make ecological communities less robust to species extinctions, stressing the need to assess their prevalence. Here, we used a dataset of plant-hummingbird interactions, coupled with a Bayesian hierarchical model, to assess the importance of exploitation barriers in determining species interactions.

View Article and Find Full Text PDF

There is a paucity of disaggregated data to monitor cancer health inequalities in Canada. We used data linkage to estimate site-specific cancer relative survival by race, immigration status, household income, and education level in Canada. We pooled the Canadian Census Health and Environment Cohorts, which are linked datasets of 5.

View Article and Find Full Text PDF

Many population surveys do not provide information on respondents' residential addresses, instead offering coarse geographies like zip code or higher aggregations. However, fine resolution geography can be beneficial for characterizing neighbourhoods, especially for relatively rare populations such as immigrants. One way to obtain such information is to link survey records to records in auxiliary databases that include residential addresses by matching on variables common to both files.

View Article and Find Full Text PDF

Background: Due to advances in treatment, HIV is now a chronic condition with near-normal life expectancy. However, people with HIV continue to have a higher burden of mental and physical health conditions and are impacted by wider socioeconomic issues. Positive Voices is a nationally representative series of surveys of people with HIV in the United Kingdom.

View Article and Find Full Text PDF

Transcriptome-Wide Association Study of Metabolic Dysfunction-Associated Steatotic Liver Disease Identifies Relevant Gene Signatures.

Turk J Gastroenterol

December 2024

Department of Emergency Medicine, Shandong University, Qilu Hospital (Qingdao), Cheeloo College of Medicine, Qingdao, China.

Metabolic dysfunction-associated steatotic liver disease (MASLD) is considered the most widespread chronic liver condition globally. Genome-wide association studies (GWAS) have pinpointed several genetic loci correlated to MASLD, yet the biological significance of these loci remains poorly understood. Initially, we applied Functional Mapping and Annotation (FUMA) to conduct a functional annotation of the MASLD GWAS summary statistics, which included data from 3242 cases and 707 631 controls.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!