Record linkage: making the most out of errors in linking variables.

AMIA Annu Symp Proc

Dept. of Medical Informatics, Academic Medical Center, University of Amsterdam, P.O. Box 22700, 1100 DE Amsterdam, the Netherlands.

Published: September 2007

Article Abstract

This paper presents a refinement of the probabilistic medical record linking algorithm. We introduced "close agreement" to account for typical errors in administrative variables used for record linkage. Linking data on early pregnancy determinants with data on late child outcomes was used as a case study. We analyzed whether adding close agreement gave the linking key a higher discriminating power, as reflected in a reduction of the number of links with an uncertain linking status. Incorporating close agreement for postal code and date of birth in the record linking algorithm reduced the number of pairs in the uncertain region by 95%. We showed that adding a third outcome, "close", when comparing values of corresponding linking variables led to a major improvement in our probabilistic record linkage study. Similar improvements are likely in other studies because the frequency, nature, and type of errors in other large databases will not be substantially different.
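
The idea of a third comparison outcome can be sketched in code. The sketch below is a minimal illustration in the usual log-likelihood-ratio style of probabilistic linkage, not the authors' implementation. Everything specific in it is an assumption: the m- and u-probabilities are invented for illustration, the definition of "close" (equal length, exactly one differing character) is a stand-in because the abstract does not define it, and the names `PROBS`, `compare`, `weight`, and `link_score` are hypothetical.

```python
from math import log2

# Hypothetical probabilities per variable and outcome (illustrative only,
# NOT taken from the paper): (m, u) where
#   m = P(outcome | the two records belong to the same person)
#   u = P(outcome | the two records belong to different persons)
PROBS = {
    "postal_code": {
        "agree": (0.90, 0.001),
        "close": (0.08, 0.010),
        "disagree": (0.02, 0.989),
    },
    "date_of_birth": {
        "agree": (0.92, 0.003),
        "close": (0.06, 0.030),
        "disagree": (0.02, 0.967),
    },
}


def is_close(a: str, b: str) -> bool:
    """Assumed definition of 'close': same length, exactly one character differs."""
    return len(a) == len(b) and sum(x != y for x, y in zip(a, b)) == 1


def compare(a: str, b: str) -> str:
    """Classify a pair of values as 'agree', 'close', or 'disagree'."""
    if a == b:
        return "agree"
    if is_close(a, b):
        return "close"
    return "disagree"


def weight(variable: str, outcome: str) -> float:
    """Log2 likelihood ratio contributed by one variable's comparison outcome."""
    m, u = PROBS[variable][outcome]
    return log2(m / u)


def link_score(rec_a: dict, rec_b: dict) -> float:
    """Sum the per-variable weights over all linking variables."""
    return sum(weight(v, compare(rec_a[v], rec_b[v])) for v in PROBS)
```

With these made-up numbers, a pair whose postal codes differ in a single character receives a positive weight for that variable instead of the strongly negative "disagree" weight. This is the mechanism by which a "close" outcome could move true matches with small typing errors out of the uncertain region.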

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1839331PMC

Publication Analysis

Top Keywords

record linkage: 12
linking variables: 8
record linking: 8
linking algorithm: 8
close agreement: 8
linking: 7
record: 5
linkage making: 4
making errors: 4
errors linking: 4

