Purpose: In Germany, record linkage of claims and cancer registry data is costly and time-consuming because, until recently, no unique personal identifier was available in both data sources. The aim of this study was to evaluate the feasibility and performance of a deterministic linkage procedure based on indirect personal identifiers included in both data sources.

Methods: We identified users of glucose-lowering drugs residing in four federal states in Northern and Southern Germany (Bavaria, Bremen, Hamburg, Lower Saxony) in the German Pharmacoepidemiological Research Database (GePaRD) and assessed colorectal and thyroid cancer cases. The cancer registries of these federal states selected all colorectal and thyroid cancer cases diagnosed between 2004 and 2015. A deterministic linkage was performed based on indirect personal identifiers such as year of birth, sex, area of residence, and type of cancer, together with an absolute difference of at most 90 days between the dates of cancer diagnosis in the two data sources. Results were compared to a probabilistic linkage using "direct" personal identifiers (gold standard).
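The deterministic rule described in the Methods can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the record layout, the field names, and the `link_records` helper are hypothetical and not taken from the study, and the sketch makes no attempt to resolve cases where one claims record matches several registry records.

```python
from collections import defaultdict
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class CancerRecord:
    """One cancer case; all field names are hypothetical."""
    record_id: str
    birth_year: int
    sex: str
    region: str          # area of residence
    cancer_type: str     # e.g. "colorectal" or "thyroid"
    diagnosis_date: date


def blocking_key(rec):
    """Indirect identifiers that must agree exactly."""
    return (rec.birth_year, rec.sex, rec.region, rec.cancer_type)


def link_records(claims, registry, max_days=90):
    """Return (claims_id, registry_id) pairs that agree on all indirect
    identifiers and whose diagnosis dates differ by at most max_days."""
    # Index registry records by their exact-match key.
    by_key = defaultdict(list)
    for reg in registry:
        by_key[blocking_key(reg)].append(reg)

    links = []
    for claim in claims:
        for reg in by_key.get(blocking_key(claim), []):
            gap = abs((claim.diagnosis_date - reg.diagnosis_date).days)
            if gap <= max_days:
                links.append((claim.record_id, reg.record_id))
    return links
```

Indexing the registry by the exact-match key first keeps the date comparison limited to candidates that already agree on year of birth, sex, area of residence, and cancer type.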

Results: The deterministic linkage procedure yielded a sensitivity of 71.8% for colorectal cancer and 66.6% for thyroid cancer. For thyroid cancer, the sensitivity improved to 71.4% when only inpatient diagnoses were used to define cancer in GePaRD. Specificity was always above 99%. Using the probabilistic linkage to define cancer cases, the risk for colorectal cancer was estimated to be 10 percentage points lower than when using the deterministic approach.
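For readers less familiar with these measures, the following sketch shows one common way to compute sensitivity and specificity of a deterministic linkage against a gold-standard linkage. It assumes each person is classified as linked or not linked by each method; the function name and the set-based inputs are hypothetical, and the abstract does not state the exact definitions the authors used.

```python
def sensitivity_specificity(deterministic, gold, population):
    """Compare person IDs linked deterministically with those linked by
    the gold standard, within a given population of person IDs."""
    population = set(population)
    det = set(deterministic) & population
    ref = set(gold) & population

    tp = len(det & ref)               # linked by both methods
    fn = len(ref - det)               # missed by the deterministic rule
    fp = len(det - ref)               # linked only by the deterministic rule
    tn = len(population - det - ref)  # linked by neither method

    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one gold-standard case and one non-case")
    return tp / (tp + fn), tn / (tn + fp)
```

With this definition, sensitivity is the share of gold-standard cases that the deterministic rule also finds, and specificity is the share of gold-standard non-cases that it correctly leaves unlinked.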

Conclusions: The sensitivity of the deterministic linkage approach appears to be too low for it to be considered a reasonable alternative to the probabilistic linkage procedure.

Source
http://dx.doi.org/10.1002/pds.5545

Similar Publications

Objectives: To examine a comprehensive monitoring framework for health inequalities in Japan, this study aimed to quantify educational inequalities in mortality and its regional variations, which are widely used internationally as outcome measures of health inequalities.

Methods: Individual data were obtained from the 2010 Population Census and Vital Statistics death records (2010-2015). We used the combination of "sex," "birth month/year," "municipality of residence," "marital status," and "age of spouse (married individuals only)" as a linkage key.

Objectives: Concurrence of pregnancy and cancer diagnosis is increasingly frequent in Italy. The study aimed to compare women with pregnancy-associated cancers (PACs) to those of childbearing age, focusing on fertility, induced abortion, and miscarriage.

Methods: The population-based study included women aged 15-49 years, both with and without PAC, who were residents in the area covered by the 19 participating Cancer Registries between 2003 and 2015 and identified by individual deterministic linkage with the Hospital Discharge Database.

Process and validity of linking cystic fibrosis patient registry with national Medicaid databases.

J Cyst Fibros

November 2024

Rutgers University Institute for Health, Health Care Policy and Aging Research, New Brunswick, New Jersey, USA; Department of Medicine, Rutgers Robert Wood Johnson Medical School, Piscataway, New Jersey, USA; Department of Biostatistics and Epidemiology, Rutgers School of Public Health, Piscataway, New Jersey, USA. Electronic address:

Article Synopsis
  • A matching algorithm was used to connect 10,616 individuals with CF from CFFPR to Medicaid data, revealing that those linked had significantly higher outpatient visits and antibiotic prescriptions than reported in the CFFPR alone.
  • The study found high costs associated with CF treatment in North Carolina, with pharmacy costs reaching $16.4 million and non-pharmacy costs totaling $7.5 million, highlighting the potential for comprehensive analysis of healthcare utilization and costs for low-income CF patients through linked data.

HIV and SARS-CoV-2 Coinfections in Brazil in 2020: Epidemiological, Sociodemographic, and Clinical Characteristics of 36,746 Cases.

Rev Soc Bras Med Trop

November 2024

Universidade de São Paulo, Faculdade de Medicina Veterinária e Zootecnia, Programa de Pós-Graduação em Epidemiologia e Saúde Única, São Paulo, SP, Brasil.

Background: This study aimed to identify COVID-19 cases among people living with HIV (PLWH) in Brazil in 2020, describe their clinical, sociodemographic, and epidemiological profiles, and evaluate the factors associated with disease severity.

Methods: This cross-sectional study used secondary data obtained from the Brazilian healthcare system. Probabilistic and deterministic data linkage methods were used to identify coinfected patients.

Background: Large-scale clinical databases containing routinely collected electronic health record (EHR) data are a valuable source of information for research studies. For example, they can be used in pharmacoepidemiology studies to evaluate the effects of maternal medication exposure on neonatal and pediatric outcomes. Yet, this type of study is infeasible without proper mother-child linkage.
