Purpose: In Germany, record linkage of claims and cancer registry data is cost- and time-consuming, since up until recently no unique personal identifier was available in both data sources. The aim of this study was to evaluate the feasibility and performance of a deterministic linkage procedure based on indirect personal identifiers included in the data sources.
Methods: We identified users of glucose-lowering drugs with residence in four federal states in Northern and Southern Germany (Bavaria, Bremen, Hamburg, Lower Saxony) in the German Pharmacoepidemiological Research Database (GePaRD) and assessed colorectal and thyroid cancer cases. Cancer registries of the federal states selected all colorectal and thyroid cancer cases between 2004 and 2015. A deterministic linkage approach was performed based on indirect personal identifiers such as year of birth, sex, area of residence, type of cancer and an absolute difference between the dates of cancer diagnosis in both data sources of at most 90 days. Results were compared to a probabilistic linkage using "direct" personal identifiers (gold standard).
Results: The deterministic linkage procedure yielded a sensitivity of 71.8% for colorectal cancer and 66.6% for thyroid cancer. For thyroid cancer, the sensitivity improved when using only inpatient diagnosis to define cancer in GePaRD (71.4%). Specificity was always above 99%. Using the probabilistic linkage to define cancer cases, the risk for colorectal cancer was estimated 10 percentage points lower than when using the deterministic approach.
Conclusions: Sensitivity of the deterministic linkage approach appears to be too low to be considered as reasonable alternative to the probabilistic linkage procedure.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1002/pds.5545 | DOI Listing |
Objectives To examine a comprehensive monitoring framework for health inequalities in Japan, this study aimed to quantify educational inequalities in mortality and its regional variations, which are widely used internationally as outcome measures of health inequalities.Methods Individual data were obtained from the 2010 Population Census and Vital Statistics death records (2010-2015). We used the combination of "sex," "birth month/year," "municipality of residence," "marital status," and "age of spouse (married individuals only)" as a linkage key.
View Article and Find Full Text PDFTumori
December 2024
National Centre for Disease Prevention and Health Promotion, Italian National Institute of Health, Rome, Italy.
Objectives: Concurrence of pregnancy and cancer diagnosis is increasingly frequent in Italy. The study aimed to compare women with pregnancy-associated cancers (PACs) to those of childbearing age, focusing on fertility, induced abortion, and miscarriage.
Methods: The population-based study included women aged 15-49 years, both with and without PAC, who were residents in the area covered by the 19 participating Cancer Registries between 2003 and 2015 and identified by individual deterministic linkage with the Hospital Discharge Database.
J Cyst Fibros
November 2024
Rutgers University Institute for Health, Health Care Policy and Aging Research, New Brunswick, New Jersey, USA; Department of Medicine, Rutgers Robert Wood Johnson Medical School, Piscataway, New Jersey, USA; Department of Biostatistics and Epidemiology, Rutgers School of Public Health, Piscataway, New Jersey, USA. Electronic address:
Rev Soc Bras Med Trop
November 2024
Universidade de São Paulo, Faculdade de Medicina Veterinária e Zootecnia, Programa de Pós-Graduação em Epidemiologia e Saúde Única, São Paulo, SP, Brasil.
Background: This study aimed to identify COVID-19 cases among people living with HIV (PLWH) in Brazil in 2020, describe their clinical, sociodemographic, and epidemiological profiles, and evaluate the factors associated with disease severity.
Methods: This cross-sectional study used secondary data obtained from the Brazilian healthcare system. Probabilistic and deterministic data linkage methods were used to identify coinfected patients.
J Biomed Inform
November 2024
Fundació Institut Universitari per a la Recerca a l'Atenció Primària de Salut Jordi Gol i Gurina (IDIAPJGol), Gran Via de les Corts Catalanes, 587, Barcelona 08007, Catalonia, Spain.
Background: Large-scale clinical databases containing routinely collected electronic health records (EHRs) data are a valuable source of information for research studies. For example, they can be used in pharmacoepidemiology studies to evaluate the effects of maternal medication exposure on neonatal and pediatric outcomes. Yet, this type of studies is infeasible without proper mother-child linkage.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!