Detection of missed deaths in cancer registry data to reduce bias in long-term survival estimation.

Front Oncol

German Center for Cancer Registry Data, Department for Health Monitoring, Robert Koch-Institute, Berlin, Germany.

Published: March 2023

AI Article Synopsis

  • Population-based cancer survival estimates help understand the effectiveness of healthcare interventions, but they can be biased due to incomplete data on cancer incidences and deaths.
  • This report introduces a new method to identify "missed deaths" in cancer registry data to ensure that long-term survival rates are accurately reflective of the general population's survival.
  • Analysis of cancer data from German and SEER registries showed that a small percentage of deaths were missed, which significantly lowered the estimated long-term survival rates for cancer patients when accounted for.

Article Abstract

Background: Population-based cancer survival estimates can provide insight into the real-world impacts of healthcare interventions and preventive services. However, estimation of survival rates obtained from population-based cancer registries can be biased due to missed incidence or incomplete vital status data. Long-term survival estimates in particular are prone to overestimation, since the proportion of deaths that are missed, for example through unregistered emigration, increases with follow-up time. This also applies to registry-based long-term prevalence estimates. The aim of this report is to introduce a method to detect missed deaths within cancer registry data such that long-term survival of cancer patients does not exceed survival in the general population.

Methods: We analyzed data from 15 German epidemiologic cancer registries covering the years 1970-2016 and from Surveillance, Epidemiology, and End Results (SEER)-18 registries covering 1975-2015. The method is based on comparing survival times until exit (death or follow-up end) and ages at exit between deceased patients and surviving patients, stratified by diagnosis group, sex, age group and stage. Deceased patients with both follow-up time and age at exit in the highest percentile were regarded as outliers and used to fit a logistic regression. The regression was then used to classify each surviving patient as a survivor or a missed death. The procedure was repeated for lower percentile thresholds regarding deceased persons until long-term survival rates no longer exceeded the survival rates in the general population.

Results: For the German cancer registry data, 0.9% of total deaths were classified as having been missed. Excluding these missed deaths reduced 20-year relative survival estimates for all cancers combined from 140% to 51%. For the whites in SEER data, classified missed deaths amounted to 0.02% of total deaths, resulting in 0.4 percent points lower 20-year relative survival rate for all cancers combined.

Conclusion: The method described here classified a relatively small proportion of missed deaths yet reduced long-term survival estimates to more plausible levels. The effects of missed deaths should be considered when calculating long-term survival or prevalence estimates.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10034313PMC
http://dx.doi.org/10.3389/fonc.2023.1088657DOI Listing

Publication Analysis

Top Keywords

missed deaths
24
long-term survival
24
survival estimates
16
survival
13
cancer registry
12
registry data
12
survival rates
12
deaths
9
missed
9
deaths cancer
8

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!