Considerations for using population frequency data in germline variant interpretation: Cancer syndrome genes as a model.

Aimee L Davidson Conrad Leonard Lambros T Koufariotis Michael T Parsons Georgina E Hollway John V Pearson Felicity Newell Nicola Waddell Amanda B Spurdle

Hum Mutat

Genetics & Computational Biology, QIMR Berghofer Medical Research Institute, Herston, Queensland, Australia.

Published: May 2021

Aggregate population genomics data from large cohorts are vital for assessing germline variant pathogenicity. However, there are no specifications on how sequencing quality metrics should be considered, and whether exome-derived and genome-derived allele frequencies should be considered in isolation. Germline genome sequence data were simulated for nine read-depths to identify a minimum acceptable read-depth for detecting variants. gnomAD exome-derived and genome-derived datasets were assessed for read-depth, for six key cancer genes selected for variant curation by ClinGen expert panels. Non-Finnish European allele frequency (AF) or filter AF of coding variants in these genes, assigned into frequency bins using modified ACMG-AMP criteria, was compared between exome-derived and genome-derived datasets. A 30X read-depth achieved acceptable precision and recall for detection of substitutions, but poor recall for small insertions/deletions. Exome-derived and genome-derived datasets exhibited low read-depth for different gene exons. Individual variants were mostly assigned to non-divergent AF bins (>95%) or filter AF bins (>97%). Two major bin divergences were resolved by applying the minimal acceptable read-depth threshold. These findings show the importance of assessing read-depth separately for population datasets sourced from different short-read sequencing technologies before assigning a frequency-based ACMG-AMP classification code for variant interpretation.

Download full-text PDF	Source
http://dx.doi.org/10.1002/humu.24183	DOI Listing

Publication Analysis

Top Keywords

exome-derived genome-derived

genome-derived datasets

germline variant

variant interpretation

acceptable read-depth

read-depth

considerations population

population frequency

frequency data

data germline

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!