Objective: To evaluate the benefit of combining polygenic risk scores with the QCancer-10 (colorectal cancer) prediction model for non-genetic risk to identify people at highest risk of colorectal cancer.

Design: Population based cohort study.

Setting: Data from the UK Biobank study, collected between March 2006 and July 2010.

Participants: 434 587 individuals with complete data for genetics and QCancer-10 predictions were included in the QCancer-10 plus polygenic risk score modelling and validation cohorts.

Main Outcome Measures: Prediction of colorectal cancer diagnosis by genetic, non-genetic, and combined risk models. Using data from UK Biobank, six different polygenic risk scores for colorectal cancer were developed using LDpred2 polygenic risk score software, clumping and thresholding approaches, and a model based on genome-wide significant polymorphisms. The top performing genome-wide polygenic risk score and the score containing genome-wide significant polymorphisms were each combined with QCancer-10, and performance was compared with QCancer-10 alone. Case-control (logistic regression) and time-to-event (Cox proportional hazards) analyses were used to evaluate risk model performance in men and women.

Results: Polygenic risk scores derived using the LDpred2 program performed best, with an odds ratio per standard deviation of 1.584 (95% confidence interval 1.536 to 1.633), and top age and sex adjusted C statistic of 0.733 (95% confidence interval 0.710 to 0.753) in logistic regression models in the validation cohort. Integrated QCancer-10 plus polygenic risk score models outperformed QCancer-10 alone. In men, the integrated LDpred2 model produced a C statistic of 0.730 (0.720 to 0.741) and explained variation of 28.2% (26.3 to 30.1), compared with 0.693 (0.682 to 0.704) and 21.0% (18.9 to 23.1) for QCancer-10 alone. In women, the C statistic for the integrated LDpred2 model was 0.687 (0.673 to 0.702) and explained variation was 21.0% (18.7 to 23.7), compared with 0.645 (0.631 to 0.659) and 12.4% (10.3 to 14.6) for QCancer-10 alone. In the top 20% of individuals at highest absolute risk, the sensitivity and specificity of the integrated LDpred2 models for predicting colorectal cancer diagnosis were 47.8% and 80.3%, respectively, in men, and 42.7% and 80.1%, respectively, in women. Absolute risk in the top 5% of risk was 3.47-fold higher in men and 2.77-fold higher in women compared with the median. Illustrative decision curve analysis indicated a small incremental improvement in net benefit with QCancer-10 plus polygenic risk score models compared with QCancer-10 alone.
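The performance measures reported above (C statistic, sensitivity and specificity in a top-risk fraction, and net benefit from decision curve analysis) can be illustrated with a minimal sketch. This is not the study's analysis code: the function names and the toy data are hypothetical, the C statistic shown is the simple case-control (pairwise concordance) form rather than a time-to-event version, and the net benefit formula is the standard one used in decision curve analysis, which the abstract does not spell out.

```python
from typing import List, Sequence, Tuple


def c_statistic(risks: Sequence[float], outcomes: Sequence[int]) -> float:
    """Share of case/non-case pairs where the case has the higher predicted
    risk; tied pairs count as half."""
    cases = [r for r, y in zip(risks, outcomes) if y == 1]
    controls = [r for r, y in zip(risks, outcomes) if y == 0]
    if not cases or not controls:
        raise ValueError("need at least one case and one non-case")
    concordant = 0.0
    for case_risk in cases:
        for control_risk in controls:
            if case_risk > control_risk:
                concordant += 1.0
            elif case_risk == control_risk:
                concordant += 0.5
    return concordant / (len(cases) * len(controls))


def flag_top_fraction(risks: Sequence[float], fraction: float) -> List[bool]:
    """Flag the highest-risk `fraction` of individuals (rounded down, at least one)."""
    n_flag = max(1, int(len(risks) * fraction))
    order = sorted(range(len(risks)), key=lambda i: risks[i], reverse=True)
    flagged = [False] * len(risks)
    for i in order[:n_flag]:
        flagged[i] = True
    return flagged


def confusion_counts(flagged: Sequence[bool], outcomes: Sequence[int]) -> Tuple[int, int, int, int]:
    """Return (true positives, false positives, false negatives, true negatives)."""
    tp = sum(1 for f, y in zip(flagged, outcomes) if f and y == 1)
    fp = sum(1 for f, y in zip(flagged, outcomes) if f and y == 0)
    fn = sum(1 for f, y in zip(flagged, outcomes) if not f and y == 1)
    tn = sum(1 for f, y in zip(flagged, outcomes) if not f and y == 0)
    return tp, fp, fn, tn


def sensitivity_specificity(flagged: Sequence[bool], outcomes: Sequence[int]) -> Tuple[float, float]:
    """Sensitivity = TP / (TP + FN); specificity = TN / (TN + FP)."""
    tp, fp, fn, tn = confusion_counts(flagged, outcomes)
    return tp / (tp + fn), tn / (tn + fp)


def net_benefit(flagged: Sequence[bool], outcomes: Sequence[int], threshold: float) -> float:
    """Net benefit at a risk threshold p_t: TP/n - FP/n * p_t / (1 - p_t)."""
    tp, fp, _, _ = confusion_counts(flagged, outcomes)
    n = len(outcomes)
    return tp / n - (fp / n) * threshold / (1.0 - threshold)


# Hypothetical toy data: ten people, two of whom are diagnosed.
toy_risks = [0.01, 0.02, 0.03, 0.04, 0.05, 0.06, 0.07, 0.08, 0.09, 0.10]
toy_outcomes = [0, 0, 0, 0, 1, 0, 0, 0, 0, 1]

toy_flagged = flag_top_fraction(toy_risks, 0.20)
print(c_statistic(toy_risks, toy_outcomes))
print(sensitivity_specificity(toy_flagged, toy_outcomes))
print(net_benefit(toy_flagged, toy_outcomes, threshold=0.20))
```

Flagging a fixed top fraction, as in the top 20% result above, fixes the proportion of people flagged in advance, so specificity is largely determined by that fraction when the outcome is rare; the comparison between models then rests mainly on sensitivity.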

Conclusions: Integrating polygenic risk scores with QCancer-10 modestly improves risk prediction over use of QCancer-10 alone. Given that QCancer-10 data can be obtained relatively easily from health records, use of polygenic risk scores in risk stratified population screening for colorectal cancer currently has no clear justification. The added benefit, cost effectiveness, and acceptability of polygenic risk scores should be carefully evaluated in a real-life screening setting before implementation in the general population.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9644277
DOI: http://dx.doi.org/10.1136/bmj-2022-071707

Similar Publications

Importance: Disease characteristics of genetically mediated coronary artery disease (CAD) on coronary angiography and the association of genomic risk with outcomes after coronary angiography are not well understood.

Objective: To assess the angiographic characteristics and risk of post-coronary angiography outcomes of patients with genomic drivers of CAD: familial hypercholesterolemia (FH), high polygenic risk score (PRS), and clonal hematopoiesis of indeterminate potential (CHIP).

Design, Setting, And Participants: A retrospective cohort study of 3518 Mass General Brigham Biobank participants with genomic information who underwent coronary angiography was conducted between July 18, 2000, and August 1, 2023.

Application of the 6-SNP elevated LDL-cholesterol polygenic risk score in individuals with familial hypercholesterolemia phenotype from an Argentine population.

Gac Med Mex

January 2025

Universidad de Buenos Aires, Facultad de Farmacia y Bioquímica, Departamento de Bioquímica Clínica, Laboratorio de Lípidos y Aterosclerosis, Ciudad Autónoma de Buenos Aires.

Introduction: LDL-cholesterol greater than 190 mg/dL indicates severe hypercholesterolemia (HS) of monogenic and/or polygenic origin. Genetic risk scores (GRS) evaluate potential polygenic causes.

Objective: We applied a 6-SNP GRS (GRS-6) in individuals with HS.

Distinct genetic risk loci between biopsy-proven renal and non-renal lupus: a 10-year longitudinal cohort.

Rheumatology (Oxford)

January 2025

Division of Rheumatology & Clinical Immunology, Department of Medicine, Queen Mary Hospital, The University of Hong Kong, Hong Kong.

Objective: Systemic lupus erythematosus (SLE) is a heterogeneous disease which manifests as different subphenotypes. Distinct subphenotypes, such as lupus nephritis (LN), have been associated with increased genetic risk, but prior studies have been limited by cross-sectional and imprecisely subphenotyped cohorts. This study investigated the genetic basis for LN using a longitudinal cohort of distinctly subphenotyped patients.

The growing availability of pre-trained polygenic risk score (PRS) models has enabled their integration into real-world applications, reducing the need for extensive data labeling, training, and calibration. However, selecting the most suitable PRS model for a specific target population remains challenging because of limited transferability, data heterogeneity, and the scarcity of observed phenotypes in real-world settings. Ensemble learning offers a promising avenue to enhance the predictive accuracy of genetic risk assessments, but most existing methods rely on observed phenotype data or additional genome-wide association studies (GWAS) from the target population to optimize ensemble weights, limiting their utility in real-time implementation.
