We aimed to identify potential novel predictors for breast cancer among post-menopausal women, with pre-specified interest in the role of polygenic risk scores (PRS) for risk prediction. We utilised an analysis pipeline where machine learning was used for feature selection, prior to risk prediction by classical statistical models. An "extreme gradient boosting" (XGBoost) machine with Shapley feature-importance measures were used for feature selection among [Formula: see text] 1.7 k features in 104,313 post-menopausal women from the UK Biobank. We constructed and compared the "augmented" Cox model (incorporating the two PRS, known and novel predictors) with a "baseline" Cox model (incorporating the two PRS and known predictors) for risk prediction. Both of the two PRS were significant in the augmented Cox model ([Formula: see text]). XGBoost identified 10 novel features, among which five showed significant associations with post-menopausal breast cancer: plasma urea (HR = 0.95, 95% CI 0.92-0.98, [Formula: see text]), plasma phosphate (HR = 0.68, 95% CI 0.53-0.88, [Formula: see text]), basal metabolic rate (HR = 1.17, 95% CI 1.11-1.24, [Formula: see text]), red blood cell count (HR = 1.21, 95% CI 1.08-1.35, [Formula: see text]), and creatinine in urine (HR = 1.05, 95% CI 1.01-1.09, [Formula: see text]). Risk discrimination was maintained in the augmented Cox model, yielding C-index 0.673 vs 0.667 (baseline Cox model) with the training data and 0.665 vs 0.664 with the test data. We identified blood/urine biomarkers as potential novel predictors for post-menopausal breast cancer. Our findings provide new insights to breast cancer risk. Future research should validate novel predictors, investigate using multiple PRS and more precise anthropometry measures for better breast cancer risk prediction.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10247810PMC
http://dx.doi.org/10.1038/s41598-023-36214-0DOI Listing

Publication Analysis

Top Keywords

[formula text]
28
breast cancer
24
cox model
20
novel predictors
16
risk prediction
16
post-menopausal breast
12
machine learning
8
potential novel
8
post-menopausal women
8
feature selection
8

Similar Publications

Measurement and spectral analysis of medical shock wave parameters based on flexible PVDF sensors.

Phys Eng Sci Med

January 2025

School of Biological Science and Medical Engineering, Beihang University, 37 Xueyuan Road, Haidian District, Beijing, 100191, China.

Extracorporeal shock wave therapy (ESWT) achieves its therapeutic purpose mainly through the biological effects produced by the interaction of shock waves with tissues, and the accurate measurement and calculation of the mechanical parameters of shock waves in tissues are of great significance in formulating the therapeutic strategy and evaluating the therapeutic effect. This study utilizes the approach of implanting flexible polyvinylidene fluoride (PVDF) vibration sensors inside the tissue-mimicking phantom of various thicknesses to capture waveforms at different depths during the impact process in real time. Parameters including positive and negative pressure changes (P, P), pulse wave rise time ([Formula: see text]), and energy flux density (EFD) are calculated, and frequency spectrum analysis of the waveforms is conducted.

View Article and Find Full Text PDF

Electrochemical capacitance-based aptasensor for HER2 detection.

Biomed Microdevices

January 2025

Department of Physics, Faculty of Philosophy, Science and Letter, University of São Paulo, Ribeirão Preto, SP, 14040-901, Brazil.

The overexpression of Human Epidermal Growth Factor Receptor 2 (HER2) protein is specifically related to tumor cell proliferation in breast cancers. Its presence in biological serum samples indicates presence or progression of cancer, becoming a promise biomarker. However, their detection needs a simple and high accuracy platform.

View Article and Find Full Text PDF

A measurement of the dijet production cross section is reported based on proton-proton collision data collected in 2016 at by the CMS experiment at the CERN LHC, corresponding to an integrated luminosity of up to 36.3 . Jets are reconstructed with the anti- algorithm for distance parameters of and 0.

View Article and Find Full Text PDF

Background: The workplace is an important determinant of health that people are exposed to for the first-time during adolescence or early adulthood. This study investigates how diet, physical activity, and sleep change as people aged 16-30 years transition into work and whether this varies for different individuals and job types.

Methods: Multilevel linear regression models assessed changes in fruit and vegetable intake, sleep duration, and physical activity among 3,302 UK Household Longitudinal Study (UKHLS) participants aged 16-30 years, who started work for the first time between 2015 and 2023.

View Article and Find Full Text PDF

Penalized landmark supermodels (penLM) for dynamic prediction for time-to-event outcomes in high-dimensional data.

BMC Med Res Methodol

January 2025

Quantitative Sciences Unit, Department of Medicine, Stanford University School of Medicine, 3180 Porter Drive, Office 118, Stanford, CA, 94304, USA.

Background: To effectively monitor long-term outcomes among cancer patients, it is critical to accurately assess patients' dynamic prognosis, which often involves utilizing multiple data sources (e.g., tumor registries, treatment histories, and patient-reported outcomes).

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!