Regression models such as the Cox proportional hazards model have had increasing use in modelling and estimating the prognosis of patients with a variety of diseases. Many applications involve a large number of variables to be modelled using a relatively small patient sample. Problems of overfitting and of identifying important covariates are exacerbated in analysing prognosis because the accuracy of a model is more a function of the number of events than of the sample size. We used a general index of predictive discrimination to measure the ability of a model developed on training samples of varying sizes to predict survival in an independent test sample of patients suspected of having coronary artery disease. We compared three methods of model fitting: (1) standard 'step-up' variable selection, (2) incomplete principal components regression, and (3) Cox model regression after developing clinical indices from variable clusters. We found regression using principal components to offer superior predictions in the test sample, whereas regression using indices offers easily interpretable models nearly as good as the principal components models. Standard variable selection has a number of deficiencies.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1002/sim.4780030207 | DOI Listing |
Environ Sci Pollut Res Int
January 2025
Institute of Environment and Sustainable Development, Banaras Hindu University, Varanasi, 221005, India.
Surface water chemistry of the River Ganga at Varanasi was analyzed at 10 locations over 3 years (2019-2021) across pre-monsoon, monsoon, and post-monsoon seasons. The study aimed to assess water parameters using principal component analysis (PCA), calculate the water quality index (WQI), determine processes governing water chemistry, evaluate irrigation suitability, and estimate non-carcinogenic health risks. The physical parameters measured included pH (8.
View Article and Find Full Text PDFBMC Pulm Med
January 2025
Universal Scientific Education and Research Network (USERN), Tehran, Iran.
Objective: Lung cancer (LC), the primary cause for cancer-related death globally is a diverse illness with various characteristics. Saliva is a readily available biofluid and a rich source of miRNA. It can be collected non-invasively as well as transported and stored easily.
View Article and Find Full Text PDFSci Rep
January 2025
Research Unit of Health Sciences and Technology, University of Oulu, Oulu, Finland.
Optical techniques, such as functional near-infrared spectroscopy (fNIRS), contain high potential for the development of non-invasive wearable systems for evaluating cerebral vascular condition in aging, due to their portability and ability to monitor real-time changes in cerebral hemodynamics. In this study, thirty-six healthy adults were measured by single channel fNIRS to explore differences between two age groups using machine learning (ML). The subjects, measured during functional magnetic resonance imaging (fMRI) at Oulu University Hospital, were divided into young (age ≤ 32) and elderly (age ≥ 57) groups.
View Article and Find Full Text PDFFood Chem
January 2025
Engineering Center of Genetic Breeding and Innovative Utilization of Small Fruits of Jilin Province, Changchun, Jilin 130118, China; College of Horticulture, Jilin Agricultural University, Changchun, Jilin 130118, China. Electronic address:
Blueberries are the most popular small berries, in order to solve the problem of unbalanced blueberry resources in different regions of China. In this study, 18 blueberries were analyzed by chromatography and mass spectrometry for 9 soil elements, 6 anthocyanins, 7 phenolic acids, 9 organic acids, and 12 flavonoids. The result showed that blueberry physico-chemical indicators were significantly variable across production regions by Wenn and volcano maps, chlorogenic acid, ascorbic acid, citric acid, catechin were the main antioxidant active components, soil pH was significantly correlated with low content of anthocyanins and organic acids, soil elements were not significantly correlated with fruits antioxidant activity by the network correlation analysis.
View Article and Find Full Text PDFBiophys Chem
January 2025
Department of Chemical and Biological Sciences, S. N. Bose National Centre for Basic Sciences, Kolkata 700106, India. Electronic address:
Quantitative characterization of protein conformational landscapes is a computationally challenging task due to their high dimensionality and inherent complexity. In this study, we systematically benchmark several widely used dimensionality reduction and clustering methods to analyze the conformational states of the Trp-Cage mini-protein, a model system with well-documented folding dynamics. Dimensionality reduction techniques, including Principal Component Analysis (PCA), Time-lagged Independent Component Analysis (TICA), and Variational Autoencoders (VAE), were employed to project the high-dimensional free energy landscape onto 2D spaces for visualization.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!