Background: Hypertension is a major global health issue and a significant modifiable risk factor for cardiovascular diseases, contributing to a substantial socioeconomic burden due to its high prevalence. In China, particularly among populations living near desert regions, hypertension is even more prevalent due to unique environmental and lifestyle conditions, exacerbating the disease burden in these areas, underscoring the urgent need for effective early detection and intervention strategies.

Objective: This study aims to develop, calibrate, and prospectively validate a 2-year hypertension risk prediction model by using large-scale health examination data collected from populations residing in 4 regions surrounding the Taklamakan Desert of northwest China.

Methods: We retrospectively analyzed the health examination data of 1,038,170 adults (2019-2021) and prospectively validated our findings in a separate cohort of 961,519 adults (2021-2023). Data included demographics, lifestyle factors, physical examinations, and laboratory measurements. Feature selection was performed using light gradient-boosting machine-based recursive feature elimination with cross-validation and Least Absolute Shrinkage and Selection Operator, yielding 24 key predictors. Multiple machine learning (logistic regression, random forest, extreme gradient boosting, light gradient-boosting machine) and deep learning (Feature Tokenizer + Transformer, SAINT) models were trained with Bayesian hyperparameter optimization.

Results: Over a 2-year follow-up, 15.20% (157,766/1,038,170) of the participants in the retrospective cohort and 10.50% (101,077/961,519) in the prospective cohort developed hypertension. Among the models developed, the CatBoost model demonstrated the best performance, achieving area under the curve (AUC) values of 0.888 (95% CI 0.886-0.889) in the retrospective cohort and 0.803 (95% CI 0.801-0.804) in the prospective cohort. Calibration via isotonic regression improved the model's probability estimates, with Brier scores of 0.090 (95% CI 0.089-0.091) and 0.102 (95% CI 0.101-0.103) in the internal validation and prospective cohorts, respectively. Participants were ranked by the positive predictive value calculated using the calibrated model and stratified into 4 risk categories (low, medium, high, and very high), with the very high group exhibiting a 41.08% (5741/13,975) hypertension incidence over 2 years. Age, BMI, and socioeconomic factors were identified as significant predictors of hypertension.

Conclusions: Our machine learning model effectively predicted the 2-year risk of hypertension, making it particularly suitable for preventive health care management in high-risk populations residing in the desert regions of China. Our model exhibited excellent predictive performance and has potential for clinical application. A web-based application was developed based on our predictive model, which further enhanced the accessibility for clinical and public health use, aiding in reducing the burden of hypertension through timely prevention strategies.

Download full-text PDF

Source
http://dx.doi.org/10.2196/68442DOI Listing

Publication Analysis

Top Keywords

desert regions
12
prospective cohort
12
hypertension incidence
8
risk prediction
8
health examination
8
examination data
8
populations residing
8
light gradient-boosting
8
machine learning
8
retrospective cohort
8

Similar Publications

Microplastics (MPs), as a global environmental issue, have unclear impacts on agricultural ecosystems. Cotton, as a major agricultural crop in Xinjiang, requires plastic film covering to ensure its yield. The widespread use of plastic film (commonly made of polyethylene) in cotton cultivation has led to significant concerns about microplastic pollution in cotton fields.

View Article and Find Full Text PDF

Background: Hypertension is a major global health issue and a significant modifiable risk factor for cardiovascular diseases, contributing to a substantial socioeconomic burden due to its high prevalence. In China, particularly among populations living near desert regions, hypertension is even more prevalent due to unique environmental and lifestyle conditions, exacerbating the disease burden in these areas, underscoring the urgent need for effective early detection and intervention strategies.

Objective: This study aims to develop, calibrate, and prospectively validate a 2-year hypertension risk prediction model by using large-scale health examination data collected from populations residing in 4 regions surrounding the Taklamakan Desert of northwest China.

View Article and Find Full Text PDF

Explaining global species richness patterns is a major goal of evolution, ecology, and biogeography. These richness patterns are often attributed to spatial variation in diversification rates (speciation minus extinction). Surprisingly, prominent studies of birds, fish, and plants have reported higher speciation and/or diversification rates at higher latitudes, where species richness is lower.

View Article and Find Full Text PDF

The ecological research of regional land use and land cover change (LULCC) under the Shared Socioeconomic Pathway (SSP) and Representative Concentration Pathway (RCP) scenarios proposed by the IPCC has become a prominent topic. This study investigates the spatial distribution of ecological risks associated with land use and land cover changes in the arid and semi-arid regions of Xinjiang under future SSP-RCP scenarios. In this paper, LUCC data, climate data, and soil and topographic data under different scenarios in 2100 were adopted to construct the land use/land cover quality index (LQI), the climate quality index (CQI), and the soil quality index (SQI) respectively.

View Article and Find Full Text PDF

This study investigates plutonium (Pu) isotopes preserved in nebkhas--aeolian dunes formed by shrubs intercepting wind-blown sands to reconstruct environmental changes in the semi-arid Mu Us dune field, northern China. Analysis results of two nebkha profiles reveal that the Pu/Pu atom ratios consistently approximate 0.18, indicating a dominant source from global fallout, with no significant local contributions from the Lop Nor or Semipalatinsk nuclear tests or the Chernobyl accident.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!