Predicting Prefecture-Level Well-Being Indicators in Japan Using Search Volumes in Internet Search Engines: Infodemiology Study.

J Med Internet Res

Human Health Sciences, Graduate School of Medicine, Kyoto University, Kyoto, Japan.

Published: November 2024

Background: In recent years, the adoption of well-being indicators by national governments and international organizations has emerged as an important tool for evaluating state governance and societal progress. Traditionally, well-being has been gauged primarily through economic metrics such as gross domestic product, which fall short of capturing multifaceted well-being, including socioeconomic inequalities, life satisfaction, and health status. Current well-being indicators, including both subjective and objective measures, offer a broader evaluation but face challenges such as high survey costs and difficulties in evaluating at regional levels within countries. The emergence of web log data as an alternative source of well-being indicators offers the potential for more cost-effective, timely, and less biased assessments.

Objective: This study aimed to develop a model using internet search data to predict well-being indicators at the regional level in Japan, providing policy makers with a more accessible and cost-effective tool for assessing public well-being and making informed decisions.

Methods: This study used the Regional Well-Being Index (RWI) for Japan, which evaluates prefectural well-being across 47 prefectures for the years 2010, 2013, 2016, and 2019, as the outcome variable. The RWI includes a comprehensive approach integrating both subjective and objective indicators across 11 domains, including income, job, and life satisfaction. Predictor variables included z score-normalized relative search volume (RSV) data from Google Trends for words relevant to each domain. Unrelated words were excluded from the analysis to ensure relevance. The Elastic Net methodology was applied to predict RWI using RSVs, with α balancing ridge and lasso effects and λ regulating their strengths. The model was optimized by cross-validation, determining the best mix and strength of regularization parameters to minimize prediction error. Root mean square errors (RMSE) and coefficients of determination (R) were used to assess the model's predictive accuracy and fit.

Results: An analysis of Google Trends data yielded 275 words related to the RWI domains, and RSVs were collected for 211 words after filtering out irrelevant terms. The mean search frequencies for these words during 2010, 2013, 2016, and 2019 ranged from -1.587 to 3.902, with SDs between 3.025 and 0.053. The best Elastic Net model (α=0.1, λ=0.906, RMSE=1.290, and R=0.904) was built using 2010-2016 training data and 2-13 variables per domain. Applied to 2019 test data, it yielded an RMSE of 2.328 and R of 0.665.

Conclusions: This study demonstrates the effectiveness of using internet search log data through the Elastic Net machine learning method to predict the RWI in Japanese prefectures with high accuracy, offering a rapid and cost-efficient alternative to traditional survey approaches. This study highlights the potential of this methodology to provide foundational data for evidence-based policy making aimed at enhancing well-being.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11589491PMC
http://dx.doi.org/10.2196/64555DOI Listing

Publication Analysis

Top Keywords

well-being indicators
20
internet search
12
elastic net
12
well-being
11
life satisfaction
8
subjective objective
8
data
8
log data
8
2010 2013
8
2013 2016
8

Similar Publications

Comprehensive histopathological analysis of gastric cancer in European and Latin America populations reveals differences in PDL1, HER2, p53 and MUC6 expression.

Gastric Cancer

January 2025

Department of Medical Oncology, Hospital Clinico Universitario, INCLIVA, Biomedical Research Institute, University of Valencia, Avenida Menendez Pelayo nro 4 accesorio, Valencia, Spain.

Introduction: Gastric cancer (GC) burden is currently evolving with regional differences associated with complex behavioural, environmental, and genetic risk factors. The LEGACy study is a Horizon 2020-funded multi-institutional research project conducted prospectively to provide comprehensive data on the tumour biological characteristics of gastroesophageal cancer from European and LATAM countries.

Material And Methods: Treatment-naïve advanced gastroesophageal adenocarcinoma patients were prospectively recruited in seven European and LATAM countries.

View Article and Find Full Text PDF

Background/purpose: Although metabolic dysfunction-associated steatotic liver disease (MASLD) has been proposed to replace the diagnosis of non-alcoholic fatty liver disease (NAFLD) with new diagnostic criteria since 2023, the genetic predisposition of MASLD remains to be explored.

Methods: Participants with data of genome-wide association studies (GWAS) in the Taiwan Biobank database were collected. Patients with missing data, positive for HBsAg, anti-HCV, and alcohol drinking history were excluded.

View Article and Find Full Text PDF

Background Aims: Extracellular vesicles (EVs) have gained traction as potential cell-free therapeutic candidates. Development of purification methods that are scalable and robust is a major focus of EV research. Yet there is still little in the literature that evaluates purification methods against potency of the EV product.

View Article and Find Full Text PDF

Background: Primary care physicians (PCPs) and nurse practitioners play a key role in guiding caregivers on early peanut protein (PP) introduction, yet many lack adequate knowledge.

Aim Statement: This quality improvement study aimed to enhance understanding among PCPs and caregivers about evidence-based guidelines for early PP introduction in infants' diets.

Methods: Using the Stetler Model, PCP knowledge was evaluated through pre-test, educational video and some posttest material.

View Article and Find Full Text PDF

Background: Patellar instability is frequently encountered by orthopaedic surgeons. One of the major risk factors of this condition is underlying trochlear dysplasia (TD). Recent trends have indicated the use of multiple procedures to correct patellar instability under these conditions.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!