Background: Digital misinformation, primarily on social media, has led to harmful and costly beliefs in the general population. Notably, these beliefs have resulted in public health crises to the detriment of governments worldwide and their citizens. However, public health officials need access to a comprehensive system capable of mining and analyzing large volumes of social media data in real time.

Objective: This study aimed to design and develop a big data pipeline and ecosystem (UbiLab Misinformation Analysis System [U-MAS]) to identify and analyze false or misleading information disseminated via social media on a certain topic or set of related topics.

Methods: U-MAS is a platform-independent ecosystem developed in Python that leverages the Twitter V2 application programming interface and the Elastic Stack. The U-MAS expert system has 5 major components: data extraction framework, latent Dirichlet allocation (LDA) topic model, sentiment analyzer, misinformation classification model, and Elastic Cloud deployment (indexing of data and visualizations). The data extraction framework queries the data through the Twitter V2 application programming interface, with queries identified by public health experts. The LDA topic model, sentiment analyzer, and misinformation classification model are independently trained using a small, expert-validated subset of the extracted data. These models are then incorporated into U-MAS to analyze and classify the remaining data. Finally, the analyzed data are loaded into an index in the Elastic Cloud deployment and can then be presented on dashboards with advanced visualizations and analytics pertinent to infodemiology and infoveillance analysis.

Results: U-MAS performed efficiently and accurately. Independent investigators have successfully used the system to extract significant insights into a fluoride-related health misinformation use case (2016 to 2021). The system is currently used for a vaccine hesitancy use case (2007 to 2022) and a heat wave-related illnesses use case (2011 to 2022). Each component in the system for the fluoride misinformation use case performed as expected. The data extraction framework handles large amounts of data within short periods. The LDA topic models achieved relatively high coherence values (0.54), and the predicted topics were accurate and befitting to the data. The sentiment analyzer performed at a correlation coefficient of 0.72 but could be improved in further iterations. The misinformation classifier attained a satisfactory correlation coefficient of 0.82 against expert-validated data. Moreover, the output dashboard and analytics hosted on the Elastic Cloud deployment are intuitive for researchers without a technical background and comprehensive in their visualization and analytics capabilities. In fact, the investigators of the fluoride misinformation use case have successfully used the system to extract interesting and important insights into public health, which have been published separately.

Conclusions: The novel U-MAS pipeline has the potential to detect and analyze misleading information related to a particular topic or set of related topics.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10337356PMC
http://dx.doi.org/10.2196/44356DOI Listing

Publication Analysis

Top Keywords

public health
16
data
14
social media
12
data extraction
12
extraction framework
12
lda topic
12
sentiment analyzer
12
elastic cloud
12
cloud deployment
12
misinformation case
12

Similar Publications

Drug-Drug Interaction Between Rifampicin and Albuvirtide: A Phase 1, Randomized, Open-Label Study.

J Clin Pharmacol

January 2025

Department of Pharmacy, Shanghai Public Health Clinical Center, Fudan University, Shanghai, China.

Albuvirtide (ABT) is a novel long-acting fusion inhibitor for human immunodeficiency virus type 1 (HIV-1), and may be co-administered with rifampicin (RIF) in patients concurrent with tubercle bacillus and HIV-1. This study was conducted to investigate the pharmacokinetic effect of co-administration of the two drugs. In the study, 24 healthy volunteers were randomized to receive ABT alone or with RIF.

View Article and Find Full Text PDF

Background: Grasping the nuanced needs of older adults is paramount for the efficacious provision of day-care services. Our study sought to identify the demand patterns for day-care services in China and to explore the underlying factors. This study aims to offer useful evidence that can refine nursing care strategies and guide policy development within day-care settings.

View Article and Find Full Text PDF

Firefighters are exposed to the risk of burns at fire scenes. In 2020, the National Fire Agency of the Republic of Korea surveyed 50,527 firefighters and identified 242 burn-related incidents. The body parts affected by these burns were the hands (28.

View Article and Find Full Text PDF

Objective: We aimed to examine the relationship between the weight-adjusted waist index (WWI) and obstructive sleep apnea (OSA), a condition often caused by obesity, which remains unclear.

Methods: In this cross-sectional study, we analyzed data from the National Health and Nutrition Examination Survey among adults in the United States (US) aged 20 to 65 years, covering the periods 2005 to 2008 and 2015 to 2018. The study included 8278 participants; we used multivariate logistic regression, restricted cubic splines, and subgroup analyses to explore the relationship between WWI and OSA.

View Article and Find Full Text PDF

Pulmonary rehabilitation (PR) services are increasingly using alternative programme delivery modes, for example telerehabilitation strategies including videoconferencing, to improve patient choice and accessibility. Although telerehabilitation results in improvements in core outcomes, the effect on knowledge attainment is not known. To observe the real-world responses of patients choosing to undergo videoconference PR to a matched control group choosing to undergo in-person PR, in terms of knowledge attainment.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!