An Analysis of French-Language Tweets About COVID-19 Vaccines: Supervised Learning Approach.

JMIR Med Inform

Polytech Clermont, Clermont Auvergne INP, Aubiere, France.

Published: May 2022

Background: As the COVID-19 pandemic progressed, disinformation, fake news, and conspiracy theories spread through many parts of society. However, the disinformation spreading through social media is, according to the literature, one of the causes of increased COVID-19 vaccine hesitancy. In this context, the analysis of social media posts is particularly important, but the large amount of data exchanged on social media platforms requires specific methods. This is why machine learning and natural language processing models are increasingly applied to social media data.

Objective: The aim of this study is to examine the capability of the CamemBERT French-language model to faithfully predict the elaborated categories, with the knowledge that tweets about vaccination are often ambiguous, sarcastic, or irrelevant to the studied topic.

Methods: A total of 901,908 unique French-language tweets related to vaccination published between July 12, 2021, and August 11, 2021, were extracted using Twitter's application programming interface (version 2; Twitter Inc). Approximately 2000 randomly selected tweets were labeled with 2 types of categorizations: (1) arguments for (pros) or against (cons) vaccination (health measures included) and (2) type of content (scientific, political, social, or vaccination status). The CamemBERT model was fine-tuned and tested for the classification of French-language tweets. The model's performance was assessed by computing the F1-score, and confusion matrices were obtained.

Results: The accuracy of the applied machine learning reached up to 70.6% for the first classification (pro and con tweets) and up to 90% for the second classification (scientific and political tweets). Furthermore, a tweet was 1.86 times more likely to be incorrectly classified by the model if it contained fewer than 170 characters (odds ratio 1.86; 95% CI 1.20-2.86).

Conclusions: The accuracy of the model is affected by the classification chosen and the topic of the message examined. When the vaccine debate is jostled by contested political decisions, tweet content becomes so heterogeneous that the accuracy of the model drops for less differentiated classes. However, our tests showed that it is possible to improve the accuracy by selecting tweets using a new method based on tweet length.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9116457PMC
http://dx.doi.org/10.2196/37831DOI Listing

Publication Analysis

Top Keywords

social media
16
french-language tweets
12
tweets
8
machine learning
8
tweets vaccination
8
scientific political
8
accuracy model
8
social
5
model
5
analysis french-language
4

Similar Publications

Background: Information exchange regarding the scope and content of health studies is becoming increasingly important. Digital methods, including study websites, can facilitate such an exchange.

Objective: This scoping review aimed to describe how digital information exchange occurs between the public and researchers in health studies.

View Article and Find Full Text PDF

Introduction: Carpal tunnel syndrome (CTS) is the most common peripheral nerve entrapment disease, and it is a subject of great interest and concern to medical professionals and the general public. Our study aims to analyze and compare the quality and accuracy of the information related to CTS provided by social media platforms (SMPs) and the new large language models (LLM).

Methods: On YouTube, the first 20 videos in English and the first 20 videos in Spanish when searching for "carpal tunnel syndrome" and "síndrome túnel carpo" were selected.

View Article and Find Full Text PDF

Technology advances lead to a high prevalence of cyber dating abuse among youth. Previous studies had demonstrated its detrimental outcomes and predictors, but neglected the characters in Eastern countries. Therefore, exploring the comprehensive mechanisms of cyber dating abuse in different cultures and mitigating it are necessary.

View Article and Find Full Text PDF

Background: Managing preoperative anxiety in pediatric anesthesia is challenging, as it impacts patient cooperation and postoperative outcomes. Both pharmacological and nonpharmacological interventions are used to reduce children's anxiety levels. However, the optimal approach remains debated, with evidence-based guidelines still lacking.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!