Identifying Twitter users who repost unreliable news sources with linguistic information.

PeerJ Comput Sci

Department of Computer Science, The University of Sheffield, Sheffield, United Kingdom.

Published: December 2020

AI Article Synopsis

  • Social media is a major source of news, but it's also a key platform for spreading disinformation, making it important to identify users likely to share unreliable content.
  • A novel approach was created to predict whether users will repost content from unreliable news sources by analyzing the language of their own posts.
  • A dataset of around 6.2K Twitter users was used to evaluate various machine learning models, achieving a performance of up to 79.7 macro F1, revealing distinct language styles between users who share reliable vs. unreliable content.

Article Abstract

Social media has become a popular source for online news consumption with millions of users worldwide. However, it has become a primary platform for spreading disinformation with severe societal implications. Automatically identifying social media users that are likely to propagate posts from handles of unreliable news sources sometime in the future is of utmost importance for early detection and prevention of disinformation diffusion in a network, and has yet to be explored. To that end, we present a novel task for predicting whether a user will repost content from Twitter handles of unreliable news sources by leveraging linguistic information from the user's own posts. We develop a new dataset of approximately 6.2K Twitter users mapped into two categories: (1) those that have reposted content from unreliable news sources; and (2) those that repost content only from reliable sources. For our task, we evaluate a battery of supervised machine learning models as well as state-of-the-art neural models, achieving up to 79.7 macro F1. In addition, our linguistic feature analysis uncovers differences in language use and style between the two user categories.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7924477PMC
http://dx.doi.org/10.7717/peerj-cs.325DOI Listing

Publication Analysis

Top Keywords

unreliable news
16
news sources
16
twitter users
8
social media
8
handles unreliable
8
repost content
8
news
5
sources
5
identifying twitter
4
users
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!