A hashtag recommendation system for Twitter data streams.

Comput Soc Netw

Department of Mathematics and Computer Science, University of Puget Sound, Tacoma, USA.

Published: May 2016

Background: Twitter has evolved into a powerful communication and information-sharing tool used by millions of people around the world to post what is happening now. A hashtag, a keyword prefixed with a hash symbol (#), is a Twitter feature for organizing tweets and facilitating effective search across a massive volume of data. In this paper, we propose an automatic hashtag recommendation system that helps users find new hashtags related to their interests on demand.

Methods: For hashtag ranking, we propose the Hashtag Frequency-Inverse Hashtag Ubiquity (HF-IHU) ranking scheme, a variation of the well-known TF-IDF that accounts for both hashtag relevancy and data sparseness, one of the key challenges in analyzing microblog data. Our system is built on top of Hadoop, a leading platform for distributed computing, to provide scalable performance using MapReduce. Experiments on a large Twitter data set demonstrate that our method yields hashtags relevant to a user's interests and that its recommendations are more stable and reliable than those obtained by ranking tags based on tweet content similarity.
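The abstract does not give the HF-IHU formula, so the following is only a minimal single-machine sketch of a TF-IDF-style hashtag ranking, written by analogy with TF-IDF. The function names (`build_index`, `recommend`), the exact weighting (`hf`, `ihu`), and the toy data layout are assumptions made for illustration; they are not taken from the paper, and the sketch does not use Hadoop or MapReduce.

```python
import math
from collections import Counter, defaultdict


def build_index(tweets):
    """Build co-occurrence statistics from (tokens, hashtags) pairs.

    Hypothetical helper: the data layout is an assumption of this sketch.
    """
    term_tag = defaultdict(Counter)  # term -> hashtag -> co-occurrence count
    tag_terms = defaultdict(set)     # hashtag -> distinct terms seen with it
    for tokens, hashtags in tweets:
        for term in set(tokens):
            for tag in set(hashtags):
                term_tag[term][tag] += 1
                tag_terms[tag].add(term)
    return term_tag, tag_terms


def recommend(query_tokens, term_tag, tag_terms, k=10):
    """Rank hashtags for a query with an assumed TF-IDF-like score."""
    vocab_size = len(term_tag)
    scores = Counter()
    for term in set(query_tokens):
        tag_counts = term_tag.get(term)
        if not tag_counts:
            continue  # term never seen with any hashtag
        total = sum(tag_counts.values())
        for tag, count in tag_counts.items():
            # "Hashtag frequency": how often this tag accompanies the term.
            hf = count / total
            # "Inverse ubiquity": down-weight tags that co-occur with many terms.
            ihu = math.log(vocab_size / len(tag_terms[tag]))
            scores[tag] += hf * ihu
    return [tag for tag, _ in scores.most_common(k)]
```

With a handful of labeled tweets, `recommend(["code", "bug"], *build_index(tweets))` returns the hashtags that most often accompany those terms, penalizing hashtags that appear alongside almost every term. In a distributed setting the counting step in `build_index` is the part that would map naturally onto a MapReduce job.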

Results and conclusions: Our results show that HF-IHU can achieve over 30% hashtag recall when asked to identify the top 10 relevant hashtags for a particular tweet. Furthermore, our method outperforms kNN, k-popularity, and Naïve Bayes by 69, 54, and 17%, respectively, on recall of the top 200 hashtags.
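To make the recall figures concrete, here is a minimal sketch of recall over the top k recommendations for a single tweet. The function name `recall_at_k` and the per-tweet definition are assumptions; the abstract does not spell out the exact evaluation protocol.

```python
def recall_at_k(recommended, actual, k):
    """Fraction of a tweet's true hashtags found in the top-k recommendations.

    Assumed definition for illustration; the paper's protocol may differ.
    """
    actual = set(actual)
    if not actual:
        return 0.0  # no ground-truth hashtags to recover
    hits = len(set(recommended[:k]) & actual)
    return hits / len(actual)
```

For example, if a tweet carries two hashtags and one of them appears among the first two recommendations, `recall_at_k(recommended, actual, 2)` is 0.5. Averaging this value over a held-out set of tweets gives an aggregate figure comparable in spirit to the ones reported above.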


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5749337
DOI: http://dx.doi.org/10.1186/s40649-016-0028-9

