Exploiting Language Models to Classify Events from Twitter.

Duc-Thuan Vo Vo Thuan Hai Cheol-Young Ock

Comput Intell Neurosci

School of Electrical Engineering, University of Ulsan, 93 Daehak-ro, Nam-gu, Ulsan 680-749, Republic of Korea.

Published: June 2016

Classifying events is challenging in Twitter because tweets texts have a large amount of temporal data with a lot of noise and various kinds of topics. In this paper, we propose a method to classify events from Twitter. We firstly find the distinguishing terms between tweets in events and measure their similarities with learning language models such as ConceptNet and a latent Dirichlet allocation method for selectional preferences (LDA-SP), which have been widely studied based on large text corpora within computational linguistic relations. The relationship of term words in tweets will be discovered by checking them under each model. We then proposed a method to compute the similarity between tweets based on tweets' features including common term words and relationships among their distinguishing term words. It will be explicit and convenient for applying to k-nearest neighbor techniques for classification. We carefully applied experiments on the Edinburgh Twitter Corpus to show that our method achieves competitive results for classifying events.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4584231	PMC
http://dx.doi.org/10.1155/2015/401024	DOI Listing

Publication Analysis

Top Keywords

language models

classify events

events twitter

classifying events

events

exploiting language

models classify

twitter

twitter classifying

events challenging

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!