ConceptVector: Text Visual Analytics via Interactive Lexicon Building Using Word Embedding.

Deokgun Park Seungyeon Kim Jurim Lee Jaegul Choo Nicholas Diakopoulos Niklas Elmqvist

IEEE Trans Vis Comput Graph

Published: January 2018

Central to many text analysis methods is the notion of a concept: a set of semantically related keywords characterizing a specific object, phenomenon, or theme. Advances in word embedding allow building a concept from a small set of seed terms. However, naive application of such techniques may result in false positive errors because of the polysemy of natural language. To mitigate this problem, we present a visual analytics system called ConceptVector that guides a user in building such concepts and then using them to analyze documents. Document-analysis case studies with real-world datasets demonstrate the fine-grained analysis provided by ConceptVector. To support the elaborate modeling of concepts, we introduce a bipolar concept model and support for specifying irrelevant words. We validate the interactive lexicon building interface by a user study and expert reviews. Quantitative evaluation shows that the bipolar lexicon generated with our methods is comparable to human-generated ones.

Download full-text PDF	Source
http://dx.doi.org/10.1109/TVCG.2017.2744478	DOI Listing

Publication Analysis

Top Keywords

visual analytics

interactive lexicon

lexicon building

word embedding

conceptvector text

text visual

analytics interactive

building

building word

embedding central

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!