Progressive Learning of Topic Modeling Parameters: A Visual Analytics Framework


Mennatallah El-Assady, Rita Sevastjanova, Fabian Sperrle, Daniel Keim, and Christopher Collins


Topic modeling algorithms are widely used to analyze the thematic composition of text corpora but remain difficult to interpret and adjust. Addressing these limitations, we present a modular visual analytics framework, tackling the understandability and adaptability of topic models through a user-driven reinforcement learning process which does not require a deep understanding of the underlying topic modeling algorithms. Given a document corpus, our approach initializes two algorithm configurations based on a parameter space analysis that enhances document separability. We abstract the model complexity in an interactive visual workspace for exploring the automatic matching results of two models, investigating topic summaries, analyzing parameter distributions, and reviewing documents. The main contribution of our work is an iterative decision-making technique in which users provide a document-based relevance feedback that allows the framework to converge to a user-endorsed topic distribution. We also report feedback from a two-stage study which shows that our technique results in topic model quality improvements on two independent measures.


To apply our technique on your own data or try out a demo, please visit: (Individual accounts will be created upon request).


  • M. El-Assady, R. Sevastjanova, F. Sperrle, D. Keim, and C. Collins, “Progressive Learning of Topic Modeling Parameters: A Visual Analytics Framework,” IEEE Transactions on Visualization and Computer Graphics, vol. PP, 2017. Honorable Mention for Best Paper.
    [Bibtex] [PDF] [DOI]
      author =    {Mennatallah El-Assady and Rita Sevastjanova and Fabian Sperrle and Daniel Keim and Christopher Collins},
      title =    {Progressive Learning of Topic Modeling Parameters: A Visual Analytics Framework},
      journal = {IEEE Transactions on Visualization and Computer Graphics},
      doi = {10.1109/TVCG.2017.2745080},
      volume = PP,
      year =   2017,
      note = {Honorable Mention for Best Paper}

VAST Award - Honorable Mention for Best Paper


Demo Video:

Talk from IEEE VAST 2017:


EduApps – Supporting Non-Native English Speakers to Overcome Language Transfer Effects

Metatation: Annotation as Implicit Interaction to Bridge Close and Distant Reading

DataTours: A Data Narratives Framework

Perceptual Biases in Font Size as a Data Encoding

Progressive Learning of Topic Modeling Parameters: A Visual Analytics Framework

Abbreviating Text Labels on Demand

NEREx: Named-Entity Relationship Exploration in Multi-Party Conversations

ConToVi: Multi-Party Conversation Exploration using Topic-Space Views

PhysioEx: Visual Analysis of Physiological Event Streams

Using Visual Analytics of Heart Rate Variation to Aid in Diagnostics

Off-Screen Desktop


Reading Comprehension on Mobile Devices

#FluxFlow: Visual Analysis of Anomalous Information Spreading on Social Media

Optimizing Hierarchical Visualizations with the Minimum Description Length Principle


SentimentState: Exploring Sentiment Analysis on Twitter

Facilitating Discourse Analysis with Interactive Visualization




Simple Multi-Touch Toolkit

Exploring Text Entities with Descriptive Non-photorealistic Rendering

Investigating the Semantic Patterns of Passwords

Bubble Sets: Revealing Set Relations with Isocontours over Existing Visualizations

Parallel Tag Clouds to Explore Faceted Text Corpora

VisLink: Revealing Relationships Amongst Visualizations

DocuBurst: Visualizing Document Content using Language Structure

Tabletop Text Entry Techniques

Lattice Uncertainty Visualization: Understanding Machine Translation and Speech Recognition

WordNet Visualization

// Where the sidebar information is stored
| © Copyright vialab | Dr. Christopher Collins, Canada Research Chair in Linguistic Information Visualization |