Dynamic topic detection and tracking: A comparison of HDP, C-word, and cocitation methods

Wanying Ding, Chaomei Chen

Research output: Contribution to journalArticle

30 Citations (Scopus)

Abstract

Cocitation and co-word methods have long been used to detect and track emerging topics in scientific literature, but both have weaknesses. Recently, while many researchers have adopted generative probabilistic models for topic detection and tracking, few have compared generative probabilistic models with traditional cocitation and co-word methods in terms of their overall performance. In this article, we compare the performance of hierarchical Dirichlet process (HDP), a promising generative probabilistic model, with that of the 2 traditional topic detecting and tracking methods - cocitation analysis and co-word analysis. We visualize and explore the relationships between topics identified by the 3 methods in hierarchical edge bundling graphs and time flow graphs. Our result shows that HDP is more sensitive and reliable than the other 2 methods in both detecting and tracking emerging topics. Furthermore, we demonstrate the important topics and topic evolution trends in the literature of terrorism research with the HDP method.

Original languageEnglish
Pages (from-to)2084-2097
Number of pages14
JournalJournal of the Association for Information Science and Technology
Volume65
Issue number10
DOIs
Publication statusPublished - 2014 Oct 1

All Science Journal Classification (ASJC) codes

  • Information Systems
  • Computer Networks and Communications
  • Information Systems and Management
  • Library and Information Sciences

Fingerprint Dive into the research topics of 'Dynamic topic detection and tracking: A comparison of HDP, C-word, and cocitation methods'. Together they form a unique fingerprint.

  • Cite this