Topic detection and tracking on heterogeneous information

Chen, Long and Zhang, Huaizhi and Jose, Joemon M. and Yu, Haitao and Moshfeghi, Yashar and Triantafillou, Peter (2017) Topic detection and tracking on heterogeneous information. Journal of Intelligent Information Systems. ISSN 1573-7675 (https://doi.org/10.1007/s10844-017-0487-y)

[thumbnail of Chen-etal-JIIS-2017-Topic-detection-and-tracking-on-heterogeneous]
Preview
Text. Filename: Chen_etal_JIIS_2017_Topic_detection_and_tracking_on_heterogeneous.pdf
Final Published Version
License: Creative Commons Attribution 4.0 logo

Download (2MB)| Preview

Abstract

Given the proliferation of social media and the abundance of news feeds, a substantial amount of real-time content is distributed through disparate sources, which makes it increasingly difficult to glean and distill useful information. Although combining heterogeneous sources for topic detection has gained attention from several research communities, most of them fail to consider the interaction among different sources and their intertwined temporal dynamics. To address this concern, we studied the dynamics of topics from heterogeneous sources by exploiting both their individual properties (including temporal features) and their inter-relationships. We first implemented a heterogeneous topic model that enables topic--topic correspondence between the sources by iteratively updating its topic--word distribution. To capture temporal dynamics, the topics are then correlated with a time-dependent function that can characterise its social response and popularity over time. We extensively evaluate the proposed approach and compare to the state-of-the-art techniques on heterogeneous collection. Experimental results demonstrate that our approach can significantly outperform the existing ones.