ACM Home Page
Please provide us with feedback. Feedback
Tracking dynamics of topic trends using a finite mixture model
Full text PdfPdf (169 KB)
Source International Conference on Knowledge Discovery and Data Mining archive
Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining table of contents
Seattle, WA, USA
POSTER SESSION: Industry/government track posters table of contents
Pages: 811 - 816  
Year of Publication: 2004
ISBN:1-58113-888-1
Authors
Satoshi Morinaga  NEC Corporation, Kawasaki, Kanagawa, JAPAN
Kenji Yamanishi  NEC Corporation, Kawasaki, Kanagawa, JAPAN
Sponsors
SIGMOD: ACM Special Interest Group on Management of Data
SIGKDD: ACM Special Interest Group on Knowledge Discovery in Data
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 20,   Downloads (12 Months): 149,   Citation Count: 15
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues   peer to peer  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1014052.1016919
What is a DOI?

ABSTRACT

In a wide range of business areas dealing with text data streams, including CRM, knowledge management, and Web monitoring services, it is an important issue to discover topic trends and analyze their dynamics in real-time. Specifically we consider the following three tasks in topic trend analysis: 1)Topic Structure Identification; identifying what kinds of main topics exist and how important they are, 2)Topic Emergence Detection; detecting the emergence of a new topic and recognizing how it grows, 3)Topic Characterization; identifying the characteristics for each of main topics. For real topic analysis systems, we may require that these three tasks be performed in an on-line fashion rather than in a retrospective way, and be dealt with in a single framework. This paper proposes a new topic analysis framework which satisfies this requirement from a unifying viewpoint that a topic structure is modeled using a finite mixture model and that any change of a topic trend is tracked by learning the finite mixture model dynamically. In this framework we propose the usage of a time-stamp based discounting learning algorithm in order to realize real-time topic structure identification. This enables tracking the topic structure adaptively by forgetting out-of-date statistics. Further we apply the theory of dynamic model selection to detecting changes of main components in the finite mixture model in order to realize topic emergence detection. We demonstrate the effectiveness of our framework using real data collected at a help desk to show that we are able to track dynamics of topic trends in a timely fashion.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
2
 
3
4
 
5
 
6
 
7
Y.Matsunaga and K.Yamanishi: An information-theoretic approach to detecting anomalous behaviors, in Information Technology Letters vol.2 (Proc. of the 2nd Forum on Information Technologies), pp:123--124, (in Japanese) 2003.
 
8
G.McLahlan and D.Peel: Finite Mixture Models, Wiley Series in Probability and Statistics, John Wiley and Sons, 2000.
 
9
 
10
J.Rissanen: Universal coding, information, and estimation, IEEE Trans. on Inform. Theory, 30:629--636, 1984.
11
12
 
13
K.Yamanishi: A Decision-theoretic Extension of Stochastic Complexity and Its Applications to Learning, IEEE Trans. on Inform. Theory, vol.44/4, pp:1424--1439, 1998.
14
15
16

CITED BY  15
 
 
 
 
 

Collaborative Colleagues:
Satoshi Morinaga: colleagues
Kenji Yamanishi: colleagues

Peer to Peer - Readers of this Article have also read: