| Mine your own business, mine others' news! |
| Full text |
Pdf
(382 KB)
|
| Source
|
ACM International Conference Proceeding Series; Vol. 261
archive
Proceedings of the 11th international conference on Extending database technology: Advances in database technology
table of contents
Nantes, France
DEMONSTRATION SESSION: Demonstrations: Web and distribution
table of contents
Pages 725-729
Year of Publication: 2008
ISBN:978-1-59593-926-5
|
|
Authors
|
|
Quang-Khai Pham
|
University of New South Wales, Sydney, NSW, Australia and LINA at University of Nantes, Nantes, France
|
|
Regis Saint-Paul
|
University of New South Wales, Sydney, NSW, Australia
|
|
Boualem Benatallah
|
University of New South Wales, Sydney, NSW, Australia
|
|
Noureddine Mouaddib
|
LINA at University of Nantes, Nantes, France
|
|
Guillaume Raschia
|
LINA at University of Nantes, Nantes, France
|
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 7, Downloads (12 Months): 56, Citation Count: 0
|
|
|
ABSTRACT
Major media companies such as The Financial Times, the Wall Street Journal or Reuters generate huge amounts of textual news data on a daily basis. Mining frequent patterns in this mass of information is critical for knowledge workers such as financial analysts, stock traders or economists. Using existing frequent pattern mining (FPM) algorithms for the analysis of news data is difficult because of the size and lack of structuring of the free text news content. In this article, we demonstrate a comprehensive Streaming TEmporAl Data (STEAD) analysis framework for mining frequent patterns in financial news. In this demonstration, we show how the mining task is supported by the use of a Time-Aware Content Summarization algorithm (TACS). This summary generates a concise representation of large volume of data by taking into account the expert's peculiar interest while preserving the news arrival temporal information which is essential for FPM algorithms. We experimented the whole framework on a set of news data from Reuters.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Wordnet. http://wordnet.princeton.edu/
|
| |
2
|
|
 |
3
|
Shivnath Babu , Minos Garofalakis , Rajeev Rastogi, SPARTAN: a model-based semantic compression system for massive data tables, Proceedings of the 2001 ACM SIGMOD international conference on Management of data, p.283-294, May 21-24, 2001, Santa Barbara, California, United States
|
| |
4
|
|
| |
5
|
|
| |
6
|
|
| |
7
|
Jian Pei , Jiawei Han , Behzad Mortazavi-Asl , Helen Pinto , Qiming Chen , Umeshwar Dayal , Meichun Hsu, PrefixSpan: Mining Sequential Patterns by Prefix-Projected Growth, Proceedings of the 17th International Conference on Data Engineering, p.215-224, April 02-06, 2001
|
| |
8
|
|
|