| Scalable and near real-time burst detection from eCommerce queries |
Source
|
International Conference on Knowledge Discovery and Data Mining
Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Las Vegas, Nevada, USA
SESSION: Industrial papers
Pages 972-980
Year of Publication: 2008
ISBN:978-1-60558-193-4
ABSTRACT
In large-scale online systems such as search, eCommerce, or social network applications, user queries represent an important dimension of activity that can be used to study the impact on the system, and even on the business. In this paper, we describe how to detect, characterize, and classify bursts in user queries in a large-scale eCommerce system. We build upon the approaches discussed in the KDD 2002 paper "Bursty and Hierarchical Structure in Streams" [3] and apply them to a high-volume industrial context. We describe how to identify bursts on a near real-time basis, classify them, and apply them to build interesting merchandizing applications.
REFERENCES
| |
1
|
|
 |
2
|
Chris Curry, Robert L. Grossman, David Locke, Steve Vejcik, Joseph Bugajski, Detecting changes in large data sets of payment card data: a case study, Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining, August 12-15, 2007, San Jose, California, USA
doi:10.1145/1281192.1281303
|
 |
3
|
|
| |
4
|
Rabiner Lawrence R., A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition. Proceedings of the IEEE, Vol 77, No 2, February 1989.
|
| |
5
|
Aboufadel Edward, Schlicker Steven, Discovering Wavelets. A Wiley-Interscience Publication.
|
| |
6
|
Hulata Eyal, Segev Ronen, Ben-Jacob Eshel, A method for spike sorting and detection based on wavelet packets and Shannon's mutual information. Journal of Neuroscience Methods 117 (2002) 1-12.
|
| |
7
|
Vlachos Michail, Lin Jessica, Keogh Eamonn, Gunopulos Dimitrios, A Wavelet-Based Anytime Algorithm for K-Means Clustering of Time Series. 3rd SIAM International Conference on Data Mining.
|
| |
8
|
Yi Jeonghee, Detecting buzz from time-sequenced document streams. 2005 IEEE International Conference Proceedings.
|
 |
9
|
Xuanhui Wang, ChengXiang Zhai, Xiao Hu, Richard Sproat, Mining correlated bursty topic patterns from coordinated text streams, Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining, August 12-15, 2007, San Jose, California, USA
doi:10.1145/1281192.1281276
|
 |
10
|
Micah Dubinko, Ravi Kumar, Joseph Magnani, Jasmine Novak, Prabhakar Raghavan, Andrew Tomkins, Visualizing tags over time, Proceedings of the 15th international conference on World Wide Web, May 23-26, 2006, Edinburgh, Scotland
doi:10.1145/1135777.1135810
|
| |
11
|
|
| |
12
|
|
| |
13
|
Shaker M. EEG Waves Classifier using Wavelet Transform and Fourier Transform. Intl. Journal of Biomedical Sciences Volume 1, Number 1, 2006.
|
| |
14
|
Chazal P., Celler B., Reilly R. Using Wavelet Coefficients for the Classification of the Electrocardiogram. Proceedings of World Congress on Medical Physics and Biomedical Engineering, 2000.
|
| |
15
|
Cruden D., Hu X. The shapes of some mountain peaks in the Canadian Rockies. Earth Surface Processes and Landforms, Volume 24, Issue 13.
|
 |
16
|
Michail Vlachos, Christopher Meek, Zografoula Vagena, Dimitrios Gunopulos, Identifying similarities, periodicities and bursts for online search queries, Proceedings of the 2004 ACM SIGMOD international conference on Management of data, June 13-18, 2004, Paris, France
doi:10.1145/1007568.1007586
|
| |
17
|
Cyrus Shahabi, Seokkyung Chung, Maytham Safar, George Hajj, 2D TSA-tree: A Wavelet-Based Approach to Improve the Efficiency of Multi-Level Spatial Data Mining, Proceedings of the 13th International Conference on Scientific and Statistical Database Management, p.59-68, July 18-20, 2001
|
 |
18
|
|
| |
19
|
Kleinberg J. Temporal Dynamics of On-Line Information Streams. Processing High-Speed Data Streams, Springer 06.
|
| |
20
|
|
| |
21
|
Shyang Ho S., Wechsler H. Detecting Changes in Unlabeled Data Streams using Martingale. IJCAI07.
|
| |
22
|
|
| |
23
|
|
|