|
ABSTRACT
The aggregation and comparison of behavioral patterns on the WWW represent a tremendous opportunity for understanding past behaviors and predicting future behaviors. In this paper, we take a first step at achieving this goal. We present a large scale study correlating the behaviors of Internet users on multiple systems ranging in size from 27 million queries to 14 million blog posts to 20,000 news articles. We formalize a model for events in these time-varying datasets and study their correlation. We have created an interface for analyzing the datasets, which includes a novel visual artifact, the DTWRadar, for summarizing differences between time series. Using our tool we identify a number of behavioral properties that allow us to understand the predictive power of patterns of use.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Aizen, J., D. Huttenlocher, J. Kleinberg, and A. Novak, "Traffic-Based Feedback on the Web," PNAS, Suppl. 1: 5254--5260, Apr. 6, 2004.
|
| |
2
|
Allan, J., J. Carbonell, G. Doddington, J. Yamron, Y. Yang, "Topic Detection and Tracking Pilot Study Final Report," Proc. of the DARPA Broadcast News Transcription and Understanding Workshop, Feb., 1998.
|
| |
3
|
|
 |
4
|
|
 |
5
|
|
 |
6
|
Daniel Gruhl , R. Guha , Ravi Kumar , Jasmine Novak , Andrew Tomkins, The predictive power of online chatter, Proceeding of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining, August 21-24, 2005, Chicago, Illinois, USA
[doi> 10.1145/1081870.1081883]
|
| |
7
|
|
| |
8
|
|
| |
9
|
Keogh, E.J., and M.J. Pazzani, "Derivative Dynamic Time Warping," SDM '01, Chicago, Apr. 5-7, 2001.
|
 |
10
|
|
| |
11
|
Kleinberg, J., "Temporal Dynamics of On-Line Information Streams," In Data Stream Management: Processing High-Speed Data Streams, M. Garofalakis, J. Gehrke, R. Rastogi, eds., Springer, 2006.
|
| |
12
|
Lavrenko, V., M. Schmill, D. Lawrie, and P. Ogilvie, D. Jensen and J. Allen, "Mining of Concurrent Text and Time Series," Workshop on Text Mining, KDD '00, Boston, MA. Aug. 20, 2000.
|
 |
13
|
|
| |
14
|
Martzoukou, K., "A review of Web information seeking research: considerations of method and foci of interest," Information Research, 10(2), paper 215, 2004.
|
| |
15
|
Microsoft Live Labs, "Accelerating Search in Academic Research," 2006.
|
| |
16
|
Murray, G. C., J. Lin, and A. Chowdhury, "Identification of User Sessions with Hierarchical Agglomerative Clustering," ASIS&T'06, Austin, TX, Nov. 3-8, 2006.
|
| |
17
|
Myers, C.S., and L.R. Rabiner, "A Comparative Study of Several Dynamic Time-Warping Algorithms for Connected Word Recognition," The Bell System Tech. J., 60(7):1389--1408, September, 191.
|
| |
18
|
Nielsen BuzzMetrics, ICWSM Conference dataset, http://www.icwsm.org/data.html
|
 |
19
|
|
| |
20
|
Sakoe, H., and S. Chiba, "Dynamic Programming Algorithm Optimization for Spoken Word Recognition," IEEE Transactions on Acoustics, Speech and Signal Processing, Vol. ASSP-26(1):43--49, 1978.
|
 |
21
|
Jaime Teevan , Eytan Adar , Rosie Jones , Michael Potts, History repeats itself: repeat queries in Yahoo's logs, Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval, August 06-11, 2006, Seattle, Washington, USA
[doi> 10.1145/1148170.1148326]
|
| |
22
|
|
| |
23
|
|
 |
24
|
Michail Vlachos , Christopher Meek , Zografoula Vagena , Dimitrios Gunopulos, Identifying similarities, periodicities and bursts for online search queries, Proceedings of the 2004 ACM SIGMOD international conference on Management of data, June 13-18, 2004, Paris, France
[doi> 10.1145/1007568.1007586]
|
| |
25
|
|
 |
26
|
|
| |
27
|
Witkin, A. P. "Scale-space filtering", IJCAI '83, Karlsruche, Germany, Aug. 8-12, 1983.
|
CITED BY 4
|
|
Yabo Xu , Ke Wang , Ada Wai-Chee Fu , Philip S. Yu, Anonymizing transaction databases for publication, Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, August 24-27, 2008, Las Vegas, Nevada, USA
|
|
|
|
|
|
Munmun De Choudhury , Hari Sundaram , Ajita John , Dorée Duncan Seligmann, What makes conversations interesting?: themes, participants and consequences of conversations in online social media, Proceedings of the 18th international conference on World wide web, April 20-24, 2009, Madrid, Spain
|
|
|
|
|