|
ABSTRACT
In many practical applications, one is interested in generating a ranked list of items using information mined from continuous streams of data. For example, in the context of computer networks, one might want to generate lists of nodes ranked according to their susceptibility to attack. In addition, real-world data streams often exhibit concept drift, making the learning task even more challenging. We present an online learning approach to ranking with concept drift, using weighted majority techniques. By continuously modeling different snapshots of the data and tuning our measure of belief in these models over time, we capture changes in the underlying concept and adapt our predictions accordingly. We measure the performance of our algorithm on real electricity data as well as asynthetic data stream, and demonstrate that our approach to ranking from stream data outperforms previously known batch-learning methods and other online methods that do not account for concept drift.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
|
| |
3
|
|
| |
4
|
|
| |
5
|
|
 |
6
|
Chris Burges , Tal Shaked , Erin Renshaw , Ari Lazier , Matt Deeds , Nicole Hamilton , Greg Hullender, Learning to rank using gradient descent, Proceedings of the 22nd international conference on Machine learning, p.89-96, August 07-11, 2005, Bonn, Germany
[doi> 10.1145/1102351.1102363]
|
 |
7
|
Nicolò Cesa-Bianchi , Yoav Freund , David Haussler , David P. Helmbold , Robert E. Schapire , Manfred K. Warmuth, How to use expert advice, Journal of the ACM (JACM), v.44 n.3, p.427-485, May 1997
[doi> 10.1145/258128.258179]
|
| |
8
|
F. Chu and C. Zaniolo. Fast and light boosting for adaptive mining of data streams. In Proceedings of the Pacific-Asia Knowledge Discovery and Data Mining Conference pages 282--292, 2004.
|
| |
9
|
V. Dani, O. Madani, D. Pennock, S. Sanghai, and B. Galebach. An empirical comparison of algorithms for aggregating expert predictions. In UAI 2006.
|
| |
10
|
|
 |
11
|
|
| |
12
|
|
 |
13
|
Yoav Freund , Robert E. Schapire , Yoram Singer , Manfred K. Warmuth, Using and combining predictors that specialize, Proceedings of the twenty-ninth annual ACM symposium on Theory of computing, p.334-343, May 04-06, 1997, El Paso, Texas, United States
[doi> 10.1145/258533.258616]
|
 |
14
|
|
| |
15
|
J. Gama and G. Castillo. Learning with local drift detection. In ADMA pages 42--55, 2006.
|
| |
16
|
J. Gama, P. Medas, G. Castillo, and P. P. Rodrigues. Learning with drift detection. In SBIA pages 286--295, 2004.
|
 |
17
|
|
| |
18
|
P. Gross, A. Boulanger, M. Arias, D. L. Waltz, P. M. Long, C. Lawson, R. Anderson, M. Koenig,M. Mastrocinque, W. Fairechio, J. A. Johnson, S. Lee, F. Doherty, and A. Kressner. Predicting electricity distribution feeder failures using machine learning susceptibility analysis. In AAAI 2006.
|
| |
19
|
|
 |
20
|
|
| |
21
|
|
| |
22
|
R. Klinkenberg. Meta-learning, model selection, and example selection in machine learning domains with concept drift. In LWA pages 164--171, 2005.
|
| |
23
|
|
| |
24
|
R. Klinkenberg and I. Renz. Adaptive information ?ltering: Learning in the presence of concept drifts, 1998.
|
| |
25
|
R. Klinkenberg and S. Rüping. Concept drift and the importance of example. In Text Mining pages 55--78. Jürgen Franke, Gholamreza Nakhaeizadeh, and Ingrid Renz, 2003.
|
 |
26
|
|
| |
27
|
|
| |
28
|
N. Littlestone and M. K. Warmuth. The weighted majority algorithm. In IEEE Symposium on Foundations of Computer Science pages 256--261, 1989.
|
| |
29
|
P. M. Long and R. A. Servedio. Martingale boosting. In COLT pages 79--94, 2005.
|
| |
30
|
|
| |
31
|
N. Oza and S. Russell. Online bagging and boosting. In Artificial Intelligence and Statistics 2001 pages 105--112. Morgan Kaufmann, 2001.
|
 |
32
|
|
| |
33
|
F. Rosenblatt. The perceptron: A probabilistic model for information storage and organization in the brain. Psychological Review 65(1):386--407, 1958.
|
| |
34
|
C. Rudin. Ranking with a p-norm push. In COLT pages 589--604, 2006.
|
| |
35
|
M. Scholz and R. Klinkenberg. An ensemble classi?er for drifting concepts. In Proceedings of the Second International Workshop on Knowledge Discovery in Data Streams pages 53--64, 2005.
|
| |
36
|
M. Severo and J. Gama. Change detection with kalman ?lter and cusum. In Discovery Science pages 243--254, 2006.
|
| |
37
|
K. Stanley. Learning concept drift with a committee of decision trees, 2001.
|
 |
38
|
|
| |
39
|
|
| |
40
|
|
 |
41
|
|
| |
42
|
|
| |
43
|
|
|