ACM Home Page
Please provide us with feedback. Feedback
Real-time ranking with concept drift using expert advice
Full text PdfPdf (2.57 MB)
Source
International Conference on Knowledge Discovery and Data Mining archive
Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining table of contents
San Jose, California, USA
SESSION: Research track papers table of contents
Pages: 86 - 94  
Year of Publication: 2007
ISBN:978-1-59593-609-7
Authors
Hila Becker  Columbia University
Marta Arias  Columbia University
Sponsors
ACM: Association for Computing Machinery
SIGKDD: ACM Special Interest Group on Knowledge Discovery in Data
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 8,   Downloads (12 Months): 140,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1281192.1281205
What is a DOI?

ABSTRACT

In many practical applications, one is interested in generating a ranked list of items using information mined from continuous streams of data. For example, in the context of computer networks, one might want to generate lists of nodes ranked according to their susceptibility to attack. In addition, real-world data streams often exhibit concept drift, making the learning task even more challenging. We present an online learning approach to ranking with concept drift, using weighted majority techniques. By continuously modeling different snapshots of the data and tuning our measure of belief in these models over time, we capture changes in the underlying concept and adapt our predictions accordingly. We measure the performance of our algorithm on real electricity data as well as asynthetic data stream, and demonstrate that our approach to ranking from stream data outperforms previously known batch-learning methods and other online methods that do not account for concept drift.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
 
3
 
4
 
5
6
7
 
8
F. Chu and C. Zaniolo. Fast and light boosting for adaptive mining of data streams. In Proceedings of the Pacific-Asia Knowledge Discovery and Data Mining Conference pages 282--292, 2004.
 
9
V. Dani, O. Madani, D. Pennock, S. Sanghai, and B. Galebach. An empirical comparison of algorithms for aggregating expert predictions. In UAI 2006.
 
10
11
 
12
13
14
 
15
J. Gama and G. Castillo. Learning with local drift detection. In ADMA pages 42--55, 2006.
 
16
J. Gama, P. Medas, G. Castillo, and P. P. Rodrigues. Learning with drift detection. In SBIA pages 286--295, 2004.
17
 
18
P. Gross, A. Boulanger, M. Arias, D. L. Waltz, P. M. Long, C. Lawson, R. Anderson, M. Koenig,M. Mastrocinque, W. Fairechio, J. A. Johnson, S. Lee, F. Doherty, and A. Kressner. Predicting electricity distribution feeder failures using machine learning susceptibility analysis. In AAAI 2006.
 
19
20
 
21
 
22
R. Klinkenberg. Meta-learning, model selection, and example selection in machine learning domains with concept drift. In LWA pages 164--171, 2005.
 
23
 
24
R. Klinkenberg and I. Renz. Adaptive information ?ltering: Learning in the presence of concept drifts, 1998.
 
25
R. Klinkenberg and S. Rüping. Concept drift and the importance of example. In Text Mining pages 55--78. Jürgen Franke, Gholamreza Nakhaeizadeh, and Ingrid Renz, 2003.
26
 
27
 
28
N. Littlestone and M. K. Warmuth. The weighted majority algorithm. In IEEE Symposium on Foundations of Computer Science pages 256--261, 1989.
 
29
P. M. Long and R. A. Servedio. Martingale boosting. In COLT pages 79--94, 2005.
 
30
 
31
N. Oza and S. Russell. Online bagging and boosting. In Artificial Intelligence and Statistics 2001 pages 105--112. Morgan Kaufmann, 2001.
32
 
33
F. Rosenblatt. The perceptron: A probabilistic model for information storage and organization in the brain. Psychological Review 65(1):386--407, 1958.
 
34
C. Rudin. Ranking with a p-norm push. In COLT pages 589--604, 2006.
 
35
M. Scholz and R. Klinkenberg. An ensemble classi?er for drifting concepts. In Proceedings of the Second International Workshop on Knowledge Discovery in Data Streams pages 53--64, 2005.
 
36
M. Severo and J. Gama. Change detection with kalman ?lter and cusum. In Discovery Science pages 243--254, 2006.
 
37
K. Stanley. Learning concept drift with a committee of decision trees, 2001.
38
 
39
 
40
41
 
42
 
43


Collaborative Colleagues:
Hila Becker: colleagues
Marta Arias: colleagues