ACM Home Page
Please provide us with feedback. Feedback
Indexing density models for incremental learning and anytime classification on data streams
Full text PdfPdf (928 KB)
Source Extending Database Technology; Vol. 360 archive
Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology table of contents
Saint Petersburg, Russia
SESSION: Research sessions: Stream processing table of contents
Pages 311-322  
Year of Publication: 2009
ISBN:978-1-60558-422-5
Authors
Thomas Seidl  RWTH Aachen University, Germany
Ira Assent  Aalborg University, Denmark
Philipp Kranen  RWTH Aachen University, Germany
Ralph Krieger  RWTH Aachen University, Germany
Jennifer Herrmann  RWTH Aachen University, Germany
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 11,   Downloads (12 Months): 79,   Citation Count: 1
Additional Information:

abstract   references   cited by   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1516360.1516397
What is a DOI?

ABSTRACT

Classification of streaming data faces three basic challenges: it has to deal with huge amounts of data, the varying time between two stream data items must be used best possible (anytime classification) and additional training data must be incrementally learned (anytime learning) for applying the classifier consistently to fast data streams. In this work, we propose a novel index-based technique that can handle all three of the above challenges using the established Bayes classifier on effective kernel density estimators. Our novel Bayes tree automatically generates (adapted efficiently to the individual object to be classified) a hierarchy of mixture densities that represent kernel density estimators at successively coarser levels. Our probability density queries together with novel classification improvement strategies provide the necessary information for very effective classification at any point of interruption. Moreover, we propose a novel evaluation method for anytime classification using Poisson streams and demonstrate the anytime learning performance of the Bayes tree.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
D. Andre and P. Stone. Physiological data modeling contest (ICML-2004): http://www.cs.utexas.edu/users/pstone/workshops/2004icml/, 2004.
 
2
T. Bayes. An essay towards solving a problem in the doctrine of chances. Philosophical Transactions of the Royal Society, 53:370--418, 1763.
3
 
4
C. Böhm, A. Pryakhin, and M. Schubert. The Gauss-Tree: Efficient Object Identification in Databases of Probabilistic Feature Vectors. ICDE, 2006.
 
5
 
6
 
7
 
8
K. Crammer, J. S. Kandola, and Y. Singer. Online classification on a budget. In NIPS, 2003.
 
9
10
 
11
P. Domingos and G. Hulten. Learning from infinite data in finite time. In NIPS, pages 673--680, 2001.
 
12
13
 
14
A. G. Gray and A. W. Moore. Nonparametric density estimation: Toward computational tractability. In SDM, 2003.
15
 
16
T. Hastie, R. Tibshirani, and J. H. Friedman. Datasets for "The Elements of Statistical Learning": http://www-stat.stanford.edu/~tibs/elemstatlearn/.
 
17
T. Hastie, R. Tibshirani, and J. H. Friedman. The Elements of Statistical Learning. Springer, 2002.
 
18
S. Hettich and S. Bay. The UCI KDD archive http://kdd.ics.uci.edu, 1999.
19
 
20
M. Jordan and R. Jacobs. Hierarchical Mixtures of Experts and the EM Algorithm. Graphical Models: Foundations of Neural Computation, 2001.
 
21
22
 
23
 
24
 
25
T. Seidl. Nearest Neighbor Classification / Liu L., Özsu M. T. (eds.): Encyclopedia of Database Systems. (to appear). Springer, 2009.
26
 
27
L. Silva, J. M. de Sa, and L. Alexandre. Neural network classification using shannonŠs entropy. In ESANN, 2005.
 
28
B. Silverman. Density Estimation for Statistics and Data Analysis. 1986.
29
 
30
 
31
W. Wahlster. Verbmobil: Foundations of Speech-To-Speech Translation. Springer, 2000.
32
 
33
 
34
35
 
36
S. Zilberstein. Using anytime algorithms in intelligent systems. The AI magazine, 17(3):73--83, 1996.

Collaborative Colleagues:
Thomas Seidl: colleagues
Ira Assent: colleagues
Philipp Kranen: colleagues
Ralph Krieger: colleagues
Jennifer Herrmann: colleagues