ACM Home Page
Please provide us with feedback. Feedback
On demand classification of data streams
Full text PdfPdf (160 KB)
Source International Conference on Knowledge Discovery and Data Mining archive
Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining table of contents
Seattle, WA, USA
POSTER SESSION: Research track posters table of contents
Pages: 503 - 508  
Year of Publication: 2004
ISBN:1-58113-888-1
Authors
Charu C. Aggarwal  IBM T. J. Watson Research Center
Jiawei Han  UIUC
Jianyong Wang  UIUC
Philip S. Yu  IBM T. J. Watson Research Center
Sponsors
SIGMOD: ACM Special Interest Group on Management of Data
SIGKDD: ACM Special Interest Group on Knowledge Discovery in Data
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 11,   Downloads (12 Months): 127,   Citation Count: 18
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1014052.1014110
What is a DOI?

ABSTRACT

Current models of the classification problem do not effectively handle bursts of particular classes coming in at different times. In fact, the current model of the classification problem simply concentrates on methods for one-pass classification modeling of very large data sets. Our model for data stream classification views the data stream classification problem from the point of view of a dynamic approach in which simultaneous training and testing streams are used for dynamic classification of data sets. This model reflects real life situations effectively, since it is desirable to classify test streams in real time over an evolving training and test stream. The aim here is to create a classification system in which the training model can adapt quickly to the changes of the underlying data stream. In order to achieve this goal, we propose an on-demand classification process which can dynamically select the appropriate window of past training data to build the classifier. The empirical results indicate that the system maintains a high classification accuracy in an evolving data stream, while providing an efficient solution to the classification task.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
C. C. Aggarwal, J. Han, J.Wang, P. Yu. CluStream: A Framework for Clustering Evolving Data Streams. VLDB Conference, 2003.
2
3
4
 
5
6
7

CITED BY  18

Collaborative Colleagues:
Charu C. Aggarwal: colleagues
Jiawei Han: colleagues
Jianyong Wang: colleagues
Philip S. Yu: colleagues