ACM Home Page
Please provide us with feedback. Feedback
SPC: a distributed, scalable platform for data mining
Full text PdfPdf (540 KB)
Source
International Conference on Knowledge Discovery and Data Mining archive
Proceedings of the 4th international workshop on Data mining standards, services and platforms table of contents
Philadelphia, Pennsylvania
Pages: 27 - 37  
Year of Publication: 2006
ISBN:1-59593-443-X
Authors
Lisa Amini  IBM T. J. Watson Research Center, Hawthorne, NY
Henrique Andrade  IBM T. J. Watson Research Center, Hawthorne, NY
Ranjita Bhagwan  IBM T. J. Watson Research Center, Hawthorne, NY
Frank Eskesen  IBM T. J. Watson Research Center, Hawthorne, NY
Richard King  IBM T. J. Watson Research Center, Hawthorne, NY
Philippe Selo  IBM T. J. Watson Research Center, Hawthorne, NY
Yoonho Park  IBM T. J. Watson Research Center, Hawthorne, NY
Chitra Venkatramani  IBM T. J. Watson Research Center, Hawthorne, NY
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 18,   Downloads (12 Months): 127,   Citation Count: 6
Additional Information:

abstract   references   cited by   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1289612.1289615
What is a DOI?

ABSTRACT

The Stream Processing Core (SPC) is distributed stream processing middleware designed to support applications that extract information from a large number of digital data streams. In this paper, we describe the SPC programming model which, to the best of our knowledge, is the first to support stream-mining applications using a subscription-like model for specifying stream connections as well as to provide support for non-relational operators. This enables stream-mining applications to tap into, analyze and track an ever-changing array of data streams which may contain information relevant to the streaming-queries placed on it. We describe the design, implementation, and experimental evaluation of the SPC distributed middleware, which deploys applications on to the running system in an incremental fashion, making stream connections as required. Using micro-benchmarks and a representative large-scale synthetic stream-mining application, we evaluate the performance of the control and data paths of the SPC middleware.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
D. J. Abadi, Y. Ahmad, M. Balazinska, U. Cetintemel, M. Cherniack, J.-H. Hwang, W. Lindner, A. S. Maskey, A. Rasin, E. Ryvkina, N. Tatbul, Y. Xing, and S. Zdonik. The design of the borealis stream processing engine. In Proceedings of the 2005 Conference on Innovative Data Systems Research (CIDR 2003), Asilomar, CA, 2005.
 
3
 
4
A. Arasu, B. Babcock, S. Babu, J. Cieslewicz, M. Datar, R. Motwani, U. Srivastava, and J. Widom. STREAM: The stanford data stream management system (demonstration description). To appear in a book on data stream management edited by Garofalakis, Gehrke and Rastogi, 2004.
5
 
6
 
7
 
8
S. Chandrasekaran, O. Cooper, A. Deshpande, M. J. Franklin, J. M. Hellerstein, W. Hong, S. Krishnamurthy, S. Madden, V. Raman, F. Reiss, and M. Shah. TelegraphCQ: Continuous dataflow processing for an uncertain world. In Proceedings of the 2003 Conference on Innovative Data Systems Research (CIDR 2003), Asilomar, CA, 2003.
 
9
M. Cherniack, H. Balakrishnan, M. Balazinska, D. Carney, U. Cetintemel, Y. Xing, and S. Zdonik. Scalable distributed stream processing. In Proceedings of the 2003 Conference on Innovative Data Systems Research (CIDR 2003), Asilomar, CA, January 2003.
 
10
H. Cunningham, D. Maynard, K. Bontcheva, and V. Tablan. GATE: A framework and graphical development environment for robust NLP tools and applications. In Proceedings of the 40th Anniversary Meeting of the Association for Computational Linguistics, Philadelphia, PA, July 2002.
 
11
 
12
M. Hapner, R. Burridge, R. Sharma, and J. Fialli. Java message service -- version 1.0.2b, August 2001. Sun Microsystems.
13
14
 
15
C.-Y. Lin, O. Verscheure, and L. Amini. Videodig project. http://www.research.ibm.com/VideoDIG.
 
16
C.-Y. Lin, O. Verscheure, and L. Amini. Semantic routing and filtering for large-scale video streams monitoring. In Proceedings of the IEEE International Conference on Multimedia and Expo (ICME 2005), Amsterdam, Netherlands, July 2005.
17
18
 
19
G. Swint, G. Jung, and C. Pu. Event-based QoS for a distributed continual query system. In Proceedings of the 2005 IEEE International Conference on Information Reuse and Integration (IRI 2005), Las Vegas, NV, August 2005.

CITED BY  6
Collaborative Colleagues:
Lisa Amini: colleagues
Henrique Andrade: colleagues
Ranjita Bhagwan: colleagues
Frank Eskesen: colleagues
Richard King: colleagues
Philippe Selo: colleagues
Yoonho Park: colleagues
Chitra Venkatramani: colleagues