|
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
L. Breiman, J. H. Ftiedman, R. A. Olshen, and C. J. Stone. Classification and Regression Trees. Wadsworth, Belmont, CA, 1984.
|
| |
2
|
J. Catett. Megainduction: Machine Learning on Very Large Databases. PhD thesis, Basset Department of Computer Science, University of Sydney, Sydney, AustrMia, 1991.
|
 |
3
|
|
| |
4
|
|
 |
5
|
Johannes Gehrke , Venkatesh Ganti , Raghu Ramakrishnan , Wei-Yin Loh, BOAT—optimistic decision tree construction, Proceedings of the 1999 ACM SIGMOD international conference on Management of data, p.169-180, May 31-June 03, 1999, Philadelphia, Pennsylvania, United States
|
| |
6
|
J. Gratch. Sequential inductive learning. In Proceedings of the Thireeenth National Conference on Artificial fntelligence, pages 779 786, Portland, OR, 1996. AAAI Press.
|
| |
7
|
W. Hoeffding. Probability inequalities for sums of bounded random variables. Journal of the American Statistical Association, 58:13 30, 1963.
|
| |
8
|
|
| |
9
|
O. Maron and A. Moore. Hoeffding races: Accelerating model selection search for classification and function approximation. In J. D. Cowan, G. Tesauro, and J. Alspector, editors, Advances in Neural fnformation Processing Systems 6. Morgan Kaufmann, San Mateo, CA, 1994.
|
| |
10
|
|
| |
11
|
R. G. Miller, Jr. Simultaneous Statistical fnference. Springer, New York, NY, 2nd edition, 1981.
|
| |
12
|
A. W. Moore and M. S. Lee. Efficient algorithms for minimizing cross validation error. In Proceedings of the Eleventh fnterrtational Conference on Machine Learning, pages 190 198, New Brunswick, NO, 1994. Morgan Kaufmann.
|
| |
13
|
R. Musick, J. Catlett, and S. Russell. Decision theoretic subsampling for induction on large databases. In Proceedings of the Tenth fnterrtational Conference on Machine Learning, pages 212 219, Amherst, MA, 1993. Morgan Kauflnann.
|
 |
14
|
Foster Provost , David Jensen , Tim Oates, Efficient progressive sampling, Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining, p.23-32, August 15-18, 1999, San Diego, California, United States
[doi> 10.1145/312129.312188]
|
| |
15
|
|
| |
16
|
J. R. Quinlan and R. M. Cameron-Jones. Oversearching and layered search in empirical learning. In Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence, pages 1019 1024, Montreal, Canada, 1995. Morgan Kaufmann.
|
| |
17
|
|
| |
18
|
P. Smyth and D. Wolpert. Anytime exploratory data anMysis for massive data sets. In Proceedings of the Third Interrtational Conference on Knowledge Discovery and Data Mining, pages 5&60, Newport Beach, CA, 1997. AAAI Press.
|
| |
19
|
|
| |
20
|
|
| |
21
|
P. E. Utgoff. An improved algorithm for incremental induction of decision trees. In Proceedings of the Eleventh International Conference on Machine Learning, pages 318 325, New Brunswick, NJ, 1994. Morgan Kaufmann.
|
| |
22
|
G. L Webb. OPUS: An efiqcient admissible algorithm for unordered search. Journal of Artificial Intelligence Research, 3:431 465, 1995.
|
| |
23
|
A. Wolman, G. Voelker, N. Sharma, N. Cardwell, M. Brown, T. Landray, D. Pinnel, A. KaHin, and H. Levy. Organization-based analysis of Web-object sharing and caching. In Proceedings of the Second USENIX Conference on Interrtet Technologies and Systems, pages 25-36, Boulder, CO, 1999.
|
CITED BY 106
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Brain Babcock , Mayur Datar , Rajeev Motwani , Liadan O'Callaghan, Maintaining variance and k-medians over data stream windows, Proceedings of the twenty-second ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, p.234-243, June 09-11, 2003, San Diego, California
|
|
|
Yabo Xu , Ke Wang , Ada Wai-Chee Fu , Rong She , Jian Pei, Classification spanning correlated data streams, Proceedings of the 15th ACM international conference on Information and knowledge management, November 06-11, 2006, Arlington, Virginia, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Zhiyuan Chen , Chen Li , Jian Pei , Yufei Tao , Haixun Wang , Wei Wang , Jiong Yang , Jun Yang , Donghui Zhang, Recent progress on selected topics in database research: a report by nine young Chinese researchers working in the United States, Journal of Computer Science and Technology, v.18 n.5, p.538-552, September 2003
|
|
|
|
|
|
|
|
|
Mohamed Medhat Gaber , Shonali Krishnaswamy , Arkady Zaslavsky, Cost-efficient mining techniques for data streams, Proceedings of the second workshop on Australasian information security, Data Mining and Web Intelligence, and Software Internationalisation, p.109-114, January 01, 2004, Dunedin, New Zealand
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Charu C. Aggarwal , Jiawei Han , Jianyong Wang , Philip S. Yu, On demand classification of data streams, Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining, August 22-25, 2004, Seattle, WA, USA
|
|
|
|
|
|
Graham Cormode , Theodore Johnson , Flip Korn , S. Muthukrishnan , Oliver Spatscheck , Divesh Srivastava, Holistic UDAFs at streaming speeds, Proceedings of the 2004 ACM SIGMOD international conference on Management of data, June 13-18, 2004, Paris, France
|
|
|
|
|
|
|
|
|
Brian Babcock , Shivnath Babu , Mayur Datar , Rajeev Motwani , Jennifer Widom, Models and issues in data stream systems, Proceedings of the twenty-first ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, June 03-05, 2002, Madison, Wisconsin
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Haixun Wang , Jian Yin , Jian Pei , Philip S. Yu , Jeffrey Xu Yu, Suppressing model overfitting in mining concept-drifting data streams, Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining, August 20-23, 2006, Philadelphia, PA, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Charu C. Aggarwal , Jiawei Han , Jianyong Wang , Philip S. Yu, A framework for projected clustering of high dimensional data streams, Proceedings of the Thirtieth international conference on Very large data bases, p.852-863, August 31-September 03, 2004, Toronto, Canada
|
|
|
Charu C. Aggarwal , Jiawei Han , Jianyong Wang , Philip S. Yu, A framework for clustering evolving data streams, Proceedings of the 29th international conference on Very large data bases, p.81-92, September 09-12, 2003, Berlin, Germany
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Lior Cohen , Gil Avrahami-Bakish , Mark Last , Abraham Kandel , Oscar Kipersztok, Real-time data mining of non-stationary data streams from sensor networks, Information Fusion, v.9 n.3, p.344-353, July, 2008
|
|
|
|
|
|
Lei Chang , Tengjiao Wang , Dongqing Yang , Hua Luan , Shiwei Tang, Efficient algorithms for incremental maintenance of closed sequential patterns in large databases, Data & Knowledge Engineering, v.68 n.1, p.68-106, January, 2009
|
|
|
|
|
|
Larry Shoemaker , Robert E. Banfield , Lawrence O. Hall , Kevin W. Bowyer , W. Philip Kegelmeyer, Using classifier ensembles to label spatially disjoint data, Information Fusion, v.9 n.1, p.120-133, January, 2008
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Mohammad M. Masud , Jing Gao , Latifur Khan , Jiawei Han , Bhavani Thuraisingham, Peer to peer botnet detection for cyber-security: a data mining approach, Proceedings of the 4th annual workshop on Cyber security and informaiton intelligence research: developing strategies to meet the cyber security and information intelligence challenges ahead, May 12-14, 2008, Oak Ridge, Tennessee
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Tiancheng Zhang , Dejun Yue , Yu Gu , Yi Wang , Ge Yu, Adaptive correlation analysis in stream time series with sliding windows, Computers & Mathematics with Applications, v.57 n.6, p.937-948, March, 2009
|
|
|
|
|
|
|
|
|
Thomas Seidl , Ira Assent , Philipp Kranen , Ralph Krieger , Jennifer Herrmann, Indexing density models for incremental learning and anytime classification on data streams, Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology, March 24-26, 2009, Saint Petersburg, Russia
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Bin Cao , Dou Shen , Jian-Tao Sun , Xuanhui Wang , Qiang Yang , Zheng Chen, Detect and track latent factors with online nonnegative matrix factorization, Proceedings of the 20th international joint conference on Artifical intelligence, p.2689-2694, January 06-12, 2007, Hyderabad, India
|
|
|
Albert Bifet , Geoff Holmes , Bernhard Pfahringer , Richard Kirkby , Ricard Gavaldà, New ensemble methods for evolving data streams, Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining, June 28-July 01, 2009, Paris, France
|
INDEX TERMS
Primary Classification:
H.
Information Systems
H.2
DATABASE MANAGEMENT
H.2.8
Database applications
Subjects:
Data mining
Additional Classification:
G.
Mathematics of Computing
G.2
DISCRETE MATHEMATICS
G.2.2
Graph Theory
Subjects:
Trees
H.
Information Systems
H.4
INFORMATION SYSTEMS APPLICATIONS
H.4.2
Types of Systems
Subjects:
Decision support (e.g., MIS)
I.
Computing Methodologies
I.2
ARTIFICIAL INTELLIGENCE
I.2.6
Learning
Subjects:
Concept learning
I.5
PATTERN RECOGNITION
I.5.2
Design Methodology
Subjects:
Classifier design and evaluation
General Terms:
Algorithms,
Design,
Management,
Measurement,
Performance,
Theory,
Verification
Keywords:
Hoeffding bounds,
decision trees,
disk-based algorithms,
incremental learning,
subsampling
|