|
ABSTRACT
Outlyingness is a subjective concept relying on the isolation level of a (set of) record(s). Clustering-based outlier detection is a field that aims to cluster data and to detect outliers depending on their characteristics (small, tight and/or dense clusters might be considered as outliers). Existing methods require a parameter standing for the "level of outlyingness", such as the maximum size or a percentage of small clusters, in order to build the set of outliers. Unfortunately, manually setting this parameter in a streaming environment should not be possible, given the fast time response usually needed. In this paper we propose WOD, a method that separates outliers from clusters thanks to a natural and effective principle. The main advantages of WOD are its ability to automatically adjust to any clustering result and to be parameterless.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
E. Aleskerov, B. and Freisleben, and B. Rao. Cardwatch: A neural network based database mining system for credit card fraud detection. In IEEE Computational Intelligence for Financial Engineering, 1997.
|
| |
2
|
|
| |
3
|
L. Ertoz, E. Eilertson, A. Lazarevic, P.-N. Tan, V. Kumar, J. Srivastava, and P. Dokas. Minds - minnesota intrusion detection system. Data Mining - Next Generation Challenges and Future Directions, 2004.
|
| |
4
|
H. Fan, O. R. Zaiane, A. Foss, and J. Wu. A nonparametric outlier detection for effectively discovering top-n outliers from engineering data. In Pacific-Asia conference on knowledge discovery and data mining, 2006.
|
 |
5
|
|
| |
6
|
|
| |
7
|
|
 |
8
|
|
| |
9
|
J. Joshua Oldmeadow, S. Ravinutala, and C. Leckie. Adaptive clustering for network intrusion detection. In 8th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining, volume 3056 of Lecture Notes in Computer Science, pages 255--259, 2004.
|
| |
10
|
|
| |
11
|
H. Kum, J. Pei, W. Wang, and D. Duncan. ApproxMAP: Approximate mining of consensus sequential patterns. In Proceedings of SIAM Int. Conf. on Data Mining, San Francisco, CA, 2003.
|
| |
12
|
|
| |
13
|
S. Papadimitriou, H. Kitagawa, P. B. Gibbons, and C. Faloutsos. LOCI: fast outlier detection using the local correlation integral. In 19th International Conference on Data Engineering, 2003.
|
 |
14
|
|
 |
15
|
|
|