ACM Home Page
Please provide us with feedback. Feedback
Parameterless outlier detection in data streams
Full text PdfPdf (479 KB)
Source
Symposium on Applied Computing archive
Proceedings of the 2009 ACM symposium on Applied Computing table of contents
Honolulu, Hawaii
SESSION: Data streams track table of contents
Pages 1491-1495  
Year of Publication: 2009
ISBN:978-1-60558-166-8
Authors
Alice Marascu  INRIA, route des lucioles - BP
Florent Masseglia  INRIA, route des lucioles - BP
Sponsor
SIGAPP: ACM Special Interest Group on Applied Computing
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 8,   Downloads (12 Months): 78,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1529282.1529615
What is a DOI?

ABSTRACT

Outlyingness is a subjective concept relying on the isolation level of a (set of) record(s). Clustering-based outlier detection is a field that aims to cluster data and to detect outliers depending on their characteristics (small, tight and/or dense clusters might be considered as outliers). Existing methods require a parameter standing for the "level of outlyingness", such as the maximum size or a percentage of small clusters, in order to build the set of outliers. Unfortunately, manually setting this parameter in a streaming environment should not be possible, given the fast time response usually needed. In this paper we propose WOD, a method that separates outliers from clusters thanks to a natural and effective principle. The main advantages of WOD are its ability to automatically adjust to any clustering result and to be parameterless.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
E. Aleskerov, B. and Freisleben, and B. Rao. Cardwatch: A neural network based database mining system for credit card fraud detection. In IEEE Computational Intelligence for Financial Engineering, 1997.
 
2
 
3
L. Ertoz, E. Eilertson, A. Lazarevic, P.-N. Tan, V. Kumar, J. Srivastava, and P. Dokas. Minds - minnesota intrusion detection system. Data Mining - Next Generation Challenges and Future Directions, 2004.
 
4
H. Fan, O. R. Zaiane, A. Foss, and J. Wu. A nonparametric outlier detection for effectively discovering top-n outliers from engineering data. In Pacific-Asia conference on knowledge discovery and data mining, 2006.
5
 
6
 
7
8
 
9
J. Joshua Oldmeadow, S. Ravinutala, and C. Leckie. Adaptive clustering for network intrusion detection. In 8th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining, volume 3056 of Lecture Notes in Computer Science, pages 255--259, 2004.
 
10
 
11
H. Kum, J. Pei, W. Wang, and D. Duncan. ApproxMAP: Approximate mining of consensus sequential patterns. In Proceedings of SIAM Int. Conf. on Data Mining, San Francisco, CA, 2003.
 
12
 
13
S. Papadimitriou, H. Kitagawa, P. B. Gibbons, and C. Faloutsos. LOCI: fast outlier detection using the local correlation integral. In 19th International Conference on Data Engineering, 2003.
14
15

Collaborative Colleagues:
Alice Marascu: colleagues
Florent Masseglia: colleagues