ACM Home Page
Please provide us with feedback. Feedback
LOF: identifying density-based local outliers
Full text PdfPdf (264 KB)
Source International Conference on Management of Data archive
Proceedings of the 2000 ACM SIGMOD international conference on Management of data table of contents
Dallas, Texas, United States
Pages: 93 - 104  
Year of Publication: 2000
ISBN:1-58113-217-4
Also published in ...
Authors
Markus M. Breunig  Institute for Computer Science, University of Munich, Oettingenstr. 67, D-80538 Munich, Germany
Hans-Peter Kriegel  Institute for Computer Science, University of Munich, Oettingenstr. 67, D-80538 Munich, Germany
Raymond T. Ng  Department of Computer Science, University of British Columbia, Vancouver, BC V6T 1Z4 Canada
Jörg Sander  Institute for Computer Science, University of Munich, Oettingenstr. 67, D-80538 Munich, Germany
Sponsor
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 31,   Downloads (12 Months): 177,   Citation Count: 105
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/342009.335388
What is a DOI?

ABSTRACT

For many KDD applications, such as detecting criminal activities in E-commerce, finding the rare instances or the outliers, can be more interesting than finding the common patterns. Existing work in outlier detection regards being an outlier as a binary property. In this paper, we contend that for many scenarios, it is more meaningful to assign to each object a degree of being an outlier. This degree is called the local outlier factor (LOF) of an object. It is local in that the degree depends on how isolated the object is with respect to the surrounding neighborhood. We give a detailed formal analysis showing that LOF enjoys many desirable properties. Using real-world datasets, we demonstrate that LOF can be used to find outliers which appear to be meaningful, but can otherwise not be identified with existing approaches. Finally, a careful performance evaluation of our algorithm confirms we show that our approach of finding local outliers can be practical.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Aming, A., Agrawal R., Raghavan R: "A Linear Method for Deviation Detection in Large Databases", Proc. 2rid Int. Conf. on Knowledge Discovery and Data Mining, Portland, OR, AAAI Press, 1996, p. 164-169.
2
3
 
4
 
5
Barnett V., Lewis T.: "Outliers in statistical data", John Wiley, 1994.
 
6
DuMouchel W., Schonlau M.: "A Fast Computer Intrusion Detection Algorithm based on Hypothesis Testing of Command Transition Probabilities", Proc. 4th Int. Conf. on Knowledge Discovery and Data Mining, New York, NY, AAAI Press, 1998, pp. 189-193.
 
7
Ester M., Kriegel H.-E, Sander J., Xu X.: "A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise", Proc. 2rid Int. Conf. on Knowledge Discovery and Data Mining, Portland, OR, AAAI Press, 1996, pp. 226-231.
 
8
 
9
Fayyad U., Piatetsky-Shapiro G., Smyth R: "Knowledge Discovery and Data Mining: Towards a Unifying Framework", Proc. 2rid Int. Conf. on Knowledge Discovery and Data Mining, Portland, OR, 1996, pp. 82-88.
 
10
Hawkins, D.: "Identification of Outliers", Chapman and Hall, London, 1980.
 
11
Hinneburg A., Keim D.A.: "An Efficient Approach to Clustering in Large Multimedia Databases with Noise", Proc. 4th Int. Conf. on Knowledge Discovery and Data Mining, New York City, NY, 1998,pp. 58-65.
 
12
Johnson T., Kwok I., Ng R.: "Fast Computation of 2- Dimensional Depth Contours", Proc. 4th Int. Conf. on Knowledge Discovery and Data Mining, New York, NY, AAAI Press, 1998, pp. 224-228.
 
13
 
14
 
15
 
16
17
 
18
 
19
 
20
Tukey J. W.: "Exploratory Data Analysis", Addison-Wesley, 1977.
 
21
 
22
23

CITED BY  108

Collaborative Colleagues:
Markus M. Breunig: colleagues
Hans-Peter Kriegel: colleagues
Raymond T. Ng: colleagues
Jörg Sander: colleagues