ACM Home Page
Please provide us with feedback. Feedback
Anomalous window discovery through scan statistics for linear intersecting paths (SSLIP)
Full text MovMov (19:12),  PdfPdf (669 KB)
Source
International Conference on Knowledge Discovery and Data Mining archive
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining table of contents
Paris, France
SESSION: Research track papers table of contents
Pages 767-776  
Year of Publication: 2009
ISBN:978-1-60558-495-9
Authors
Lei Shi  University of Maryland, Baltimore County, Baltimore, MD, USA
Vandana P. Janeja  University of Maryland, Baltimore County, Baltimore, MD, USA
Sponsors
ACM: Association for Computing Machinery
SIGKDD: ACM Special Interest Group on Knowledge Discovery in Data
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 39,   Downloads (12 Months): 113,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1557019.1557104
What is a DOI?

ABSTRACT

Anomalous windows are the contiguous groupings of data points. In this paper, we propose an approach for discovering anomalous windows using Scan Statistics for Linear Intersecting Paths (SSLIP). A linear path refers to a path represented by a line with a single dimensional spatial coordinate marking an observation point. Our approach for discovering anomalous windows along linear paths comprises of the following distinct steps: (a) Cross Path Discovery: where we identify a subset of intersecting paths to be considered, (b) Anomalous Window Discovery: where we outline three order invariant algorithms, namely SSLIP, Brute Force-SSLIP and Central Brute Force-SSLIP, for the traversal of the cross paths to identify varying size directional windows along the paths. For identifying an anomalous window we compute an unusualness metric, in the form of a likelihood ratio to indicate the degree of unusualness of this window with respect to the rest of the data. We identify the window with the highest likelihood ratio as our anomalous window, and (c) Monte Carlo Simulations: to ascertain whether this window is truly anomalous and not just a random occurrence we perform hypothesis testing by computing a p-value using Monte Carlo Simulations. We present extensive experimental results in real world accident datasets for various highways with known issues(code and data available from [27], [21]). Our results show that our approach indeed is effective in identifying anomalous traffic accident windows along multiple intersecting highways.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
2001 state data for fatalities relating to roadway, pedestrian and large trucks, 2001.
 
2
V. Barnett and T. Lewis. Outliers in Statistical Data. John Wiley and Sons, 3rd edition, 1994.
 
3
J. Besag and J. Newell. The detection of clusters in rare diseases. Jrnl. of the Royal Statistical Society, 154:143---155, 1991.
 
4
L. Duczmal and A. Renato. A simulated annealing strategy for the detection of arbitrarily shaped spatial clusters. Computational Statistics and Data Analysis, 45(2):269--286, 2004.
 
5
M. Ester, H. P. Kriegel, J. Sander, and X. Xu. A density-based algorithm for discovering clusters in large spatial databases. In Proc. of the 2nd Intl. Conference on Knowledge Discovery and Data Mining, pages 44--49, U.S.A., 1996. AAAI Press.
 
6
A. Getis. Reflections on spatial autocorrelation. Regional Science and Urban Economics, 37(4):491--496, 2007.
 
7
J. Glaz, J. Naus, and S. Wallenstein. Scan Statistics. Springer Verlag Series in Statistics, New York, 2001.
 
8
D. Griffith. Spatial Autocorrelation: A Primer. Association of American Geographers, 1987.
 
9
R. Haining. Spatial Data Analysis: Theory and Practice. Cambridge University Press, 2003.
 
10
Highway research and technology, the need for greater inverstment, 1999.
11
 
12
13
 
14
 
15
M. Kulldorff. A spatial scan statistic. Communications of Statistics - Theory Meth., 26(6):1481--1496, 1997.
 
16
M. Kulldorff, W. Athas, E. Feuer, B. Miller, and C. Key. Evaluating cluster alarms: A space-time scan statistic and brain cancer in los alamos. American Journal of Public Health, 88(9):1377--1380, 1998.
 
17
H. J. Miller. Tobler's first law and spatial analysis. Annals of the Assoc. of American Geographers, 94(2):284--89, 2004.
 
18
S. Mohammadi, V. Janeja, and A. Gangopadhyay. Discretized spatio-temporal scan window. In The ninth SIAM International conference on Data Mining, 2009.
 
19
J. Naus. The distribution of the size of the maximum cluster of points on the line. Journal of the American Statistical Association 60, pages 532--538, 1965.
 
20
D. Neill, A. Moore, F. Pereira, and T. Mitchell. Detecting significant multidimensional spatial clusters. In Advances in Neural Information Processing Systems 17, pages 969--976. MIT Press, 2005.
 
21
New Jersey accident data for state routes. http://www.state.nj.us/transportation/refdata/accident/, 1999.
 
22
National motor vehicle crash causation survey(dot-hs-811-057), 2008.
 
23
National highway traffic safety administration(nhtsa), traffic safety facts, dot hs 810 809, 2006.
 
24
Nj state routes straight line diagrams, 2009.
 
25
S. Openshaw. A mark 1 geographical analysis machine for the automated analysis of point data sets. Intl. Journal of GIS, 1(4):335--358, 1987.
 
26
 
27
SSLIP : Code
 
28
 
29
T. Tango and K. Takahashi. A flexibly shaped spatial scan statistic for detecting clusters. Intl. Journal of Health Geographics, 4(11), 2005.
 
30
W. Tobler. A computer model simulation of urban growth in the detroit region. Eco. Geography, 46(2):234--240, 1970.
 
31
Njdot, top 100 intersection crash locations in nj, 1998-1999, accessed 2004.
 
32
New Jersey Safe Corridor Program http://www.ite.org/meetcon/2007AM/Session_17_Alfred%20Kotchi.pdf, 2007.

Collaborative Colleagues:
Lei Shi: colleagues
Vandana P. Janeja: colleagues