|
ABSTRACT
A major problem in detecting events in streams of data is that the data can be imprecise (e.g. RFID data). However, current state-ofthe-art event detection systems such as Cayuga [14], SASE [46] or SnoopIB[1], assume the data is precise. Noise in the data can be captured using techniques such as hidden Markov models. Inference on these models creates streams of probabilistic events which cannot be directly queried by existing systems. To address this challenge we propose Lahar1, an event processing system for probabilistic event streams. By exploiting the probabilistic nature of the data, Lahar yields a much higher recall and precision than deterministic techniques operating over only the most probable tuples. By using a novel static analysis and novel algorithms, Lahar processes data orders of magnitude more efficiently than a naïve approach based on sampling. In this paper, we present Lahar's static analysis and core algorithms. We demonstrate the quality and performance of our approach through experiments with our prototype implementation and comparisons with alternate methods.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
S. Arulampalam, S. Maskell, N. Gordon, and T. Clapp. A tutorial on particle filters for on-line non-linear/non-gaussian bayesian tracking. IEEE Transactions on Signal Processing, 50(2):174--188, February 2002.
|
| |
3
|
|
| |
4
|
C. Floerkemeier and M. Lampe. Issues with RFID usage in ubiquitous computing applications. In Proc. of the 2nd Pervasive Conf., April 2004.
|
| |
5
|
Cisco Systems. Cisco IOS NetFlow. http://www.cisco.com/go/netflow.
|
| |
6
|
Computerworld. Procter & Gamble: Wal-Mart RFID effort effective. http://www.computerworld.com/action/article.do?command=viewArticleBasic%&articleId=284160, February 2007.
|
 |
7
|
|
| |
8
|
Robert G. Cowell , Steffen L. Lauritzen , A. Philip David , David J. Spiegelhalter , V. Nair , J. Lawless , M. Jordan, Probabilistic Networks and Expert Systems, Springer-Verlag New York, Inc., Secaucus, NJ, 1999
|
| |
9
|
|
 |
10
|
|
| |
11
|
Dartmouth College. CRAWDAD: A Community Resource for Archiving Wireless Data At Dartmouth. http://crawdad.cs.dartmouth.edu/index.php.
|
| |
12
|
Das, S.K. et al. The role of prediction algorithms in the MavHome smart home architecture. IEEE Wireless Communications, 9(6):77--84, 2002.
|
 |
13
|
|
| |
14
|
A.J. Demers, J. Gehrke, M. Hong, M. Riedewald, and W.M. White. Towards expressive publish/subscribe systems. In EDBT, pages 627--644, 2006.
|
 |
15
|
|
| |
16
|
|
| |
17
|
Garofalakis et. al. Probabilistic data management for pervasive computing: The Data Furnace project. IEEE Data Engineering Bulletin, 29(1), March 2006.
|
| |
18
|
|
 |
19
|
|
| |
20
|
|
 |
21
|
|
| |
22
|
B. Kanagal and A. Deshpande. Online filtering, smoothing and probabilistic modeling of streaming data. Technical Report CS-TR-4867, University of Maryland, May 2007.
|
 |
23
|
|
 |
24
|
Mike Klaas , Mark Briers , Nando de Freitas , Arnaud Doucet , Simon Maskell , Dustin Lang, Fast particle smoothing: if I had a million particles, Proceedings of the 23rd international conference on Machine learning, p.481-488, June 25-29, 2006, Pittsburgh, Pennsylvania
[doi> 10.1145/1143844.1143905]
|
| |
25
|
M. Lamming and D. Bohm. SPECs: Another approach to human context and activity sensing research, using tiny peer-to-peer wireless computers. In Ubicomp 2003, volume 2864, pages 192--199, 2003.
|
| |
26
|
S. E. Levinson, L. R. Rabiner, and M. M. Sondhi. An introduction to the application of the theory of probabilistic functions of a Markov process to automatic speech recognition. Bell Sys. Tech. J., 62:1035, 1983.
|
| |
27
|
|
| |
28
|
|
 |
29
|
|
| |
30
|
|
| |
31
|
|
| |
32
|
Matthai Philipose , Kenneth P. Fishkin , Mike Perkowitz , Donald J. Patterson , Dieter Fox , Henry Kautz , Dirk Hahnel, Inferring Activities from Interactions with Objects, IEEE Pervasive Computing, v.3 n.4, p.50-57, October 2004
[doi> 10.1109/MPRV.2004.7]
|
| |
33
|
J. S. Provan and M. O. Ball. The complexity of counting cuts and of computing the probability that a graph is connected. SIAM J. Comput., 12(4):777--788, 1983.
|
| |
34
|
|
| |
35
|
C. Ré, J. Letchner, M. Balazinska, and D. Suciu. Event queries on correlated probabilistic streams (full version). Technical Report 08-03-02, University of Washington, Seattle, WA, 2008.
|
| |
36
|
|
| |
37
|
Reuters. Stock Data Feed. http://about.reuters.com/productinfo/s/stock_data_feed/.
|
| |
38
|
RFID Journal. Hospital gets ultra-wideband RFID. http://www.rfidjournal.com/article/view/1088/1/1, August 2004.
|
| |
39
|
Crossbow Technology. Products: Wireless sensor networks. http://www.xbow.com/Products/wproductsoverview.aspx.
|
| |
40
|
University of Washington. RFID Ecosystem. http://rfid.cs.washington.edu/.
|
| |
41
|
G. Virone, A. Wood, L. Selavo, Q. Cao, L. Fang, T. Doan, Z. He, R. Stoleru, S. Lin, and J.A. Stankovic. An assisted living oriented information system based on a residential wireless sensor network. In 1st Distributed Diagnosis and Home Healthcare (D2H2) Conference, April 2006.
|
| |
42
|
A. J. Viterbi. Error bounds for convolutional codes and an asymptotically optimum decoding algorithm. IEEE Transactions on Information Theory, IT-13:260--269, 1967.
|
| |
43
|
|
| |
44
|
|
 |
45
|
|
 |
46
|
|
CITED BY 6
|
|
Evan Welbourne , Karl Koscher , Emad Soroush , Magdalena Balazinska , Gaetano Borriello, Longitudinal study of a building-scale RFID ecosystem, Proceedings of the 7th international conference on Mobile systems, applications, and services, June 22-25, 2009, Kraków, Poland
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|