ACM Home Page
Please provide us with feedback. Feedback
PGG: an online pattern based approach for stream variation management
Source Journal of Computer Science and Technology archive
Volume 23 ,  Issue 4  (July 2008) table of contents
Pages 497-515  
Year of Publication: 2008
ISSN:1000-9000
Authors
Lu-An Tang  Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL
Bin Gui  School of Electronics Engineering and Computer Science, Peking University, Beijing, China and Key Laboratory of High Confidence Software Technologies, Peking University, Beijing, China
Hong-Yan Li  School of Electronics Engineering and Computer Science, Peking University, Beijing, China and Key Laboratory of Machine Perception, Peking University, Beijing, China
Gao-Shan Miao  School of Electronics Engineering and Computer Science, Peking University, Beijing, China and Key Laboratory of Machine Perception, Peking University, Beijing, China
Dong-Qing Yang  School of Electronics Engineering and Computer Science, Peking University, Beijing, China and Key Laboratory of High Confidence Software Technologies, Peking University, Beijing, China
Xin-Biao Zhou  School of Electronics Engineering and Computer Science, Peking University, Beijing, China and Key Laboratory of Machine Perception, Peking University, Beijing, China
Publisher
Institute of Computing Technology  Beijing, China
Bibliometrics
Downloads (6 Weeks): n/a,   Downloads (12 Months): n/a,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: 10.1007/s11390-008-9149-4

ABSTRACT

Many database applications require efficient processing of data streams with value variations and fluctuant sampling frequency. The variations typically imply fundamental features of the stream and important domain knowledge of underlying objects. In some data streams, successive events seem to recur in a certain time interval, but the data indeed evolves with tiny differences as time elapses. This feature, so called pseudo periodicity, poses a new challenge to stream variation management. This study focuses on the online management for variations over such streams. The idea can be applied to many scenarios such as patient vital signal monitoring in medical applications. This paper proposes a new method named Pattern Growth Graph (PGG) to detect and manage variations over evolving streams with following features: 1) adopts the wave-pattern to capture the major information of data evolution and represent them compactly; 2) detects the variations in a single pass over the stream with the help of wave-pattern matching algorithm; 3) only stores different segments of the pattern for incoming stream, and hence substantially compresses the data without losing important information; 4) distinguishes meaningful data changes from noise and reconstructs the stream with acceptable accuracy. Extensive experiments on real datasets containing millions of data items, as well as a prototype system, are carried out to demonstrate the feasibility and effectiveness of the proposed scheme.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
3
4
 
5
 
6
7
8
 
9
Varon Joseph, Marik PE. Clinical information systems and the electronic medical record in the intensive care unit. Current Option in Critical Care, 2002, 8(6): 616-624.
 
10
11
 
12
13
 
14
15
16
 
17
18
 
19
 
20
 
21
Hu Z, Li H, Qiu B, Tang L, Fan Y, Liu H, Gao J, Zhou X. Using control theory to guide load shedding in medical data stream management system. In Proc. the 10th Asian Computing Science Conference, Advances in Computer Science, 2005, Kunming, China, LNCS 3818, pp. 236-248.
22
 
23
David J Fraenkel, Melleesa Cowie, Peter Daley. Quality benefits of an intensive care clinical information system. Crit. Care Medi., 2003, 31: 120-125.
 
24
Axel Junger, Achim Michel et al. Evaluation of the suitability of a patient data management system for ICUs on a general ward. International Journal of Medical Informatics, 2001, 64: 57-66.
 
25
Liu Y B, Cai J R, Yin J, Fu W A. Clustering text data streams. Journal of Computer Science and Technology, Jan. 2008, 23(1): 112-128.
 
26
 
27
Chong Z H, Yu J X, Zhang Z J, Lin X M, Wang W, Zhou A Y. Efficient computation of k-medians over data streams under memory constraints. Journal of Computer Science and Technology, Mar. 2006, 21(2): 284-296.
 
28
Chang J H, Lee W S. Effect of count estimation in finding frequent itemsets over online transactional data streams. Journal of Computer Science and Technology, Jan. 2005, 20(1): 63-69.
29
 
30
31
32
 
33
Aggarwal C C. On abnormality detection in spuriously populuted data streams. In Proc. SIAM International Conference on Data Mining, Newport Beach, CA, USA, 2005.
34
 
35
 
36
 
37
38
 
39
Wu H, Sharp G, Salzberg B, Kaeli D, Shirato H, Jiang S. A finite state model for respiratory motion analysis in image guided radiation therapy. Physics in Medicine and Biology (PMB), 2004, 49(23): 5357-5372.
40
41
 
42
Wang H, Pei J. A random method for quantifying changing distributions in data streams. In Proc. the 9th European Conference on Principles and Practice of Knowledge Discovery in Databases (PKDD), Porto, Portugal, October 2005, pp. 684-691.
 
43
 
44
 
45

Collaborative Colleagues:
Lu-An Tang: colleagues
Bin Gui: colleagues
Hong-Yan Li: colleagues
Gao-Shan Miao: colleagues
Dong-Qing Yang: colleagues
Xin-Biao Zhou: colleagues