|
ABSTRACT
Content-based dissemination of XML data using the publish-subscribe paradigm is an effective means to deliver relevant data to interested data consumers. To meet the performance challenges of content-based filtering and routing, two key optimizations have been developed: the use of efficient indexes to speed up subscription filtering, and the use of effective aggregation algorithms to reduce the number of subscriptions. The effectiveness of both these techniques are, however, limited to locally improving the performance of individual routers. In this paper, we propose a novel and holistic optimization approach that allows a downstream router to leverage the subscription matchings done by upstream routers to reduce its own filtering work. This is achieved by piggybacking useful annotations to the XML document being forwarded. We explore several design options and tradeoffs of this novel optimization approach. Our experimental results demonstrate that our piggyback optimization achieves significant performance improvement under various conditions.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
A. L. Diaz, D. Lovell (1999) XML Generator. http://www.alphaworks.ibm.com/tech/xmlgenerator.
|
| |
2
|
DBLP. http://www.acm.org/sigmod/dblp/db/about/dblp.dtd.
|
| |
3
|
NS2. http://www.isi.edu/nsnam/ns/.
|
| |
4
|
Protein. http://pir.georgetown.edu.
|
| |
5
|
R. Cover (1999) The SGML/XML web page. http://www.oasis.open.org/cover/sgml-ml.html.
|
| |
6
|
Treebank. http://www.cis.upenn.edu/~treebank/.
|
| |
7
|
M. K. Aguilera, R. E. Strom, D. C. Sturman, M. Astley, and T. D. Chandra. Matching events in a content-based subscription system. In PODC, 1999.
|
| |
8
|
M. Altinel and M. Franklin. Efficient filtering of XML documents for selective dissemination of information. In VLDB, 2000.
|
| |
9
|
N. Bruno, L. Gravano, N. Koudas, and D. Srivastava. Navigation- vs. index-based XML multi-query processing. In ICDE, 2003.
|
 |
10
|
|
| |
11
|
C. Y. Chan, W. Fan, P. Felber, M. Garofalakis, and R. Rastogi. Tree pattern aggregation for scalable XML data dissemination. In VLDB, 2002.
|
| |
12
|
|
| |
13
|
R. Chand and P. A. Felber. A scalable protocol for content-based routing in overlay networks. In NCA, 2003.
|
 |
14
|
|
| |
15
|
P. M. Fischer and D. Kossmann. Batched processing for information filters. In ICDE, 2005.
|
| |
16
|
X. Gong, W. Qian, Y. Yan, and A. Zhou. Bloom filter-based XML packets filtering for millions of path queries. In ICDE, 2005.
|
| |
17
|
T. J. Green, G. Miklau, M. Onizuka, and D. Suciu. Processing XML streams with deterministic automata. In ICDT, 2003.
|
| |
18
|
A. Gupta, A. Halevy, and D. Suciu. View selection for XML stream processing. In WebDB, 2002.
|
| |
19
|
A. Gupta and D. Suciu. Stream processing of XPath queries with predicates. In SIGMOD, 2003.
|
| |
20
|
A. Gupta, D. Suciu, and A. Halevy. The view selection problem for XML content based routing. In PODS, 2003.
|
| |
21
|
S. Hou and H. A. Jacobsen. Predicate-based filtering of XPath expressions. In ICDE, 2006.
|
| |
22
|
J. Kwon, P. Rao, B. Moon, and S. Lee. Fist: Scalable XML document filtering by sequencing twig patterns. In VLDB, 2005.
|
| |
23
|
D. S. Rosenblum and A. L. Wolf. A design framework for internet-scale event observation and notification. In ESEC/FSE-5, 1997.
|
| |
24
|
B. Segall, D. Arnold, J. Boot, M. Henderson, and T. Phelps. Content based routing with Elvin4. In AUUG2K, 2000.
|
| |
25
|
W3C. XML path language (XPath). 1999. http://www.w3.org/TR/xpath.
|
| |
26
|
X. Zhang, L. H. Yang, M. L. Lee, and W. Hsu. Scaling SDI systems via query clustring and aggregation. In DASFAA, 2004.
|
|