A parallel index for semistructured data
Source: Symposium on Applied Computing
Proceedings of the 2002 ACM Symposium on Applied Computing
Madrid, Spain
SESSION: Parallel and distributed systems and networking
Pages: 890-896
Year of Publication: 2002
ISBN: 1-58113-445-2
Authors
Brian F. Cooper, Stanford University, Stanford, CA
Neal Sample, Stanford University, Stanford, CA
Moshe Shadmon, RightOrder Inc., San Jose, CA
Sponsor
SIGAPP: ACM Special Interest Group on Applied Computing
Publisher
ACM, New York, NY, USA
DOI: http://doi.acm.org/10.1145/508791.508963

ABSTRACT

Database systems are increasingly used to manage semistructured data, which may not have a fixed structure or set of relationships between data items. Indexes that use tree structures to manage semistructured data become unbalanced and difficult to parallelize because of the complex nature of the data. We propose a mechanism by which an unbalanced vertical tree is managed in a balanced way by additional layers of horizontal index. The vertical tree can then be partitioned among parallel computing nodes in a balanced fashion. We discuss how to construct, search, and update such a horizontal structure using the example of a Patricia trie index. We also present simulation results that demonstrate the speedup offered by such parallelism; for example, with three-way parallelism, our techniques can provide almost a factor of three speedup.

