ABSTRACT
Volcano is a new dataflow query processing system we have developed for database systems research and education. The uniform interface between operators makes Volcano extensible by new operators. All operators are designed and coded as if they were meant for a single-process system only. When attempting to parallelize Volcano, we had to choose between two models of parallelization, called here the bracket and operator models. We describe the reasons for not choosing the bracket model, introduce the novel operator model, and provide details of Volcano's exchange operator that parallelizes all other operators. It allows intra-operator parallelism on partitioned datasets and both vertical and horizontal inter-operator parallelism. The exchange operator encapsulates all parallelism issues and therefore makes implementation of parallel database algorithms significantly easier and more robust. Included in this encapsulation is the translation between demand-driven dataflow within processes and data-driven dataflow between processes. Since the interface between Volcano operators is similar to the one used in “real,” commercial systems, the techniques described here can be used to parallelize other query processing engines.
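The division of labor described above can be illustrated with a minimal sketch. It is not Volcano's code: the class names (Scan, Filter, Exchange), the open/next/close method names, the use of None as an end-of-stream signal, and the helper run are all assumptions made for illustration, and Python threads with a bounded queue stand in for the processes the abstract refers to. Scan and Filter are written as plain single-process iterators. Exchange alone starts the workers, drives each input subtree by demand inside its worker, and passes records to the consumer in data-driven fashion through the queue.

```python
import queue
import threading

_END = object()  # end-of-stream marker; each producer sends exactly one


class Scan:
    """Leaf operator over an in-memory list (stand-in for a file scan).

    Assumption: None signals end of stream, so rows must not be None.
    """

    def __init__(self, rows):
        self.rows = rows
        self.pos = 0

    def open(self):
        self.pos = 0

    def next(self):
        if self.pos >= len(self.rows):
            return None
        row = self.rows[self.pos]
        self.pos += 1
        return row

    def close(self):
        pass


class Filter:
    """Single-process operator: pulls from its child on demand."""

    def __init__(self, child, predicate):
        self.child = child
        self.predicate = predicate

    def open(self):
        self.child.open()

    def next(self):
        while True:
            row = self.child.next()
            if row is None or self.predicate(row):
                return row

    def close(self):
        self.child.close()


class Exchange:
    """Hypothetical exchange operator: same interface, parallelism inside.

    One worker thread per input subtree (one per partition). Each worker
    pulls from its subtree on demand and pushes rows into a bounded queue;
    the consumer's next() simply takes the next available row.
    """

    def __init__(self, children, capacity=16):
        self.children = children
        self.capacity = capacity

    def open(self):
        self.buffer = queue.Queue(maxsize=self.capacity)
        self.live = len(self.children)  # producers still running
        self.workers = [
            threading.Thread(target=self._produce, args=(child,))
            for child in self.children
        ]
        for worker in self.workers:
            worker.start()

    def _produce(self, child):
        child.open()
        try:
            while True:
                row = child.next()  # demand-driven inside the worker
                if row is None:
                    break
                self.buffer.put(row)  # data-driven toward the consumer
        finally:
            child.close()
            self.buffer.put(_END)

    def next(self):
        while self.live > 0:
            item = self.buffer.get()
            if item is _END:
                self.live -= 1
            else:
                return item
        return None

    def close(self):
        # Drain leftovers so that no producer stays blocked on a full queue.
        while self.next() is not None:
            pass
        for worker in self.workers:
            worker.join()


def run(plan):
    """Open a plan, pull every row, close it, and return the rows."""
    plan.open()
    rows = []
    while (row := plan.next()) is not None:
        rows.append(row)
    plan.close()
    return rows
```

In this sketch, a plan such as Exchange([Filter(Scan(p), pred) for p in partitions]) runs one Filter-over-Scan pipeline per partition without either operator containing any synchronization code. The order in which rows arrive at the consumer depends on thread scheduling and is not defined.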