|
ABSTRACT
In practice, any database management system sometimes needs reorganization, that is, a change in some aspect of the logical and/or physical arrangement of a database. In traditional practice, many types of reorganization have required denying access to a database (taking the database offline) during reorganization. Taking a database offline can be unacceptable for a highly available (24-hour) database, for example, a database serving electronic commerce or armed forces, or for a very large database. A solution is to reorganize online (concurrently with usage of the database, incrementally during users' activities, or interpretively). This article is a tutorial and survey on requirements, issues, and strategies for online reorganization. It analyzes the issues and then presents the strategies, which use the issues. The issues, most of which involve design trade-offs, include use of partitions, the locus of control for the process that reorganizes (a background process or users' activities), reorganization by copying to newly allocated storage (as opposed to reorganizing in place), use of differential files, references to data that has moved, performance, and activation of reorganization. The article surveys online strategies in three categories of reorganization. The first category, maintenance, involves restoring the physical arrangement of data instances without changing the database definition. This category includes restoration of clustering, reorganization of an index, rebalancing of parallel or distributed data, garbage collection for persistent storage, and cleaning (reclamation of space) in a log-structured file system. The second category involves changing the physical database definition; topics include construction of indexes, conversion between B+ -trees and linear hash files, and redefinition (e.g., splitting) of partitions. The third category involves changing the logical database definition. Some examples are changing a column's data type, changing the inheritance hierarchy of object classes, and changing a relationship from one-to-many to many-to-many. The survey encompasses both research and commercial implementations, and this article points out several open research topics. As highly available or very large databases continue to become more common and more important in the world economy, the importance of online reorganization is likely to continue growing.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
Achyutuni, K. J., Omiecinski, E., and Navathe, S. B. 1992. HISET: A generalized technique for efficient concurrent reorganization. Tech. rep. GIT-CC-92/57, College of Computing, Georgia Institute of Technology, Atlanta.
|
| |
3
|
Achyutuni, K. J., Omiecinski, E., and Navathe, S. B. 1993. The impact of data placement strategies on reorganization costs in parallel disk systems. Tech. rep. GIT-CC-93/58, College of Computing, Georgia Institute of Technology, Atlanta.
|
 |
4
|
Kiran J. Achyutuni , Edward Omiecinski , Shamkant B. Navathe, Two techniques for on-line index modification in shared nothing parallel databases, Proceedings of the 1996 ACM SIGMOD international conference on Management of data, p.125-136, June 04-06, 1996, Montreal, Quebec, Canada
|
| |
5
|
Ahlsén, M., Björnerstedt, A., Britts, S., Hultén, C., and Söderlund, L. 1983. Making type changes transparent. In Proceedings of the IEEE Workshop on Languages for Automation. 110--117.
|
 |
6
|
|
| |
7
|
|
| |
8
|
Amer. Natl. Standards Institute. 1992. Database Language SQL. Amer. National Standards Institute, New York. X3.135-1992.
|
| |
9
|
|
| |
10
|
|
 |
11
|
|
 |
12
|
A. W. Appel , J. R. Ellis , K. Li, Real-time concurrent collection on stock multiprocessors, Proceedings of the ACM SIGPLAN 1988 conference on Programming Language design and Implementation, p.11-20, June 20-24, 1988, Atlanta, Georgia, United States
|
| |
13
|
|
 |
14
|
Hezi Azatchi , Yossi Levanoni , Harel Paz , Erez Petrank, An on-the-fly mark and sweep garbage collector based on sliding views, Proceedings of the 18th annual ACM SIGPLAN conference on Object-oriented programing, systems, languages, and applications, October 26-30, 2003, Anaheim, California, USA
|
 |
15
|
|
 |
16
|
David F. Bacon , Perry Cheng , V. T. Rajan, A unified theory of garbage collection, Proceedings of the 19th annual ACM SIGPLAN conference on Object-oriented programming, systems, languages, and applications, October 24-28, 2004, Vancouver, BC, Canada
|
| |
17
|
|
 |
18
|
|
 |
19
|
Jay Banerjee , Won Kim , Hyoung-Joo Kim , Henry F. Korth, Semantics and implementation of schema evolution in object-oriented databases, Proceedings of the 1987 ACM SIGMOD international conference on Management of data, p.311-322, May 27-29, 1987, San Francisco, California, United States
|
| |
20
|
Banzon, A. T., Friske, C. A., Garth, J. M., Ng, K. C., Ruddy, J. A., and Vizconde, B. B. 2006. Method, apparatus and program storage device for managing buffers during online reorganization. U. S. patent application 2006/0173922.
|
 |
21
|
Katherine Barabash , Ori Ben-Yitzhak , Irit Goft , Elliot K. Kolodner , Victor Leikehman , Yoav Ossia , Avi Owshanko , Erez Petrank, A parallel, incremental, mostly concurrent garbage collector for servers, ACM Transactions on Programming Languages and Systems (TOPLAS), v.27 n.6, p.1097-1146, November 2005
[doi> 10.1145/1108970.1108972]
|
 |
22
|
Katherine Barabash , Yoav Ossia , Erez Petrank, Mostly concurrent garbage collection revisited, Proceedings of the 18th annual ACM SIGPLAN conference on Object-oriented programing, systems, languages, and applications, October 26-30, 2003, Anaheim, California, USA
|
| |
23
|
Bartlett, J. F. 1988. Compacting garbage collection with ambiguous roots. Tech. rep. 88/2, Western Research Lab., Digital Equipment Corp., Palo Alto.
|
| |
24
|
Baru, C. and Zilio, D. C. 1993. Data reorganization in parallel database systems. In Proceedings of the IEEE Workshop on Advances in Parallel and Distributed Systems IEEE-CS. IEEE Computer Society Press, Los Alamitos, 102--107. (For later, more extensive analysis, see Zilio {1998}.)
|
| |
25
|
|
| |
26
|
|
| |
27
|
|
 |
28
|
|
| |
29
|
Batra, D. P. 1989. Response time analysis for computer database reorganisation with concurrent usage and changeover state. Defence Sci. J. 39, 1, 33--41. Defence Scientific Info. and Documentation Centre, Ministry of Defence, Delhi, India.
|
| |
30
|
Bayer, R. and McCreight, E. 1970. Organization and maintenance of large ordered indices. In Proceedings of the ACM SICFIDET Workshop on Data Description and Access. ACM, New York, 107--141. (For a revision, see Bayer and McCreight {1972}.)
|
| |
31
|
Bayer, R. and McCreight, E. 1972. Organization and maintenance of large ordered indexes. Acta Inf. 1, 3, 173--189.
|
| |
32
|
|
| |
33
|
Bennett, B. T. and Franaszek, P. A. 1977. Permutation clustering: An approach to on-line storage reorganization. IBM J. Res. Develop. 21, 6, 528--533.
|
 |
34
|
|
| |
35
|
|
 |
36
|
|
 |
37
|
|
 |
38
|
Stephen M. Blackburn , Kathryn S. McKinley, Ulterior reference counting: fast garbage collection without a long wait, Proceedings of the 18th annual ACM SIGPLAN conference on Object-oriented programing, systems, languages, and applications, October 26-30, 2003, Anaheim, California, USA
|
| |
39
|
|
| |
40
|
BMC Software. 1997. REORG PLUS for DB2: General Information, Version 5.1. BMC Software.
|
| |
41
|
BMC Software. 1998. CONCURRENT REORG and the Extended Performance Utilities: General Information, Version 1.3. BMC Software.
|
| |
42
|
Bollella, G., Lizzi, C., and Daynes, L. P. 2009. System and method for ensuring non-interfering garbage collection in a real-time multi-threaded environment. U.S. patent 7,484,067.
|
| |
43
|
Bracht, C. J., Malkemus, T. R., and Versteeg, A. 1991. Table reorganization identification. IBM Tech. Disclosure Bull. 34, 1, 163--165.
|
| |
44
|
|
| |
45
|
Bratsberg, S. E. 1993. Evolution and integration of classes in object-oriented databases. Dr.Ing. thesis, Division of Computer Systems and Telematics, Norwegian Institute of Technology, Trondheim, Norway.
|
| |
46
|
|
| |
47
|
Bratsberg, S. E. Hvasshovd, S.-O., and Torbjørnsen, Ø. 1997. Parallel solutions in ClustRa. Data Engin. 20, 2, 13--20.
|
 |
48
|
|
| |
49
|
|
| |
50
|
|
 |
51
|
Kurt P. Brown , Michael J. Carey , Miron Livny, Goal-oriented buffer management revisited, Proceedings of the 1996 ACM SIGMOD international conference on Management of data, p.353-364, June 04-06, 1996, Montreal, Quebec, Canada
|
| |
52
|
|
| |
53
|
|
 |
54
|
|
 |
55
|
D. D. Chamberlin , J. N. Gray , I. L. Traiger, Views, authorization, and locking in a relational data base system, Proceedings of the May 19-22, 1975, national computer conference and exposition, May 19-22, 1975, Anaheim, California
[doi> 10.1145/1499949.1500032]
|
| |
56
|
|
| |
57
|
Chamberlin, D. D. and Schmuck, F. B. 1992b. Dynamic data distribution in a shared-nothing multiprocessor data store. Resear. rep. RJ 8730, IBM T. J. Watson Research Center, Yorktown Heights.
|
 |
58
|
|
| |
59
|
|
| |
60
|
Chen, I.-R. 1995. A degradable Blink-tree with periodic data reorganization. Comput. J. 38, 3, 245--252.
|
| |
61
|
|
| |
62
|
|
 |
63
|
|
 |
64
|
Peter M. Chen , Edward K. Lee , Garth A. Gibson , Randy H. Katz , David A. Patterson, RAID: high-performance, reliable secondary storage, ACM Computing Surveys (CSUR), v.26 n.2, p.145-185, June 1994
[doi> 10.1145/176979.176981]
|
 |
65
|
|
 |
66
|
|
 |
67
|
|
| |
68
|
|
 |
69
|
|
 |
70
|
|
 |
71
|
|
 |
72
|
Jonathan E. Cook , Artur W. Klauser , Alexander L. Wolf , Benjamin G. Zorn, Semi-automatic, self-adaptive control of garbage collection rates in object databases, Proceedings of the 1996 ACM SIGMOD international conference on Management of data, p.377-388, June 04-06, 1996, Montreal, Quebec, Canada
|
 |
73
|
Jonathan E. Cook , Alexander L. Wolf , Benjamin G. Zorn, Partition selection policies in object database garbage collection, Proceedings of the 1994 ACM SIGMOD international conference on Management of data, p.371-382, May 24-27, 1994, Minneapolis, Minnesota, United States
|
| |
74
|
|
 |
75
|
George Copeland , William Alexander , Ellen Boughter , Tom Keller, Data placement in Bubba, Proceedings of the 1988 ACM SIGMOD international conference on Management of data, p.99-108, June 01-03, 1988, Chicago, Illinois, United States
|
| |
76
|
|
| |
77
|
Crus, R. A. 1984. Data recovery in IBM Database 2. IBM Syst. J. 23, 2, 178--188.
|
| |
78
|
|
| |
79
|
Detlefs, D. L. 2004. A hard look at hard real-time garbage collection. In Proceedings of the 7th IEEE International Symposium on Object-Oriented Real-Time Distributed Computing (ISORC 2004). IEEE Computer Society Press, Los Alamitos, 23--32.
|
 |
80
|
David Detlefs , Christine Flood , Steve Heller , Tony Printezis, Garbage-first garbage collection, Proceedings of the 4th international symposium on Memory management, October 24-25, 2004, Vancouver, BC, Canada
[doi> 10.1145/1029873.1029879]
|
 |
81
|
|
 |
82
|
|
 |
83
|
|
 |
84
|
Tamar Domani , Elliot K. Kolodner , Ethan Lewis , Eliot E. Salant , Katherine Barabash , Itai Lahan , Yossi Levanoni , Erez Petrank , Igor Yanorer, Implementing an on-the-fly garbage collector for Java, Proceedings of the 2nd international symposium on Memory management, p.155-166, October 15-16, 2000, Minneapolis, Minnesota, United States
|
 |
85
|
Tamar Domani , Elliot K. Kolodner , Erez Petrank, A generational on-the-fly garbage collector for Java, Proceedings of the ACM SIGPLAN 2000 conference on Programming language design and implementation, p.274-284, June 18-21, 2000, Vancouver, British Columbia, Canada
|
 |
86
|
|
 |
87
|
|
| |
88
|
|
| |
89
|
|
| |
90
|
|
| |
91
|
|
| |
92
|
|
| |
93
|
|
| |
94
|
|
| |
95
|
Fontana, E. and Dennebouy, Y. 1995. Time-stamped versions for lazy propagation of schema changes. Tech. rep. 95/100, Department of Informatics, Swiss Federal Institute of Technology, Lausanne, Switzerland.
|
| |
96
|
|
| |
97
|
Franaszek, P. A. and Considine, J. P. 1979. Reduction of storage fragmentation on direct access devices. IBM J. Res. Devel. 23, 2, 140--148.
|
| |
98
|
Franaszek, P. A., Robinson, J. T., and Thomasian, A. 1994. Reorganizing data in log-structured file systems. IBM Tech. Disclos. Bull. 37, 6A, 623--624.
|
| |
99
|
Franklin, M., Copeland, G., and Weikum, G. 1989. What's different about garbage collection for persistent programming languages? Tech. rep. ACA-ST-062-89, MCC, Austin.
|
| |
100
|
Friske, C. A. 2004a. DB2 managing large partitioned tables in Version 8. Proc. SHARE 102. SHARE, Chicago.
|
| |
101
|
Friske, C. A. 2004b. DB2 online schema changes -- what's new in DB2 Version 8. Proc. SHARE 102. SHARE, Chicago.
|
| |
102
|
Friske, C. A., Garth, J. M., Lee, C. M., and Ruddy, J. A. 2007. Method and apparatus for building one or more indexes on data concurrent with manipulation of data. U.S. patent 7,308,456.
|
| |
103
|
Friske, C. A., Garth, J. M., and Ruddy, J. A. 2003a. Method of estimating an amount of changed data over plurality of intervals of time measurements. U.S. patent 6,535,870.
|
| |
104
|
Friske, C. A. and Ruddy, J. A. 1998. Online reorganization. DB2 Mag. 3, 3. (Online Ed. http://www.db2mag.com.) CMP Media.
|
| |
105
|
Friske, C. A., Ruddy, J. A., and Shibamiya, A. 2003b. Method for estimating the elapsed-time required for a log apply process. U.S. patent 6,535,893.
|
| |
106
|
|
| |
107
|
|
 |
108
|
|
| |
109
|
Garthwaite, A. T. 2008. Concurrent incremental garbage collector with a card table summarizing modified reference locations. U.S. patent 7,412,580.
|
| |
110
|
Gerritsen, R. 1994. Private communication.
|
 |
111
|
|
| |
112
|
|
| |
113
|
Ghandeharizadeh, S., Gao, S., Gahagan, C., and Krauss, R. 2006. An on-line reorganization framework for SAN file systems. In Proceedings of the 10th East European Conference on Advances in Databases and Information Systems, Y. Manolopoulos, J. Pokorný, and T. K. Sellis, Eds. Lecture Notes in Computer Science, vol. 4152. Springer-Verlag, New York, 399--414.
|
| |
114
|
|
| |
115
|
Richard Golding , Peter Bosch , Carl Staelin , Tim Sullivan , John Wilkes, Idleness is not sloth, Proceedings of the USENIX 1995 Technical Conference Proceedings on USENIX 1995 Technical Conference Proceedings, p.17-17, January 16-20, 1995, New Orleans, Louisiana
|
 |
116
|
|
| |
117
|
|
| |
118
|
|
 |
119
|
|
| |
120
|
|
| |
121
|
|
| |
122
|
|
| |
123
|
|
 |
124
|
|
| |
125
|
Hamada, Y., Ishihara, K., Masumoto, T., Fujita, T., Masatomi, K., Takeda, Y., and Kawashima, M. 2000. On-line database duplication with incorporation of concurrent database updates. U.S. patent 6,023,707.
|
 |
126
|
|
| |
127
|
|
 |
128
|
|
| |
129
|
Ho, F., Jain, R., and Troisi, J. 1994. An overview of NonStop SQL/MP. Tandem Syst. Rev. 10, 3, 6--17. Hewlett-Packard.
|
| |
130
|
Huras, M. A., Hing, N. H., Goss, J. J., and Lindsay, B. G. 2005. Online database table reorganization. U.S. patent 6,950,834.
|
| |
131
|
IBM. 1986. IMS/VS Version 1 Utilities Reference Manual. IBM. SH20-9029-9.
|
| |
132
|
IBM. 1993a. Implementing Concurrent Copy. IBM. GG24-3990-00.
|
| |
133
|
IBM. 1993b. An Introduction to Data Propagator Relational Release 1. IBM. GC26-3398-01.
|
| |
134
|
IBM. 1997. DB2 for OS/390 Version 5 Utility Guide and Reference. IBM. SC26-8967-00.
|
| |
135
|
IBM. 2001. DB2 Universal Database for OS/390 and z/OS Utility Guide and Reference, Version 7. IBM. SC26-9945-00.
|
| |
136
|
IBM. 2005. IMS Administration Guide: Database Manager, Version 9. IBM. SC18-7806-01.
|
| |
137
|
IBM and Integrated Systems Solutions. 1994. Replidata/MVS User's Guide. IBM and Integrated Systems Solutions. BLD-REP-UG-00.
|
| |
138
|
Isip, Jr., A. B. 2007. System and method for reorganizing stored data. U.S. patent 7,225,206.
|
| |
139
|
Itasca Systems. 1993. ITASCA Distributed Object Database Management System: Technical Summary for Release 2.2. Itasca Systems.
|
| |
140
|
Iyer, B. R. and Sockut, G. H. 2002. Methods for in-place online reorganization of a database. U.S. patent 6,411,964.
|
| |
141
|
|
 |
142
|
|
 |
143
|
|
 |
144
|
|
 |
145
|
|
 |
146
|
|
| |
147
|
|
 |
148
|
Wiebren de Jonge , M. Frans Kaashoek , Wilson C. Hsieh, The logical disk: a new approach to improving file systems, Proceedings of the fourteenth ACM symposium on Operating systems principles, p.15-28, December 05-08, 1993, Asheville, North Carolina, United States
|
| |
149
|
Katabami, N., Itoh, T., and Takahashi, Y. 1997. Method and apparatus for reorganizing an on-line database system in accordance with an access time increase. U.S. patent 5,596,747.
|
 |
150
|
Haim Kermany , Erez Petrank, The Compressor: concurrent, incremental, and parallel compaction, Proceedings of the 2006 ACM SIGPLAN conference on Programming language design and implementation, June 11-14, 2006, Ottawa, Ontario, Canada
|
 |
151
|
|
 |
152
|
|
| |
153
|
Keyes, J. 1992. DBAs face challenge of 24 by 7 availability. Software Mag. 12, 11, 58--63.
|
| |
154
|
|
| |
155
|
Kitsuregawa, M., Goda, K., and Kawamura, N. 2008. Method and system for data processing with database reorganization for the same. U.S. patent 7,421,456.
|
| |
156
|
Kitsuregawa, M. and Mogi, K. 1993. Virtual striping: A RAID5 storage management scheme with robustness for the peak access traffic. In Proceedings of the International Symposium on Next Generation Database Systems and Their Applications. ACM, New York, 280--287.
|
| |
157
|
Kohl, J. T., Staelin, C., and Stonebraker, M. 1993. HighLight: Using a log-structured file system for tertiary storage management. In Proceedings of the Winter 1993 USENIX Conference. USENIX Assoc., Berkeley, 435--447.
|
| |
158
|
Kolodner, E. K. 1990. Atomic incremental garbage collection and recovery for a large stable heap. In Proceedings of the 4th International Workshop on Persistent Object Systems, A. Dearle, G. M. Shaw, and S. B. Zdonik, Eds. Morgan-Kaufmann, San Francisco, 185--198. (see Kolodner {1992}.)
|
| |
159
|
|
| |
160
|
Kolodner, E. K., Lewis, E., and Petrank, E. 2005. Trace termination for on-the-fly garbage collection for weakly-consistent computer architecture. U.S. patent 6,920,541.
|
| |
161
|
Kolodner, E. K. and Petrank, E. 2002. On-the-fly garbage collector. U.S. patent 6,490,599.
|
 |
162
|
|
 |
163
|
|
| |
164
|
|
 |
165
|
Mohana K. Lakhamraju , Rajeev Rastogi , S. Seshadri , S. Sudarshan, On-line reorganization in object databases, Proceedings of the 2000 ACM SIGMOD international conference on Management of data, p.58-69, May 15-18, 2000, Dallas, Texas, United States
|
| |
166
|
Lakhamraju, M. K., Rastogi, R., Seshadri, S., and Sudarshan, S. 2002. On-line reorganization in object-oriented databases. U.S. patent 6,343,296. (This is similar to Lakhamraju et al. {2000}.)
|
| |
167
|
|
| |
168
|
|
| |
169
|
Langley, J. T. and Moore, D. W. 2008. Non-disruptive backup copy in a database online reorganization environment. U.S. patent 7,433,902.
|
 |
170
|
Mong Li Lee , Masaru Kitsuregawa , Beng Chin Ooi , Kian-Lee Tan , Anirban Mondal, Towards self-tuning data placement in parallel database systems, Proceedings of the 2000 ACM SIGMOD international conference on Management of data, p.225-236, May 15-18, 2000, Dallas, Texas, United States
|
| |
171
|
|
| |
172
|
|
| |
173
|
|
| |
174
|
|
 |
175
|
|
| |
176
|
Løland, J. and Hvasshovd, S.-O. 2006. Online, non-blocking relational schema changes. In Proceedings of the 10th International Conference on Extending Database Technology. Advances in Database Technology—EDBT 2006, Y. E. Ioannidis, M. H. Scholl, J. W. Schmidt, F. Matthes, M. Hatzopoulos, K. Böhm, A. Kemper, T. Grust, and C. Böhm, Eds. Lecture Notes in Computer Science, vol. 3896. Springer-Verlag, New York, 405--422.
|
 |
177
|
|
 |
178
|
|
| |
179
|
|
| |
180
|
Christopher R. Lumb , Jiri Schindler , Gregory R. Ganger , David F. Nagle , Erik Riedel, Towards higher disk head utilization: extracting free bandwidth from busy disk drives, Proceedings of the 4th conference on Symposium on Operating System Design & Implementation, p.7-7, October 22-25, 2000, San Diego, California
|
 |
181
|
|
| |
182
|
Maier, D. S., Marton, R. S., Troisi, J. H., and Celis, P. 1997. Relational database system and method with high data availability during table data restructuring. U.S. patent 5,625,815.
|
| |
183
|
Malhotra, A. and Munroe, S. J. 1997. Schema evolution in persistent object systems. In Proceedings of the 7th International Workshop on Persistent Object Systems, R. Connor and S. Nettles, Eds. Morgan-Kaufmann, San Francisco, 194--204.
|
| |
184
|
Manber, U. 1984. Concurrent maintenance of binary search trees. IEEE Trans. Softw. Engin. SE-10, 6, 777--784.
|
 |
185
|
|
 |
186
|
|
| |
187
|
Marshall, B. J., Henderson, M. D., Bailey, M. I., Bruce, T. R., Rogala, R. J., Webster, W. A., Metzner, D. F., and Dres, P. J. 2006. Method and system for online reorganization of databases. U.S. patent 7,117,229.
|
| |
188
|
|
| |
189
|
Martin, Jr., J. L. and Crown, Jr., G. N. 2003a. IMS on-line reorganization utility. U.S. patent 6,606,631.
|
| |
190
|
Martin, Jr., J. L. and Crown, Jr., G. N. 2003b. System and method for analyzing a database for on-line reorganization. U.S. patent 6,633,884.
|
 |
191
|
|
 |
192
|
Jeanna Neefe Matthews , Drew Roselli , Adam M. Costello , Randolph Y. Wang , Thomas E. Anderson, Improving the performance of log-structured file systems with adaptive methods, Proceedings of the sixteenth ACM symposium on Operating systems principles, p.238-251, October 05-08, 1997, Saint Malo, France
|
 |
193
|
Mark L. Mcauliffe , Michael J. Carey , Marvin H. Solomon, Vclusters: a flexible, fine-grained object clustering mechanism, Proceedings of the 13th ACM SIGPLAN conference on Object-oriented programming, systems, languages, and applications, p.230-243, October 18-22, 1998, Vancouver, British Columbia, Canada
|
 |
194
|
|
 |
195
|
Andrew McCreight , Zhong Shao , Chunxiao Lin , Long Li, A general framework for certifying garbage collectors and their mutators, Proceedings of the 2007 ACM SIGPLAN conference on Programming language design and implementation, June 10-13, 2007, San Diego, California, USA
|
| |
196
|
|
 |
197
|
William J. McIver, Jr. , Roger King, Self-adaptive, on-line reclustering of complex object data, Proceedings of the 1994 ACM SIGMOD international conference on Management of data, p.407-418, May 24-27, 1994, Minneapolis, Minnesota, United States
|
| |
198
|
|
| |
199
|
Meltzer, H. S. 1975. An overview of the administration of data bases. In Proceedings of the 2nd USA-Japan Computer Conference. AFIPS Press, Reston, 365--370.
|
| |
200
|
|
| |
201
|
Mendelson, H. and Yechiali, U. 1981. Optimal policies for data base reorganization. Oper. Resear. 29, 1, 23--36.
|
| |
202
|
Menon, J. and Stockmeyer, L. 1998. An age-threshold algorithm for garbage collection in log-structured arrays and file systems. In Proceedings of the 12th Annual International Symposium on High-Performance Computing Systems and Application, J. Schaefer, Ed. Kluwer Academic Publishers, Hingham, 119--132. (More details appear in Resear. Rep. RJ 10120, IBM T. J. Watson Research Center, Yorktown Heights.)
|
| |
203
|
Mix, F. 1992. DB2 Reorg and continuous select access. In Proceedings of the Summer 1992 Meeting/SHARE 79. SHARE, Chicago, 2577--2614. Vol. II (microfiche). (Mix {1994} describes a revision.)
|
| |
204
|
Mix, F. 1994. DB2 Reorg and continuous select. In Proceedings of the 6th Annual North American Conference of the International DB2 Users Group. IDUG, Chicago, 545--563.
|
| |
205
|
Mizoguchi, M. and Nishiyama, K. 1994. Database management system. Japan patent 06-67944.
|
| |
206
|
|
| |
207
|
|
 |
208
|
|
| |
209
|
|
 |
210
|
|
 |
211
|
|
 |
212
|
|
| |
213
|
|
| |
214
|
Mondal, A. 2002. Query-centric online reorganization of indexed data in shared-nothing architectures. Ph.D. dissertation, School of Computing, National University of Singapore.
|
 |
215
|
Anirban Mondal , Masaru Kitsuregawa , Beng Chin Ooi , Kian Lee Tan, R-tree-based data migration and self-tuning strategies in shared-nothing spatial databases, Proceedings of the 9th ACM international symposium on Advances in geographic information systems, November 09-10, 2001, Atlanta, Georgia, USA
[doi> 10.1145/512161.512169]
|
| |
216
|
|
| |
217
|
Morsi, M. M. A., Navathe, S. B., and Kim, H.-J. 1991. A schema management and prototyping interface for an object-oriented database environment. In Proceedings of the IFIP Working Conference on Object-Oriented Approach in Information Systems, F. Van Assche, B. Moulin, and C. Rolland, Eds. North-Holland, Amsterdam, The Netherlands, 157--180. (For more details, see Morsi {1992}.)
|
| |
218
|
|
| |
219
|
Moss, J. E. B., Munro, D. S., and Hudson, R. L. 1997. PMOS: A complete and coarse-grained incremental garbage collector for persistent object stores. In Proceedings of the 7th International Workshop on Persistent Object Systems, R. Connor and S. Nettles, Eds. Morgan Kaufmann Publishers, San Francisco, 140--150. (Munro et al. {1999} describe an implementation.)
|
| |
220
|
|
 |
221
|
|
 |
222
|
|
 |
223
|
O. Nurmi , E. Soisalon-Soininen , D. Wood, Concurrency control in database structures with relaxed balance, Proceedings of the sixth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems, p.170-176, March 23-25, 1987, San Diego, California, United States
[doi> 10.1145/28659.28677]
|
| |
224
|
|
| |
225
|
Olson, N. E. 1993. Partial indexing in POSTGRES. M.S. dissertation. Department of Electrical Engineering and Computer Science, University of California, Berkeley.
|
| |
226
|
Omiecinski, E. 1985a. Concurrency during the reorganization of indexed files. In Proceedings of the 9th International Conference on Computer Software and Applications (COMPSAC 85). IEEE Computer Society Press, Los Alamitos, 482--488.
|
| |
227
|
|
| |
228
|
|
| |
229
|
|
| |
230
|
Omiecinski, E. 1996. Concurrent file reorganization: Clustering, conversion and maintenance. Data Engin. 19, 2, 25--32. IEEE-CS TC DE.
|
| |
231
|
|
| |
232
|
|
| |
233
|
|
| |
234
|
Oracle. 2005. Oracle Database 10g Release 2 online data reorganization & redefinition. Oracle White Paper.
|
| |
235
|
|
 |
236
|
Yoav Ossia , Ori Ben-Yitzhak , Irit Goft , Elliot K. Kolodner , Victor Leikehman , Avi Owshanko, A parallel, incremental and concurrent GC for servers, Proceedings of the ACM SIGPLAN 2002 Conference on Programming language design and implementation, June 17-19, 2002, Berlin, Germany
|
 |
237
|
James O'Toole , Scott Nettles , David Gifford, Concurrent compacting garbage collection of a persistent heap, Proceedings of the fourteenth ACM symposium on Operating systems principles, p.161-174, December 05-08, 1993, Asheville, North Carolina, United States
|
 |
238
|
|
| |
239
|
|
| |
240
|
Park, J. S., Bartoszynski, R., and Pirkul, H. 1989. Optimal database reorganization policies: A stochastic control approach. In Proceedings of the 22nd Annual Hawaii International Conference on System Sciences. Vol. 3. IEEE Computer Society Press, Los Alamitos, 752--761. (For a revision, see Park et al. {1990}.)
|
| |
241
|
|
 |
242
|
|
| |
243
|
Paz, H., Petrank, E., Bacon, D. F., Kolodner, E. K., and Rajan, V. T. 2005. An efficient on-the-fly cycle collection. In Compiler Construction: 14th International Conference, CC 2005, Held as Part of Joint European Conferences on Theory and Practice of Software, ETAPS 2005, R. Bodik, Ed. Lecture Notes in Computer Science, vol. 3443. Springer-Verlag, New York, 156--171. (For a revision, see Paz et al. {2007}.)
|
| |
244
|
Pereira, H. M. 2000. Method and apparatus for reorganizing an active DBMS table. U.S. patent 6,122, 640.
|
| |
245
|
|
 |
246
|
|
 |
247
|
Filip Pizlo , Daniel Frampton , Erez Petrank , Bjarne Steensgaard, Stopless: a real-time garbage collector for multiprocessors, Proceedings of the 6th international symposium on Memory management, October 21-22, 2007, Montreal, Quebec, Canada
[doi> 10.1145/1296907.1296927]
|
 |
248
|
|
| |
249
|
Plow, G. M., Pourmirzaie, F. E., and Shiomi, T. 2008. Method for reorganizing a set of database partitions. U.S. patent 7,454,449.
|
| |
250
|
|
| |
251
|
Pong, M. 1990. An overview of NonStop SQL Release 2. Tandem Syst. Rev. 6, 2, 4--11.
|
 |
252
|
|
 |
253
|
|
| |
254
|
|
| |
255
|
Pu, C. 1986. On-the-fly, incremental, consistent reading of entire databases. Algorithmica 1, 3, 271--287.
|
| |
256
|
C. Pu , C. H. Hong , J. M. Wha, Performance evaluation of global reading of entire databases, Proceedings of the first international symposium on Databases in parallel and distributed systems, p.167-176, December 05-07, 1988, Austin, Texas, United States
|
 |
257
|
|
| |
258
|
|
| |
259
|
|
| |
260
|
Radosevich, L. 1993. Hotel cans mainframes. Computerworld 27, 8, 1 and 16.
|
| |
261
|
|
 |
262
|
|
| |
263
|
Ram, S. and Shankaranarayanan, G. 2003. Research issues in database schema evolution: The road not taken. Working Paper 2003-15, Information Systems Department, Boston University School of Management.
|
| |
264
|
Ramírez, R. J., Tompa, F. W., and Munro, J. I. 1982. Optimum reorganization points for arbitrary database costs. Acta Informatica 18, 1, 17--30.
|
| |
265
|
Rengarajan, T. K., Dimino, L., and Chung, D. 1996. Sybase System 11 online capabilities. Data Engin. 19, 2, 19--24.
|
| |
266
|
|
| |
267
|
|
 |
268
|
|
| |
269
|
Roddick, J. F. 1994. Schema evolution in database systems—An updated bibliography. Tech. rep. CIS-94-012, School of Computer and Information Science, University of South Australia, The Levels, S. Australia.
|
| |
270
|
Roddick, J. F. 1995. A survey of schema versioning issues for database systems. Inf. Softw. Tech. 37, 7, 383--393.
|
| |
271
|
|
 |
272
|
|
 |
273
|
|
| |
274
|
Roy, P., Seshadri, S., Silberschatz, A., and Sudarshan, S. 2002. Garbage collection in object-oriented databases using transactional cyclic reference counting. U.S. patent 6,363,403.
|
| |
275
|
|
| |
276
|
Ruddy, J. A., Shyam, K., Sockut, G. H., and Watts, J. A. 1999. Application of log records to data compressed with different encoding scheme. U.S. patent 5,897,641.
|
 |
277
|
|
| |
278
|
|
| |
279
|
Salant, E. E. and Kolodner, E. K. 2002. Data structure for keeping track of objects remaining to be traced by concurrent garbage collector. U.S. patent 6,393,440.
|
| |
280
|
|
| |
281
|
Salzberg, B. 1985. Restructuring the Lehman-Yao tree. Tech. rep. BS-85-21, College of Computer Science, Northeastern University, Boston.
|
| |
282
|
|
| |
283
|
|
| |
284
|
|
| |
285
|
Scheuermann, P., Weikum, G., and Zabback, P. 1994. “Disk cooling” in parallel disk systems. Bull. Tech. Committee Data Engin. 17, 3, 29--40.
|
| |
286
|
|
 |
287
|
|
 |
288
|
|
| |
289
|
Schubert, R. F. 1974. Directions in data base management technology. Datamation 20, 9 (Sept), 48--51.
|
| |
290
|
|
| |
291
|
Seidl, M. L., Wright, G. M., and Wolczko, M. I. 2008. Method and system for concurrent garbage collection and mutator execution. U.S. patent 7,421,539.
|
| |
292
|
|
| |
293
|
Margo Seltzer , Keith Bostic , Marshall Kirk Mckusick , Carl Staelin, An implementation of a log-structured file system for UNIX, Proceedings of the USENIX Winter 1993 Conference Proceedings on USENIX Winter 1993 Conference Proceedings, p.3-3, January 25-29, 1993, San Diego, California
|
 |
294
|
|
 |
295
|
|
 |
296
|
|
 |
297
|
|
 |
298
|
|
| |
299
|
Singhal, V., Kakkad, S. V., and Wilson, P. R. 1992. Texas: An efficient, portable persistent store. In Proceedings of the 5th International Workshop on Persistent Object Systems, A. Albano and R. Morrison, Eds. Springer-Verlag, New York, 11--33.
|
| |
300
|
Siwiec, J. E. 1977. A high-performance DB/DC system. IBM Syst. J. 16, 2, 169--195.
|
| |
301
|
|
| |
302
|
Smith, B. F. 1994. Understanding DB2 RUNSTATS statistics and version 3 enhancements. In Proceedings of the SHARE 82. Vol. II (CD-ROM). SHARE, Chicago.
|
| |
303
|
Smith, G. S. 1990. Online reorganization of key-sequenced tables and files. Tandem Syst. Rev. 6, 2, 52--59.
|
| |
304
|
Sockut, G. H. 1974. A graphics monitor for animation of dynamic processes. M.S. dissertation., Department of Electrical Engineering, Massachusetts Institute of Technology, Cambridge.
|
| |
305
|
Sockut, G. H. 1977. Data base performance under concurrent reorganization and usage. Ph.D. dissertation, Division of Applied Sciences, Harvard University, Cambridge. (Also Tech. rep. 12-77, Center for Research in Computing Technology.)
|
| |
306
|
Sockut, G. H. 1978. A performance model for computer data-base reorganization performed concurrently with usage. Oper. Res. 26, 5, 789--804. (For more details, see Sockut {1977}.)
|
| |
307
|
|
| |
308
|
Sockut, G. H. and Beavin, T. A. 1998. Interaction between application of a log and maintenance of a table that maps record identifiers during online reorganization of a database. U.S. patent 5,721,915. (Also U.S. Patent 6,026,412, 2000.)
|
| |
309
|
|
| |
310
|
Sockut, G. H. and Goldberg, R. P. 1976. Motivation for data base reorganization performed concurrently with usage. Tech. rep. 16-76, Center for Research in Computing Technology, Harvard University, Cambridge, MA. (full article). (Also in Preprints, IEEE-CS Workshop on Operating and Data base Management Systems (Datab. Engin. 1, 1, 1977, IEEE-CS Technical Communication on Data Base Engineering), 18-19 (abstract). For more details, see Sockut {1977}.)
|
 |
311
|
|
| |
312
|
Sockut, G. H. and Iyer, B. R. 1996. A survey on online reorganization in IBM products and research. Data Engin. 19, 2, 4--11.
|
| |
313
|
Söderlund, L. 1980. A study on concurrent data base reorganization. Ph.D. dissertation. Deparment of Information Processing and Computer Science, University of Stockholm, Sweden.
|
| |
314
|
|
 |
315
|
|
| |
316
|
Jagannathan Srinivasan , Souripriya Das , Chuck Freiwald , Eugene Inseok Chong , Mahesh Jagannath , Aravind Yalamanchi , Ramkumar Krishnan , Anh-Tuan Tran , Samuel DeFazio , Jayanta Banerjee, Oracle8i Index-Organized Table and Its Application to New Domains, Proceedings of the 26th International Conference on Very Large Data Bases, p.285-296, September 10-14, 2000
|
| |
317
|
|
| |
318
|
Srinivasan, V. and Carey, M. J. 1991a. On-line index construction algorithms. Tech. rep. 1008, Computer Sciences Department, University of Wisconsin, Madison. (Presented at 4th International Workshop on High-Performance Transaction Systems; For more details, see Srinivasan {1992}.)
|
 |
319
|
|
 |
320
|
|
| |
321
|
|
| |
322
|
|
 |
323
|
|
 |
324
|
|
 |
325
|
|
| |
326
|
|
| |
327
|
Strickland, J. P., Uhrowczik, P. P., and Watts, V. L. 1982. IMS/VS: An evolving system. IBM Syst. J. 21, 4, 490--510.
|
| |
328
|
Subramaniam, M. and Loaiza, J. 2005. Online reorganization and redefinition of relational database tables. U.S. patent 6,965,899.
|
| |
329
|
Sun, X. 2005. Online reorganization in parallel database systems: Scalability, load balancing and high availability. Ph.D. dissertation, College of Computer Science, Northeastern University, Boston.
|
 |
330
|
|
| |
331
|
Takahashi, H. and Takahashi, J. 1990. Record-type change method by binary relations. IBM Tech. Disclosure Bull. 33, 2, 262--264. (For more details, see Takahashi {1990}.)
|
| |
332
|
Takahashi, J. 1990. Hybrid relations for database schema evolution. In Proceedings of the 14th Annual International Computer Software and Applications Conference (COMPSAC 90). IEEE Computer Society Press, Los Alamitos, 465--470.
|
| |
333
|
Tan, L. and Katayama, T. 1990. Meta operations for type management in object-oriented databases: A lazy mechanism for schema evolution. In Proceedings of the 1st International Conference on Deductive and Object-Oriented Databases, W. Kim, J.-M. Nicolas, and S. Nishio, Eds. North-Holland, Amsterdam, The Netherlands, 241--258.
|
 |
334
|
|
| |
335
|
Tene, G. and Wolf, M. A. 2008. System and method for concurrent compacting self pacing garbage collection using loaded value and access barriers. U.S. patent 7,469,324.
|
| |
336
|
Teng, J. Z. and Todd, J. A. 2002. Method, system, and program for managing file names during the reorganization of a database object. U.S. patent 6,460,048.
|
| |
337
|
|
 |
338
|
|
 |
339
|
|
| |
340
|
Troisi, J. 1994. Nonstop availability and database configuration operations. Tandem Syst. Rev. 10, 3, 18--23.
|
| |
341
|
Troisi, J. 1996. Nonstop SQL/MP availability and database configuration operations. Data Engin. 19, 2, 12--18.
|
 |
342
|
|
| |
343
|
|
 |
344
|
|
 |
345
|
Martin T. Vechev , Eran Yahav , David F. Bacon , Noam Rinetzky, CGCExplorer: a semi-automated search procedure for provably correct concurrent collectors, Proceedings of the 2007 ACM SIGPLAN conference on Programming language design and implementation, June 10-13, 2007, San Diego, California, USA
|
| |
346
|
Versant Object Technology. 1991. VERSANT System Reference Manual (for IBM RISC System/6000). Versant Object Technology.
|
 |
347
|
Radek Vingralek , Yuri Breitbart , Gerhard Weikum, Distributed file organization with scalable cost/performance, Proceedings of the 1994 ACM SIGMOD international conference on Management of data, p.253-264, May 24-27, 1994, Minneapolis, Minnesota, United States
|
| |
348
|
|
| |
349
|
|
| |
350
|
|
| |
351
|
|
| |
352
|
|
 |
353
|
Michal Wegiel , Chandra Krintz, The mapping collector: virtual memory support for generational, parallel, and concurrent compaction, Proceedings of the 13th international conference on Architectural support for programming languages and operating systems, March 01-05, 2008, Seattle, WA, USA
|
| |
354
|
Weikum, G. 1989. Clustering vs. concurrency: A framework and a case study. Tech. rep. ACA-ST-096-89, MCC, Austin.
|
 |
355
|
Gerhard Weikum , Peter Zabback , Peter Scheuermann, Dynamic file allocation in disk arrays, Proceedings of the 1991 ACM SIGMOD international conference on Management of data, p.406-415, May 29-31, 1991, Denver, Colorado, United States
|
| |
356
|
|
 |
357
|
|
| |
358
|
|
| |
359
|
Wilson, T. B. 1979. Data base restructuring: Options and obstacles. In Proceedings of the European Conference on Applied Information Technology (EURO IFIP 79), P. A. Samet, Ed. North-Holland, Amsterdam, The Netherlands, 567--573.
|
| |
360
|
Wilson, T. B. 1980. The description and usage of evolving schemas. In Proceedings of the 4th International Computer Software and Application Conference (COMPSAC '80). IEEE Computer Society Press, Los Alamitos, 546--551.
|
 |
361
|
|
| |
362
|
Winter. 2005. TopTen program winners. On the web site (http://www.wintercorp.com), search the press releases for the most recent TopTen Program Winners.
|
| |
363
|
Wolczko, M. and Williams, I. 1992. Multi-level garbage collection in a high-performance persistent object system. In Proceedings of the 5th International Workshop on Persistent Object Systems, A. Albano and R. Morrison, Eds. Springer-Verlag, New York, 396--418.
|
 |
364
|
|
| |
365
|
|
| |
366
|
|
 |
367
|
|
| |
368
|
|
| |
369
|
|
| |
370
|
Zou, C. and Salzberg, B. 1996a. Efficiently updating references during on-line reorganization. Tech. rep. NU-CCS-96-08, College of Computer Science, Northeastern University, Boston.
|
 |
371
|
|
| |
372
|
Zou, C. and Salzberg, B. 1996c. Towards efficient online database reorganization. Data Engin. 19, 2, 33--40.
|
| |
373
|
|
INDEX TERMS
Primary Classification:
H.
Information Systems
H.2
DATABASE MANAGEMENT
H.2.7
Database Administration
Additional Classification:
C.
Computer Systems Organization
C.2
COMPUTER-COMMUNICATION NETWORKS
C.2.4
Distributed Systems
Subjects:
Distributed databases
C.4
PERFORMANCE OF SYSTEMS
D.
Software
D.2
SOFTWARE ENGINEERING
D.2.9
Management
Subjects:
Life cycle
D.3
PROGRAMMING LANGUAGES
D.3.3
Language Constructs and Features
Subjects:
Dynamic storage management
D.3.4
Processors
Subjects:
Memory management (garbage collection)
D.4
OPERATING SYSTEMS
D.4.1
Process Management
Subjects:
Scheduling;
Synchronization;
Concurrency
D.4.2
Storage Management
Subjects:
Allocation/deallocation strategies;
Garbage collection
D.4.3
File Systems Management
Subjects:
File organization;
Distributed file systems;
Access methods
D.4.9
Systems Programs and Utilities
E.
Data
E.5
FILES
Subjects:
Organization/structure
H.
Information Systems
H.2
DATABASE MANAGEMENT
H.2.1
Logical Design
Subjects:
Schema and subschema
H.2.2
Physical Design
Subjects:
Access methods
H.2.4
Systems
Subjects:
Distributed databases;
Concurrency;
Parallel databases
K.
Computing Milieux
K.6
MANAGEMENT OF COMPUTING AND INFORMATION SYSTEMS
K.6.3
Software Management
Subjects:
Software maintenance
General Terms:
Algorithms,
Design,
Performance
Keywords:
Clustering,
concurrent reorganization,
indexes,
log-structured file systems,
maintenance,
online reorganization,
redefinition,
reorganization,
restructuring,
schema evolution,
very large databases
|