|
ABSTRACT
The old problem of managing the history of database information is now made more urgent and complex by fast spreading web information systems, such as Wikipedia. Our PRIMA system addresses this difficult problem by introducing two key pieces of new technology. The first is a method for publishing the history of a relational database in XML, whereby the evolution of the schema and its underlying database are given a unified representation. This temporally grouped representation makes it easy to formulate sophisticated historical queries on any given schema version using standard XQuery. The second key piece of technology is that schema evolution is transparent to the user: she writes queries against the current schema while retrieving the data from one or more schema versions. The system then performs the labor-intensive and error-prone task of rewriting such queries into equivalent ones for the appropriate versions of the schema. This feature is particularly important for historical queries spanning over potentially hundreds of different schema versions and it is realized in PRIMA by (i) introducing Schema Modification Operators (SMOs) to represent the mappings between successive schema versions and (ii) an XML integrity constraint language (XIC) to efficiently rewrite the queries using the constraints established by the SMOs. The scalability of the approach has been tested against both synthetic data and real-world data from the Wikipedia DB schema evolution history.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Oracle Documentation. http://otn.oracle.com.
|
| |
2
|
|
| |
3
|
R. B. Almeida, B. Mozafari, and J. Cho. On the evolution of wikipedia. In ICWSM, 2007.
|
| |
4
|
|
| |
5
|
P. A. Bernstein. Applying model management to classical meta data problems. In CIDR, 2003.
|
| |
6
|
|
| |
7
|
|
| |
8
|
|
| |
9
|
C. Curino, H. J. Moon, L. Tanca, and C. Zaniolo. Schema evolution in wikipedia: toward a web information system benchmark. In ICEIS, 2008.
|
| |
10
|
|
| |
11
|
C. Curino, H. J. Moon, and C. Zaniolo. Managing the history of metadata in support for db archiving and schema evolution. In ECDM, 2008.
|
| |
12
|
|
| |
13
|
|
| |
14
|
S. Kepser. A proof of the turing-completeness of xslt and xquery. In Technical report SFB 441, Eberhard Karls Universitat Tubingen, 2002.
|
| |
15
|
S. Marche. Measuring the stability of data models. European Journal of Information Systems, 2(1):37--47, 1993.
|
| |
16
|
|
 |
17
|
|
| |
18
|
|
| |
19
|
|
| |
20
|
S. Ram and G. Shankaranarayanan. Research issues in database schema evolution: the road not taken. In Boston University School of Management, Department of Information Systems, Working Paper, 2003.
|
| |
21
|
|
| |
22
|
J. Roddick. A Survey of Schema Versioning Issues for Database Systems. Information and Software Technology, 37(7):383--393, 1995.
|
 |
23
|
|
| |
24
|
D. I. Sjoberg. Quantifying schema evolution. Information and Software Technology, 35(1):35--44, 1993
|
| |
25
|
|
| |
26
|
|
| |
27
|
|
| |
28
|
|
 |
29
|
|
| |
30
|
|
CITED BY 2
|
|
|
|
|
Hyun J. Moon , Carlo A. Curino , Myungwon Ham , Carlo Zaniolo, PRIMA: archiving and querying historical data with evolving schemas, Proceedings of the 35th SIGMOD international conference on Management of data, June 29-July 02, 2009, Providence, Rhode Island, USA
|
|