|
ABSTRACT
Human Genome Project databases present a confluence of interesting database challenges: rapid schema and data evolution, complex data entry and constraint management, and the need to integrate multiple data sources and software systems which range over a wide variety of models and formats. While these challenges are not necessarily unique to biological databases, their combination, intensity and complexity are unusual and make automated solutions imperative. We illustrate these problems in the context of the Philadelphia Genome Center for Human Chromosome 22, and describe a new approach to a solution for these problems, by means of a deductive language for expressing database transformations and constraints.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
W. Barker, D. George, L. Hunt, and J. GaraveUi, "The PIR protein sequence database," Nucleic Acids Research, vol. 19, pp. 2231-2236, 1991.
|
| |
2
|
P. Pearson, "The genome data base (GDB), a human genome mapping repository," Nucleic Acids Research, vol. 19, pp. 2237-2239, 1991.
|
| |
3
|
S. F. Altschul, W. Gish, W. Miller, E. W. Myers, and D. J. Lipman, "Basic local alignment search tool," Journal of Molecular Biology, vol. 215, pp. 403-410, 1990.
|
| |
4
|
W. R. Pearson, "Rapid and sensitive sequence comparison with FASTP and FASTA," Proc. Natl. Acad. Sci. U.S.A., vol. 85, pp. 2444-2448, 1990.
|
| |
5
|
National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD, EN- TREZ: Sequences Users' Guide, 1992. Release 1.0.
|
| |
6
|
M. J. Cinkosky, J. Fiekett, D. Nelson, and T. G. Marr, "The restructuring of GenBank," October 1987.
|
| |
7
|
K. Hart, D. B. Searls, and G. C. Overton, "SORTEZ: A relational translator for NCBI's ASN.1 database," Computer Applications in the Biosciences, vol. 10, no. 3, 1994. To appear. See also UPenn Technical Report CBIL-9203.
|
| |
8
|
G. C. Overton, J. Aaronson, J. Haas, and J. Adams, "QGB: A system for querying sequence database fields and features," Computational Biology, 1994. To appear.
|
| |
9
|
Department of Energy, DOE In/ormatics Summit Meeting Report, April 1993. Available via gopher at gopher, gdb. org.
|
| |
10
|
N. Goodman, S. Rozen, and L. Stein, "'Requirements for a deductive query language in the Map- Base genome-mapping database," in Proceedings of Workshop on Programming with Logic Databases, Vancouver, BC, October 1993.
|
 |
11
|
|
| |
12
|
|
| |
13
|
E. Szeto and V. M. Markowitz, "Erdraw 4.0: A graphical editor for extended entity-relationship schemas, reference manual," Tech. Rep. LBL- PUB-3084, Lawrence Berkeley Laboritory, Berkeley, California, 1993.
|
| |
14
|
S. Abiteboul and C. Beeri, "On the power of languages for the manipulation of complex objects," in Proceedings of International Workshop on Theory and Applications of Nested Relations and Complex Objects, (Darmstadt), 1988. Also available as IN- RIA Technical Report 846.
|
| |
15
|
|
| |
16
|
|
| |
17
|
A. S. Kosky, "A language for database transformations and constrains," 1993. Manuscript available from kosky@saul, cis.upema, edu.
|
| |
18
|
L. Wong, "Querying nested collections: A dissertation proposal," August 1993. Manuscript available from li_msoon@saul, cis. upenn, edu.
|
| |
19
|
|
 |
20
|
|
| |
21
|
S. Widjojo, D. S. Wile, and R. Hull, "Worldbase: A new approach to shaxing distributed information," tech. rep., USC/Information Sciences Institute, February 1990.
|
| |
22
|
|
| |
23
|
|
| |
24
|
S. Navathe, R. Elmasri, and J. Larson, "Integrating user views in database design," IEEE Computer, vol. 19, pp. 50-62, January 1986.
|
 |
25
|
|
 |
26
|
|
|