| Repository synchronization in the OAI framework |
| Full text |
Pdf
(98 KB)
|
| Source
|
International Conference on Digital Libraries
archive
Proceedings of the 3rd ACM/IEEE-CS joint conference on Digital libraries
table of contents
Houston, Texas
SESSION: OAI in action
table of contents
Pages: 191 - 198
Year of Publication: 2003
ISBN:0-7695-1939-3
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
IEEE Computer Society
Washington, DC, USA
|
| Bibliometrics |
Downloads (6 Weeks): 2, Downloads (12 Months): 23, Citation Count: 2
|
|
|
ABSTRACT
The Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) began as an alternative to distributed searching of scholarly eprint repositories. The model embraced by the OAI-PMH is that of metadata harvesting, where value-added services (by a "service provider") are constructed on cached copies of the metadata extracted from the repositories of the harvester's choosing. While this model dispenses with the well known problems of distributed searching, it introduces the problem of synchronization. Stated simply, this problem arises when the service provider's copy of the metadata does not match the metadata currently at the constituent repositories. We define some metrics for describing the synchronization problem in the OAI-PMH. Based on these metrics, we study the synchronization problem of the OAI-PMH framework and propose several approaches for harvesters to implement better synchronization. In particular, if a repository knows its update frequency, it can publish it in an OAI-PMH Identify response using an optional About container that borrows from RDF Site Syndication (RSS) Format.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
G. Beged-Dov, D. Brickley, R. Dornfest, I. Davis, L. Dodds, J. Eisenzopf, D. Galbraith, R. Guha, K. MacLeod, E. Miller, A. Swartz, and E. van der Vlist. RDF Site Summary 1.0 Modules: Syndication, 2000. http://purl.org/rss/1.0/modules/syndication/.
|
| |
2
|
|
| |
3
|
T. Brody. Mining the social life of an eprint archive. http://opcit.eprints.org/tdb198/opcit/.
|
| |
4
|
|
 |
5
|
|
| |
6
|
|
 |
7
|
Luis Gravano , Chen-Chuan K. Chang , Héctor García-Molina , Andreas Paepcke, STARTS: Stanford proposal for Internet meta-searching, Proceedings of the 1997 ACM SIGMOD international conference on Management of data, p.207-218, May 11-15, 1997, Tucson, Arizona, United States
[doi> 10.1145/253260.253299]
|
| |
8
|
|
 |
9
|
Carl Lagoze , William Arms , Stoney Gan , Diane Hillmann , Christopher Ingram , Dean Krafft , Richard Marisa , Jon Phipps , John Saylor , Carol Terrizzi , Walter Hoehn , David Millman , James Allan , Sergio Guzman-Lara , Tom Kalt, Core services in the architecture of the national science digital library (NSDL), Proceedings of the 2nd ACM/IEEE-CS joint conference on Digital libraries, July 14-18, 2002, Portland, Oregon, USA
[doi> 10.1145/544220.544264]
|
| |
10
|
C. Lagoze, H. Van de Sompel, M. Nelson, and S. Warner. The Open Archives Initiative Protocol for Metadata Harvesting, version 2.0. http://www.openarchives.org/OAI/openarchivesprotocol.html.
|
| |
11
|
|
| |
12
|
|
| |
13
|
X. Liu, K. Maly, M. Zubair, and M. L. Nelson. Arc - an OAI service provider for digital library federation. DLib Magazine, 7(4), 2001. http://www.dlib.org/dlib/april01/liu/04liu.html.
|
 |
14
|
Xiaoming Liu , Kurt Maly , Mohammad Zubair , Michael L. Nelson, Arc: an OAI service provider for cross-archive searching, Proceedings of the 1st ACM/IEEE-CS joint conference on Digital libraries, p.65-66, January 2001, Roanoke, Virginia, United States
[doi> 10.1145/379437.379451]
|
| |
15
|
|
| |
16
|
R. Rivest. The MD5 message-digest algorithm. Technical Report Internet RFC-1321, IETF, 1992. http://www.ietf.org/rfc/rfc1321.txt.
|
| |
17
|
A. Van Hoff, J. Giannandrea, M. Hapner, S. Carter, and M. M. The HTTP distribution and replication protocol. Technical Report NOTE-DRP, World Wide Web Consortium, 1997. http://www.w3.org/TR/NOTE-drp.
|
CITED BY 2
|
|
|
|
|
Michael L. Nelson , Joan A. Smith , Ignacio Garcia del Campo, Efficient, automatic web resource harvesting, Proceedings of the eighth ACM international workshop on Web information and data management, November 10-10, 2006, Arlington, Virginia, USA
|
|