|
ABSTRACT
We consider the problem of mapping data in peer-to-peer data-sharing systems. Such systems often rely on the use of mapping tables listing pairs of corresponding values to search for data residing in different peers. In this paper, we address semantic and algorithmic issues related to the use of mapping tables. We begin by arguing why mapping tables are appropriate for data mapping in a peer-to-peer environment. We discuss alternative semantics for these tables and we present a language that allows the user to specify mapping tables under different semantics. Then, we show that by treating mapping tables as constraints (called mapping constraints) on the exchange of information between peers it is possible to reason about them. We motivate why reasoning capabilities are needed to manage mapping tables and show the importance of inferring new mapping tables from existing ones. We study the complexity of this problem and we propose an efficient algorithm for its solution. Finally, we present an implementation along with experimental results that show that mapping tables may be managed efficiently in practice.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
GDB. http://www.gdb.org/.
|
| |
2
|
Gnutella. http://www.gnutelliums.com/.
|
| |
3
|
Hugo. http://www.gene.ucl.ac.uk/hugo/.
|
| |
4
|
Hyperion. http://www.cs.toronto.edu/db/hyperion/.
|
| |
5
|
Locus. http://www.ncbi.nlm.nih.gov/LocusLink/.
|
| |
6
|
MIM. http://www.ncbi.nlm.nih.gov/omim/.
|
| |
7
|
Napster. http://www.napster.com/.
|
| |
8
|
SWISS-PROT. http://www.ebi.ac.uk/swissprot/.
|
| |
9
|
Unigene. http://www.ncbi.nlm.nih.gov/UniGene/.
|
| |
10
|
|
| |
11
|
P. Bernstein, F. Giunchiglia, A. Kementsietsidis, J. Mylopoulos, L. Serafini, and I. Zaihrayeu. Data Management for Peer-to-Peer Computing: A Vision. In WebDB, 2002.
|
| |
12
|
|
 |
13
|
|
 |
14
|
|
| |
15
|
S. Davidson, G. C. Overton, and P. Buneman. Challenges in Integrating Biological Data Sources. Journal of Computational Biology, 2(4):557--572, 1995.
|
| |
16
|
|
| |
17
|
S. Gribble, A. Halevy, Z. Ives, M. Rodrig, and D. Suciu. What Can Databases Do for Peer-to-Peer? In WebDB, 2001.
|
| |
18
|
A. Halevy, Z. Ives, D. Suciu, and I. Tatarinov. Schema Mediation in Peer Data Management Systems. To appear in ICDE 2003.
|
| |
19
|
Matthew Harren , Joseph M. Hellerstein , Ryan Huebsch , Boon Thau Loo , Scott Shenker , Ion Stoica, Complex Queries in DHT-based Peer-to-Peer Networks, Revised Papers from the First International Workshop on Peer-to-Peer Systems, p.242-259, March 07-08, 2002
|
 |
20
|
|
| |
21
|
|
| |
22
|
Jayant Madhavan , Philip A. Bernstein , Pedro Domingos , Alon Y. Halevy, Representing and reasoning about mappings between domain models, Eighteenth national conference on Artificial intelligence, p.80-86, July 28-August 01, 2002, Edmonton, Alberta, Canada
|
| |
23
|
L. Popa, Y. Velegrakis, R. J. Miller, M. A. Hernandez, and R. Fagin. Translating Web Data. In VLDB, pages 598--609, Aug. 2002.
|
 |
24
|
Sylvia Ratnasamy , Paul Francis , Mark Handley , Richard Karp , Scott Schenker, A scalable content-addressable network, Proceedings of the 2001 conference on Applications, technologies, architectures, and protocols for computer communications, p.161-172, August 2001, San Diego, California, United States
|
| |
25
|
S. Sarawagi, editor. IEEE Data Engineering Bulletin: Special Issue on Data Cleaning, volume 23, Dec. 2000.
|
 |
26
|
Ion Stoica , Robert Morris , David Karger , M. Frans Kaashoek , Hari Balakrishnan, Chord: A scalable peer-to-peer lookup service for internet applications, Proceedings of the 2001 conference on Applications, technologies, architectures, and protocols for computer communications, p.149-160, August 2001, San Diego, California, United States
|
| |
27
|
|
CITED BY 41
|
|
|
|
|
|
|
|
Igor Tatarinov , Zachary Ives , Jayant Madhavan , Alon Halevy , Dan Suciu , Nilesh Dalvi , Xin (Luna) Dong , Yana Kadiyska , Gerome Miklau , Peter Mork, The Piazza peer data management project, ACM SIGMOD Record, v.32 n.3, September 2003
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Angela Bonifati , Elaine Qing Chang , Aks V. S. Lakshmanan , Terence Ho , Rachel Pottinger, HePToX: marrying XML and heterogeneity in your P2P databases, Proceedings of the 31st international conference on Very large data bases, August 30-September 02, 2005, Trondheim, Norway
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Patricia Rodríguez-Gianolli , Anastasios Kementsietsidis , Maddalena Garzetti , Iluju Kiringa , Lei Jiang , Mehedi Masud , Renée J. Miller , John Mylopoulos, Data sharing in the Hyperion peer database system, Proceedings of the 31st international conference on Very large data bases, August 30-September 02, 2005, Trondheim, Norway
|
|
|
Boualem Benatallah , Mohand-Said Hacid , Hye-young Paik , Christophe Rey , Farouk Toumani, Towards semantic-driven, flexible and scalable framework for peering and querying e-catalog communities, Information Systems, v.31 n.4, p.266-294, June 2006
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Deepavali Bhagwat , Laura Chiticariu , Wang-Chiew Tan , Gaurav Vijayvargiya, An annotation management system for relational databases, Proceedings of the Thirtieth international conference on Very large data bases, p.900-911, August 31-September 03, 2004, Toronto, Canada
|
|
|
|
|
|
|
|
|
Enrico Franconi , Gabriel Kuper , Andrei Lopatenko , Ilya Zaihrayeu, Queries and updates in the coDB peer to peer database system, Proceedings of the Thirtieth international conference on Very large data bases, p.1277-1280, August 31-September 03, 2004, Toronto, Canada
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Diego Calvanese , Giuseppe De Giacomo , Domenico Lembo , Maurizio Lenzerini , Riccardo Rosati, Inconsistency tolerance in P2P data integration: An epistemic logic approach, Information Systems, v.33 n.4-5, p.360-384, June, 2008
|
|
|
Eric Chu , Akanksha Baid , Ting Chen , AnHai Doan , Jeffrey Naughton, A relational approach to incrementally extracting and querying structure in unstructured data, Proceedings of the 33rd international conference on Very large data bases, September 23-27, 2007, Vienna, Austria
|
|
|
Zachary G. Ives , Todd J. Green , Grigoris Karvounarakis , Nicholas E. Taylor , Val Tannen , Partha Pratim Talukdar , Marie Jacob , Fernando Pereira, The ORCHESTRA Collaborative Data Sharing System, ACM SIGMOD Record, v.37 n.3, September 2008
|
|
|
|
|
|
|
|