ACM Home Page
Please provide us with feedback. Feedback
Zerber+R: top-k retrieval from a confidential index
Full text PdfPdf (999 KB)
Source Extending Database Technology; Vol. 360 archive
Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology table of contents
Saint Petersburg, Russia
SESSION: Research sessions: Top-K techniques table of contents
Pages 439-449  
Year of Publication: 2009
ISBN:978-1-60558-422-5
Authors
Sergej Zerr  Leibniz University of Hanover, Hanover, Germany
Daniel Olmedilla  Leibniz University of Hanover, Hanover, Germany
Wolfgang Nejdl  Leibniz University of Hanover, Hanover, Germany
Wolf Siberski  Leibniz University of Hanover, Hanover, Germany
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 5,   Downloads (12 Months): 46,   Citation Count: 0
Additional Information:

abstract   references   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1516360.1516412
What is a DOI?

ABSTRACT

Privacy-preserving document exchange among collaboration groups in an enterprise as well as across enterprises requires techniques for sharing and search of access-controlled information through largely untrusted servers. In these settings search systems need to provide confidentiality guarantees for shared information while offering IR properties comparable to the ordinary search engines. Top-k is a standard IR technique which enables fast query execution on very large indexes and makes systems highly scalable. However, indexing access-controlled information for top-k retrieval is a challenging task due to the sensitivity of the term statistics used for ranking.

In this paper we present Zerber+R -- a ranking model which allows for privacy-preserving top-k retrieval from an outsourced inverted index. We propose a relevance score transformation function which makes relevance scores of different terms indistinguishable, such that even if stored on an untrusted server they do not reveal information about the indexed data. Experiments on two real-world data sets show that Zerber+R makes economical usage of bandwidth and offers retrieval properties comparable with an ordinary inverted index.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Alspach, D. and Sorenson, H. Nonlinear Bayesian Estimation Using Gaussian Sum Approximations. IEEE Transactions on Automatic Control, Vol. 17, No. 4, p. 439--448, Aug., 1972.
 
2
 
3
 
4
 
5
Boneh, D., Crescenzo, G. D., Ostrovsky, R., and Persiano, G., Public-key encryption with keyword search. In Proceedings of Eurocrypt 2004.
 
6
 
7
Chang, Y.-C. and Mitzenmacher, M. Privacy preserving keyword searches on remote encrypted data. Cryptology ePrint Archive, Report 2004/051, Feb 2004.
 
8
 
9
Goh, E., Shacham, H., Modadugu, N. and Boneh, D. Sirius: Securing remote untrusted storage. In NDSS, 2003.
10
11
 
12
 
13
Kohlschütter, C., Chirita, P.-A. and Nejdl W. Using Link Analysis to Identify Aspects in Faceted Web Search. SIGIR'2006 Faceted Search Workshop, 2006, Seattle, WA.
 
14
 
15
 
16
Open Directory Project: http://www.dmoz.org/
 
17
Rice, J. Mathematical Statistics and Data Analysis II Edition 1995. ISBN 0-534-20934-3
 
18
Singhal, A. Modern Information Retrieval: A Brief Overview. In IEEE, Data Eng. Bull. 24(4), 2001
 
19
 
20
Stud IP LMS. Available at: http://www.studip.de/.
21
22
Collaborative Colleagues:
Sergej Zerr: colleagues
Daniel Olmedilla: colleagues
Wolfgang Nejdl: colleagues
Wolf Siberski: colleagues