ACM Home Page
Please provide us with feedback. Feedback
Ranking distributed probabilistic data
Full text PdfPdf (546 KB)
Source
International Conference on Management of Data archive
Proceedings of the 35th SIGMOD international conference on Management of data table of contents
Providence, Rhode Island, USA
SESSION: Research session 10: probabilistic databases I table of contents
Pages 361-374  
Year of Publication: 2009
ISBN:978-1-60558-551-2
Authors
Feifei Li  Florida State University, Tallahassee, FL, USA
Ke Yi  Hong Kong University of Science and Technology, Hong Kong, China
Jeffrey Jestes  Florida State University, Tallahassee, FL, USA
Sponsors
ACM: Association for Computing Machinery
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 55,   Downloads (12 Months): 203,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1559845.1559885
What is a DOI?

ABSTRACT

Ranking queries are essential tools to process large amounts of probabilistic data that encode exponentially many possible deterministic instances. In many applications where uncertainty and fuzzy information arise, data are collected from multiple sources in distributed, networked locations, e.g., distributed sensor fields with imprecise measurements, multiple scientific institutes with inconsistency in their scientific data. Due to the network delay and the economic cost associated with communicating large amounts of data over a network, a fundamental problem in these scenarios is to retrieve the global top-k tuples from all distributed sites with minimum communication cost. Using the well founded notion of the expected rank of each tuple across all possible worlds as the basis of ranking, this work designs both communication- and computation-efficient algorithms for retrieving the top-k tuples with the smallest ranks from distributed sites. Extensive experiments using both synthetic and real data sets confirm the efficiency and superiority of our algorithms over the straightforward approach of forwarding all data to the server.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
3
4
5
6
7
 
8
9
10
 
11
 
12
 
13
14
 
15
GLPK. GNU Linear Programming Kit. http://www.gnu.org/software/glpk/.
16
17
18
 
19
 
20
 
21
22
 
23
 
24
 
25
 
26
 
27
C. Re, N. Dalvi, and D. Suciu. efficient top-k query evaluation on probalistic databases. In ICDE, 2007.
 
28
 
29
P. Sen and A. Deshpande. Representing and querying correlated tuples in probabilistic databases. In ICDE, 2007.
 
30
31
 
32
 
33
M. A. Soliman, I. F. Ilyas, and K. C.-C. Chang. Top-k query processing in uncertain databases. In ICDE, 2007.
34
 
35
36
 
37
X. Zhang and J. Chomicki. On the semantics and evaluation of top-k queries in probabilistic databases. In DBRank, 2008.

Collaborative Colleagues:
Feifei Li: colleagues
Ke Yi: colleagues
Jeffrey Jestes: colleagues