ACM Home Page
Please provide us with feedback. Feedback
Information discovery in loosely integrated data
Full text PdfPdf (495 KB)
Source
International Conference on Management of Data archive
Proceedings of the 2007 ACM SIGMOD international conference on Management of data table of contents
Beijing, China
SESSION: Group 4 table of contents
Pages: 1147 - 1149  
Year of Publication: 2007
ISBN:978-1-59593-686-8
Authors
Heasoo Hwang  IBM Almaden Research Center, San Jose, CA
Andrey Balmin  IBM Almaden Research Center, San Jose, CA
Hamid Pirahesh  IBM Almaden Research Center, San Jose, CA
Berthold Reinwald  IBM Almaden Research Center, San Jose, CA
Sponsors
ACM: Association for Computing Machinery
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 10,   Downloads (12 Months): 74,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1247480.1247637
What is a DOI?

ABSTRACT

We model heterogeneous data sources with cross references, such as those crawled on the (enterprise) web, as a labeled graph with data objects as typed nodes and references or links as edges. Given the labeled data graph, we introduce flexible and efficient querying capabilities that go beyond existing capabilities by additionally discovering meaningful relationships between objects that satisfy keyword and/or structured query filters. We introduce the relationship search operator that exploits the link structure between data objects to rank objects related to the result of a filter. We implement the search operator using the ObjectRank [1] algorithm that uses the random surfer model. We study several alternatives for constructing summary graphs for query results that consist of individual and aggregate nodes that are somehow linked to qualifying result nodes. Some of the summary graphs are useful for presenting query results to the user, while others could be used to evaluate subsequent queries efficiently without considering all the nodes and links in the original data graph.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Andrey Balmin, Vagelis Hristidis, Yannis Papakonstantinou: ObjectRank: Authority-Based Keyword Search in Databases. VLDB 2004: 564--575.
 
2
Paul Brown, Peter J. Haas, Jussi Myllymaki, Hamid Pirahesh, Berthold Reinwald, Yannis Sismanis: Toward Automated large-scale Information Integration and Discovery. Data Management in a Connected World 2005: 161--180.
 
3
IBM Entity Analytic Solutions (EAS)-Solution Overview www.ibm.com/software/data/db2/eas/
 
4

Collaborative Colleagues:
Heasoo Hwang: colleagues
Andrey Balmin: colleagues
Hamid Pirahesh: colleagues
Berthold Reinwald: colleagues