| A language for manipulating clustered web documents results |
| Full text |
Pdf
(326 KB)
|
Source
|
Conference on Information and Knowledge Management
archive
Proceeding of the 17th ACM conference on Information and knowledge management
table of contents
Napa Valley, California, USA
SESSION: DB: faceted search, web query results presentation
table of contents
Pages 23-32
Year of Publication: 2008
ISBN:978-1-59593-991-3
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 20, Downloads (12 Months): 160, Citation Count: 1
|
|
|
ABSTRACT
We propose a novel conception language for exploring the results retrieved by several internet search services (like search engines) that cluster retrieved documents. The goal is to offer users a tool to discover relevant hidden relationships between clustered documents. The proposal is motivated by the observation that visualization paradigms, based on either the ranked list or clustered results, do not allow users to fully exploit the combined use of several search services to answer a request. When the same query is submitted to distinct search services, they may produce partially overlapped clustered results, where clusters identified by distinct labels collect some common documents. Moreover, clusters with similar labels, but containing distinct documents, may be produced as well. In such a situation, it may be useful to compare, combine and rank the cluster contents, to filter out relevant documents. In the proposed language, we define several operators (inspired by relational algebra) that work on groups of clusters. New clusters (and groups) can be generated by combining (i.e., overlapping, refining and intersecting) clusters (and groups), in a set oriented fashion. Furthermore, several ranking functions are also proposed, to model distinct semantics of the combination.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
 |
2
|
|
| |
3
|
|
| |
4
|
T. Coates, D. Connolly, D. Dack, L. Daigle, R. Denenberg, M. Durst, P. Grosso, S. Hawke, R. Iannella, G. Klyne, L. Masinter, M. Mealling, M. Needleman, and N. Walsh. URIs, URLs, and URNs: Clarifications and recommendations 1.0. Technical report, World Wide Web Consortium, URI Planning Interest Group W3C/IETF, http://www.w3.org/TR/2001/NOTE-uri-clarification-20010921/, 2001.
|
| |
5
|
A. L. N. Fred and A. K. Jain. Robust data clustering. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR '03), 2:128, 2003.
|
 |
6
|
|
| |
7
|
N. Kampanya, R. Shen, S. Kim, C. North, and E. A. Fox. Citiviz: A visual user interface to the citidel system. LNCS, Springer Verlag, 3232:122--133, 2004.
|
| |
8
|
A. V. Leouski and W. B. Croft. An evaluation of techniques for clustering search results. Technical Report of the Department of Computer Science f University of Massachusetts at Amherst, IR-76:122--133, 1996.
|
| |
9
|
|
| |
10
|
S. Osinski. An algorithm for clustering of web search results. Master's thesis, Department of Computing Science, Poznan' University of Technology, http://project.carrot2.org/publications/osinski-2003-lingo.pdf, 2003.
|
 |
11
|
Marc M. Sebrechts , John V. Cugini , Sharon J. Laskowski , Joanna Vasilakis , Michael S. Miller, Visualization of search results: a comparative evaluation of text, 2D, and 3D interfaces, Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval, p.3-10, August 15-19, 1999, Berkeley, California, United States
[doi> 10.1145/312624.312634]
|
| |
12
|
E. Staley and M. Twidale. Graphical interfaces to support information search. Technical report, University of Illinois, http://people.lis.uiuc.edu/~twidale/irinterfaces/bib-main.html, 2000.
|
| |
13
|
|
| |
14
|
L. Zadeh. Fuzzy sets. Information and control, 8:338--353, 1965.
|
| |
15
|
|
|