ACM Home Page
Please provide us with feedback. Feedback
On content-driven search-keyword suggesters for literature digital libraries
Full text PdfPdf (423 KB)
Source
International Conference on Digital Libraries archive
Proceedings of the 8th ACM/IEEE-CS joint conference on Digital libraries table of contents
Pittsburgh PA, PA, USA
SESSION: Automatic tools for digital libraries table of contents
Pages 21-24  
Year of Publication: 2008
ISBN:978-1-59593-998-2
Authors
Sulieman A. Bani-Ahmad  Case Western Reserve University, Cleveland, OH, USA
Gultekin Ozsoyoglu  Case Western Reserve University, Cleveland, OH, USA
Sponsors
SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web
SIGIR: ACM Special Interest Group on Information Retrieval
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 9,   Downloads (12 Months): 133,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1378889.1378893
What is a DOI?

ABSTRACT

We propose and evaluate a "content-driven search keyword suggester" for keyword-based search in literature digital libraries. Suggesting search keywords at an early stage, i.e., while the user is entering search terms, is helpful for constructing more accurate, less ambiguous, and focused search keywords for queries. Our search keyword suggestion approach is based on an a priori analysis of the publication collection in the digital library at hand, and consists of the following steps. We (i) parse the document collection using the Link Grammar parser, a syntactic parser of English, (ii) group publications based on their "most-specific" research topics, (iii) use the parser output to build a hierarchical structure of simple and compound tokens to be used to suggest search terms, (iv) use TextRank, a text summarization tool, to assign topic-sensitive scores to keywords, and (v) use the identified research-topics to help user aggregate search keywords prior to the actual search query execution.

We experimentally show that the proposed framework, which is optimized to work on literature digital libraries, promises a more scalable, high quality, and user-friendly search-keyword suggester when compared to its competitors. We validate our proposal experimentally using a subset of the ACM SIGMOD Anthology digital library as a testbed, and by employing the research-pyramid model to identify the "most-specific" research topics.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Wiki, http://en.wikipedia.org
 
2
Google Suggest Labs, http://www.google.com/webhp?complete=1&hl=en
 
3
CiteSeer, www.citeseer.com
 
4
ACM Digital Library, http://portal.acm.org/dl.cfm
 
5
IEEE XPlore, http://ieeexplore.ieee.org
 
6
ScienceDirect, www.sciencedirect.info
 
7
S. Aya, C. Lagoze, and T. Joachims, "Citation Classification and its Applications", ICKM 2005.
8
 
9
Google Scholar, http://scholar.google.com/scholar
 
10
Bani-Ahmad, S., Ozsoyoglu, T. "Improved Publication Scores for Online Digital Libraries via Research Pyramids", ECDL 2007.
 
11
Davy Temperley, Daniel Sleator, John Lafferty, Link Gramar Parser 4.1 - http://www.link.cs.cmu.edu/link, 2005.
 
12
D. Sleator, D. Temperly and J. Lafferty, Link Grammar Parser version 4.1 (http://www.link.cs.cmu.edu/link/), 2005
 
13
Derek Sisson, A Thoughtful Approach to Web Site Quality, http://www.philosophe.com/
 
14
iProspect Inc., iProspect Search Engine User Behavior study, iProspect 2006.
15
 
16
Y. Zhang and A. Moffat, Some Observations on User Search Behavior, the 11th Australian Document Computing Symposium, Australia, 2006.
 
17
J. Wusteman and P. O'hIceadha, Using Ajax to Empower Dynamic Searching, Information Technolgy and Libraries, Vol. 25 No 2, June 2006, PP 57--64.
 
18
R. Mihalcea and P. Tarau, TextRank: Bringing Order into Texts, in Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2004), Barcelona, Spain, July 2004.
 
19
The Onix full text indexing engine, The most widely used stopword list, http://www.lextek.com/manuals/onix/stopwords1.html
 
20
Using AJAX to Empower Dynamic Searching, Information Technology and Libraries, Vol 25, No 2, June 2006
 
21
 
22
elGiza Content-Based Search-keyword Suggester, http://sulieman.case.edu/giza/cbsks_xkb_search/
 
23
The CompleteSearch engine, http://search.mpi-inf.mpg.de/
24
25
 
26
Sulieman Bani-Ahmad, Gultekin Ozsoyoglu, elGiza, A Research-Pyramid Based Search Tool for Vertical Literature Digital Libraries, DBRank 2008.
 
27
 
28
Gupta, Chirag, Efficient K-Word Proximity Search, MS Thesis, CWRU, EECS Department, 2008.
 
29
 
30
AJAX tutorial, http://www.w3schools.com/ajax/
 
31

Collaborative Colleagues:
Sulieman A. Bani-Ahmad: colleagues
Gultekin Ozsoyoglu: colleagues