ACM Home Page
Please provide us with feedback. Feedback
An initial evaluation of automated organization for digital library browsing
Full text PdfPdf (243 KB)
Source International Conference on Digital Libraries archive
Proceedings of the 5th ACM/IEEE-CS joint conference on Digital libraries table of contents
Denver, CO, USA
SESSION: Tools & techniques track: browsing and visualizing collections table of contents
Pages: 246 - 255  
Year of Publication: 2005
ISBN:1-58113-876-8
Authors
Aaron Krowne  Emory University, Atlanta, GA
Martin Halbert  Emory University, Atlanta, GA
Sponsors
ACM: Association for Computing Machinery
SIGIR: ACM Special Interest Group on Information Retrieval
SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 10,   Downloads (12 Months): 130,   Citation Count: 4
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1065385.1065442
What is a DOI?

ABSTRACT

In this article we present an evaluation of text clustering and classification methods for creating digital library browse interfaces, focusing on the particular case of collections made up of heterogeneous metadata records. This situation is common in "portal" style digital libraries, which are built by harvesting content from many disparate sources, typically using the Open Archives Protocol for Metadata Harvesting (OAI-PMH). By studying the activity of users in an experimental system, we find that taxonomies built or populated using machine-learning (or "AI") techniques provide a potentially useful avenue for browsing in this digital library scenario.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
Deerwester, S. C., Dumais, S. T., Landauer, T. K., Furnas, G. W., Harshman, R. A. Indexing by latent semantic analysis. Journal of the American Society of Information Science, 41(6):391--407, 1990.
 
3
Garner, S. R. WEKA: The Waikato environment for knowledge analysis. In Proceedings of the New Zealand Computer Science Research Students Conference, pp. 57--64, 1995.
 
4
 
5
Kaufman, L., Rousseeuw, P.J.Finding Groups in Data: An Introduction to Cluster Analysis. John Wiley & Sons, 1990.
 
6
 
7
Lee, D., Seung, H. S. Learning the parts of objects by non-negative matrix factorization.Nature, vol. 401, October 1999.
8
 
9
McCallum, A. K.Bow: A toolkit for statistical language modeling, text retrieval, classification and clustering, 1996. http://www.cs.cmu.edu/~mccallum/bow.
 
10
11
12
 
13
Willett, P. Document clustering using an inverted file approach. Journal of Information Science, 2:223--231, 1990.
14


Collaborative Colleagues:
Aaron Krowne: colleagues
Martin Halbert: colleagues