ACM Home Page
Please provide us with feedback. Feedback
User-oriented document clustering: a framework for learning in information retrieval
Full text PdfPdf (777 KB)
Source Annual ACM Conference on Research and Development in Information Retrieval archive
Proceedings of the 9th annual international ACM SIGIR conference on Research and development in information retrieval table of contents
Palazzo dei Congressi, Pisa, Italy
Pages: 157 - 163  
Year of Publication: 1986
ISBN:0-89791-187-3
Authors
J. S. Deogun  Department of Camputer Science, University of Nebraska, Lincoln, Nebraska
V. V. Raghavan  Department of Computer Science, University of Regina, Regina, Saskatchewan, Canada S4S 0A2
Sponsor
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 5,   Downloads (12 Months): 20,   Citation Count: 13
Additional Information:

abstract   references   cited by   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/253168.253202
What is a DOI?

ABSTRACT

In information retrieval, cluster analysis is an important tool employed to enhance both efficiency and effectiveness of the retrieval process. Most clustering algorithms have difficulty in reflecting the closeness of documents as perceived by the user. A two phase scheme for document clustering, whose results reflect the “conceptual” clusters that are perceived by the user of the retrieval system, is proposed. Since the clusters obtained by this scheme are not characterized in terms of the document representations, a strategy for cluster searching is also developed. Both the proposed document clustering scheme and document searching strategy are experimentally evaluated using a test collection from the SMART system. The preliminary experimental results obtained are very encouraging.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Doyle, L.B. (1966). Breaking the Cost Barrier in Aut#atlc Classification. Report No. SP-2516, System Development Corp., Santa Nonl ca, Ca.
 
2
Dattola, R.T. (1971). Experiments with a fast algorithm for automatic classification. In the SMART Retrieval System- Experiments in Automatic Document Processing, G. Salton, Ed., Prentice-Hall, Englewood Cliffs, N.J.
 
3
Rocchlo, J.J. Jr. (1966). #nt retrieval systems - optimization and evaluation. Doctoral thesis. In: Information Storage and Retrieval, Scientific Report NO. ISR-10, Harvard Univ., Cambridge, MaSs.
 
4
van Rijsbergen, C.J. and Croft, W.B. (1975). Document clustering: An evaluation of some experln#ts with the Cranfield 1400 collectlon. Inf. Process #anage., 11, 171-182.
5
 
6
Yu, C.T. (1974). A clustering algorlthm . based on user queries. J. Amer. Soc. Infom. Sol., 25, 218-226.
7
 
8
Sparck-Jc#es, K. (1971). Automatic Keyword Classiflcatlon for Information Retrieval. Archon Books, Cormecticut.
9
 
10
Dattola, R.T. and Ruffly, O.M. (1967). An experiment in autcmatlc thesaurus ccnstructlon. In: Information Storage and Retrieval, Scientific Report No. ISR-13, Chpt. VIII, Cornell University, Ithaca, N.Y.
11
12
 
13
Jardine, N. and Sibson, R. (1968). Amodel{ for taxonomy. Mathematical Biosciences, 2, 456-482.
14
 
15
Jardlne, N. and van Rijsbergen, C.J. (1971). The use of hierarchic clustering in information retrieval. Information Storage and Retrieval, 7,217-240.
 
16
Rocchio, Jr., J.O. (1971). Relevance feedback in information retrieval, in the Smart Retrieval System- Experiments in Autcmatlc Document Processing, G. Salton, Ed., Prentice Hall Inc., Engl# Cliffs, N.J.
 
17
Knuth, D.E. (1973). Sorting and Searching- The Art of #ter Progrmsmlng. Vol. 3, Addison-Wesley Publishing Co., Reading, N#.
 
18
Salton, G. (1980). Automatic term class cormtructlon using relevance -A summary of work in autum#tlc psmxlo-classlfication. Inf. Process. and Nanage., 16,1-15.
 
19
Robertson, S.E. and Sparck Jo#es, K. (1976). Relevance welghtlng of search terms. J. of AmDr. Soc. Inform. Sci., 27, 129-146.
20
21
22
 
23
Croft, W.B. (1980). A model of cluster searching based on classification. Information systems, 5,189-195.

CITED BY  13
Collaborative Colleagues:
J. S. Deogun: colleagues
V. V. Raghavan: colleagues