ABSTRACT
In information retrieval, cluster analysis is an important tool for enhancing both the efficiency and the effectiveness of the retrieval process. Most clustering algorithms have difficulty reflecting the closeness of documents as perceived by the user. A two-phase scheme for document clustering is proposed, whose results reflect the “conceptual” clusters perceived by the user of the retrieval system. Since the clusters obtained by this scheme are not characterized in terms of the document representations, a strategy for cluster searching is also developed. Both the proposed document clustering scheme and the cluster searching strategy are experimentally evaluated using a test collection from the SMART system. The preliminary experimental results are encouraging.