ABSTRACT
This report considers how combining different kinds of information can improve retrieval. The vector space model has been extended so that different classes of data are associated with distinct concept types, each with its own subvector. Two collections with multiple concept types are described: ISI-1460 and CACM-3204. Experiments indicate that regression methods can help predict relevance, given query-document similarity values for each concept type. After sampling and transformation of the data, the coefficient of determination for the best model was 0.48 for ISI and 0.66 for CACM. Average precision for the two collections was 11% (31%) better for probabilistic feedback using all concept types than for feedback using terms only. These findings may be of particular interest to designers of document retrieval or hypertext systems, since links are shown to be especially beneficial.
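As a rough illustration of the extended-vector idea, the sketch below computes one similarity value per concept type and combines them as a weighted sum. It is only a minimal sketch, not the method used in the report: the concept-type names (`terms`, `links`), the cosine measure, the sample vectors, and the coefficient values are all assumptions chosen for illustration. In the setting the abstract describes, such coefficients would be estimated by regression against relevance judgments.

```python
import math


def cosine(u, v):
    """Cosine similarity of two sparse subvectors stored as {concept: weight} dicts."""
    dot = sum(w * v.get(k, 0.0) for k, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # an empty subvector contributes no similarity
    return dot / (norm_u * norm_v)


def combined_similarity(query, doc, coefficients):
    """Weighted sum of the per-concept-type similarities.

    query and doc map a concept-type name to its subvector;
    coefficients maps a concept-type name to its (hypothetical) weight.
    """
    return sum(
        coef * cosine(query.get(ctype, {}), doc.get(ctype, {}))
        for ctype, coef in coefficients.items()
    )


# Hypothetical extended vectors with two concept types.
query = {
    "terms": {"retrieval": 1.0, "vector": 1.0},
    "links": {"doc42": 1.0},
}
doc = {
    "terms": {"retrieval": 1.0, "vector": 1.0},
    "links": {"doc42": 1.0, "doc7": 1.0},
}

# Hypothetical coefficients; not values reported in the source.
coefficients = {"terms": 0.6, "links": 0.4}

score = combined_similarity(query, doc, coefficients)
print(round(score, 4))
```

Keeping each concept type in its own subvector means a type can be added, dropped, or reweighted without touching the others, which is what makes a per-type regression fit possible.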