|
ABSTRACT
Experimental information retrieval (IR) systems, some dating back to the sixties, have demonstrated the viability of fully automatic document storage and retrieval methodologies with small to medium size bibliographic collections [72]. Many of these experimental systems utilize the vector space model in which each important term (such as a word stem) identifies a different dimension in a space, so that matrix methods and vector operations can be defined on queries and documents. Statistical techniques have been very effective, and probabilistic enhancements have given additional improvements [84]. However, the basic vector space model is oriented towards recording the essential information in the text of a title/abstract combination rather than describing more complex document structures. It is necessary to extend the model in order to handle composite documents.On the other hand, commonly available retrieval systems that employ Boolean logic queries and utilize inverted file storage schemes can without modification accommodate such documents, albeit with somewhat less effectiveness than is possible with more sophisticated systems. Hence, it is also of interest to consider how Boolean logic systems can be extended to give better performance, especially with composite documents, and to integrate those approaches with vector methods.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
Allman, E., SENDMAIL - An Internetwork Mail Router. In UNIX Programmer'8 Manual, Berkeley Release 4.2, 1983.
|
 |
3
|
|
| |
4
|
|
| |
5
|
Bichteler, J. and Eaton III, E.A., The Combined Use of Bibliographic Coupling and Cocita- .tion for Document Retrieval. J. Am. Soc. Inf. Sci., 31(4), July 1980.
|
| |
6
|
|
 |
7
|
|
| |
8
|
Bolt Beranek, and Newman, Inc., Naming and Addressing in Computer Based Message Systems. Draft Report No. ICST/CBOS-82-4, Dept. of Commerce, National Bureau of Standards, Aug. 1982.
|
| |
9
|
Bookstein, A., Fuzzy Requests: An Approach to Weighted Boolean Searches. J. Am. Soc. Inf. Sci., 31(4), July 1980, 240-247.
|
| |
10
|
Bovey, J.D. and Robertson, S.E., An Algorithm for Weighted Searching on a Boolean System. Inf. Tech.: Res. Dev. Applications, 3(2), April 1984, 84-87.
|
| |
11
|
|
| |
13
|
Charniak, E., Context Recognition in Language Comprehension. In Strategies for Natural Language Processing, ed. by Wendy G. Lehnert and Martin H. Ringle, Lawrence Erlo baum Assoc., Hillsdale N J, 1982, 435-454.
|
| |
14
|
Chupin, J.C. and Joloboff, V., A Data Model for Office Systems. In Office Information. Systems, ed. by N. Naffah, North-Holland, Amsterdam, 1982, 39-56.
|
 |
15
|
|
| |
16
|
|
| |
17
|
Crawford, R.G., The Relational Model in Information Retrieval. J. Am. Soc. Inf. Sci., 32(1), 1981, 51-64.
|
| |
18
|
Crocker, D.H., Standard for the Format of ARPA Internet Text Messages. RFC 822, ARPANET Networking Group, Aug. 1982.
|
 |
19
|
|
| |
20
|
Croft, W.B. and Pezarro, M.T., Text Retrieval Techniques for the Automated Office. In Office Information Systems, ed. by N. Naffah, North-Holland, Amsterdam, 1982, 565-576.
|
| |
21
|
Croft, W.B., Experiments with Representation in a Document Retrieval System. Inf. Tech.: Res. Dev. Applications, 211), Jan. 1983, 1-22.
|
 |
22
|
|
| |
23
|
Daney, C., The VMSHARE Computer Conferencing Facility. In Computer Message Systems, ed. by Ronald P. Uhlig, North-Holland, Amsterdam, 1982, 115-127.
|
| |
24
|
Dattola, R.T., FIRST: Flexible Information Retrieval System for Text. J. Am. Soc. Inf. Sci., 30(1}, 1979, 9-14.
|
 |
25
|
|
| |
26
|
DeJong, G., An Overview of the FRUMP System. In Strategies for Natural Language Processing, ed. by Wendy G. Lehnert and Martin H. Ringle, Lawrence Erlbaum Assoc., Hillsdale N J, 1982, 149-176.
|
 |
27
|
|
| |
28
|
Eastman, C.M., File Searching Problems in Logic Programming Systems. Technical Report 83-CSE-8, Dept. of Comp. Sci. and Eng., Southern Methodist Univ., Feb. 1983.
|
 |
29
|
|
| |
30
|
Fox, E.A., Automatic Document and Passage Retrieval Methods: Aids to Searching the BahaT Writings. Proc. Annual Meeting Assoc. for Baha'i Studies, April 1981.
|
| |
31
|
Fox, E.A., Combining Information in an Extended Automatic Information Retrieval System for Agriculture. Infrastructure of an Information Society (Proc. Ist Int. Information Conf. Egypt, I3-I6 Dec. 198~), 1983.
|
| |
32
|
|
| |
33
|
Fox, E.A., Some Considerations for Implementing the SMART Information Retrieval System under UNIX. TR 83-560, Cornell Univ., Dept. of Comp. Sci., Sept. 1983.
|
| |
34
|
Fox, E.A., Characterization of Two New Experimental Collections in Computer and Information Science Containing Textual and Bibliographic Concepts. TR 83-561~ Cornell Univ., Dept. of Comp. Sci., Sept. 1983.
|
| |
35
|
|
| |
36
|
Frei, H.P. and Janslin, J.-F., Two- Dimensional Representation of Information Retrieval Services. In Representation and Exchange of Knowledge as a Basis of Information Processes, ed. by Hans J. Dietschmann, North- Holland, New York, 1984, 383-396.
|
| |
37
|
Fuhr, N. and Knorz, G., Retrieval Test Evaluation of a Rule Based Automatic Indexing. DV II 84-2, Technische Hochschule Darmstadt, 1984.
|
| |
38
|
Greenfield, R.H., An Experiment to Measure the Performance of Phonetic Key Compression Retrieval Schemes. Meth. Inform. Med., 16, 1977, "230-233.
|
| |
39
|
Hahn, U. and Reimer, U., Heuristic Text Parsing in 'Topic': Methodological Issues in a Knowledge-based Text Condensation System. In Representation and Exchange of Knowledge as a Basis of Information Processes, ed. by Hans J. Dietschmann, North-Holland, New York, 1084, 143-163.
|
 |
40
|
|
| |
41
|
|
| |
42
|
|
 |
43
|
|
| |
44
|
IFIP WG 6.5. A User-friendly Naming Convention for Use in Communication Networks. Working Paper, Version 3, IFIP WG 6.5, March 1984.
|
| |
45
|
Jennings, M., The Electronic Manuscript Project. Bulletin of the Am. Soc. Inf. Sci., 10(3), Feb. 1984, 11-13.
|
| |
46
|
Joseph, D.M. and Wong, R.L., Correction of Misspellings and Typographic Errors in a Free- Text Medical English Information Storage and Retrieval System. Meth. Inform. Med., 18, 1979, 238-234.
|
| |
47
|
Katzer, J., et al. A Study of the Overlap Among Document Representations. Syracuse Univ. School of Inform. Studies, 1982.
|
| |
48
|
Kessler, M.M., Bibliographic Coupling Between Scientific Papers. Amer. Doc., 14(1), Jan. 1963, 10-25.
|
 |
49
|
|
| |
50
|
Korfhage, R.R. and Chavarria-Garza, H., Retrieval Improvement by the Interaction of Queries and User Profiles. Proc. of COMPSAC '82, Sizth International Conference on Computer Software 8Y Applications, Nov. 1982, 470-475.
|
| |
51
|
|
| |
52
|
Lamb, D.A., RdMaa Message Management System: User's Guide and Reference. 7th Ed. Carnegie-Mellon Univ. Comp. Sci. Dept., Pittsburgh, PA, Aug. 1982.
|
 |
53
|
|
| |
54
|
|
| |
55
|
Macleod, I.A. and Crawford, R.G., Document Retrieval as a Database Application. Inf. Tech.: Res. Dev. Applications, 2(1), Jan. 1~983, 43- 60.
|
| |
56
|
McGiI1, M.J., Koll, M. and Noreanlt, T., An Evaluation of Factors Affecting Document Ranking "By Information Retrieval Systems. Syracuse Univ. School of Inform. Studies, 1979.
|
| |
57
|
Minsky, M., A Framework for Representing Knowledge. In The Psychology of Computer' Vision, ed. by P. Winston, McGraw-Hill, New York, 1975.
|
| |
58
|
Mooers, C.D., The Hermes Guide. Report No. 4995, BBN, Inc., Aug. 1982.
|
| |
59
|
|
| |
60
|
Myer, T.H., Standards for Global Messaging: A Progress Report. J. Telecommunication Networks, 2(4), Winter 1983.
|
| |
61
|
National Bureau of Standards. Message Format for Computer-Based Message Systems. Federal Inf. Proc. Standards Pub. {FIPS PUB) 98, NTIS, March 1983.
|
| |
62
|
Nodtvedt, E., Information Retrieval in the Business Environment. TR 80-447, Cornell Univ., Dept. of Comp. Sci., 1980.
|
| |
63
|
O'Connor, J., Answer-Passage Retrieval by Text Searching. J. Am. Soc. Inf. Sei., 31(4), 1980, 227-239.
|
| |
64
|
|
| |
65
|
Pereira, F., Logic for Natural Language Analysis. Technical Note 275, SRI International, Jan. 1983.
|
| |
66
|
Riesbeckh, C.K., Realistic Language Comprehension. In Strategies for Natural Language Processing, ed. by Wendy G. Lehnert and Martin H. Ringle, Lawrence Erlbaum Assoc., Hillsdale N J, 1982, 37-54.
|
| |
67
|
Ritchie, G.D. and Hanna, F.K., Semantic Networks - a General Definition and a Survey. Inf. Tech.: Res. Dev. Applications, 2(4), Oct. 1983, 187-231.
|
| |
68
|
Roach, J. and Savarese, J., Designing n Natural Language Interface for a Graphics Editor. Virginia Poly. Inst. and State Univ., Dept of Comp. Sci., 1982.
|
| |
69
|
Roach, J. and Fowler, G., The HC Manual: Virginia Tech Prolog. Technical Manual, Virginia Poly. Inst. and State Univ., Dept of Comp. Sci., 1983.
|
| |
70
|
Rumelhart, D.E., Notes on a Schema for Stories. In Representation and Understanding, ed. by D. G. Bobrow and A. Collins, Academic Press, New York, 1975, 211-236.
|
 |
71
|
|
| |
72
|
Salton, G., The SMART System 1961-1976: Experiments in Dynamic Document Processing. In Encyclopedia of Library and Information Science, 1980, 1-36.
|
 |
73
|
|
| |
74
|
Simmons, R.F., Computations from the English. Prentice Hall, Englewood Cliffs N J, 1984.
|
| |
75
|
Sirbu, Jr., M.A. and Sutherland, J.B., Naming and Directory Issues in Message Transfer Systems. Proc. IFIP 6.5 Working Conf., Nottingham, England, May 1984.
|
| |
76
|
Small, H.G., Co-Citation in the Scientific Literature: A New Measure of the Relationship Between Two Documents. J. Am. Soc. Inf. Sci., 24(4), July-Aug. 1973, 265-269.
|
| |
77
|
Small, S. and Rieger, C., Parsing and Comprehending with Word Experts (A Theory' and its Realization). In Strategies for Natural LanguageProcessing, ed. by Wendy G. Lehnert and Martin H. Ringle, Lawrence Erlbaum Assoc., Hillsdale N J, 1982, 89-148.
|
| |
78
|
|
| |
79
|
Smith, L.C. and Warner, A.J., A Taxonomy of Represen~tion~ in Information Retrieval System Design. I n Representation and Ezchange of Knowledge as a Basis of Information Processes, ed. by Hans J. Dietschmann, North-Holland, New York, 1984, 31-49.
|
| |
80
|
Solomon, M., Landweber, L.H. and Neuhengen, D., The CSNET Name Server. In Computer Networks 6, North-Holland, 1982.
|
| |
81
|
Spark Jones, K. and Tait, J.I., Automatic Search Term Variant Generation. J. Doc., 40(1), March 1984, 50-66.
|
 |
82
|
|
| |
83
|
Tong, R.M., et at. A Rule-Based Approach to Information Retrieval: Some Results and Comments. Proc. AAAI-83, 1983.
|
| |
84
|
|
| |
85
|
Vickers, P.H., Common Problems of Documentary Information Transfer, Storage and Retrieval in Industrial Organizations. J. Doe., 39(4), Dec. 1983, 217-229.
|
| |
86
|
|
| |
87
|
Yu, C.T., Buckley, C., Lain, K. and Salton, G., A Generalized Term Dependence Model in Information Retrieval. Inf. Tech.: Res. and Dev., 2(4), Oct. 1983.
|
| |
88
|
Zadeh, L.A., Fuzzy Sets. Information and Control, 8, 1055, 338-353.
|
| |
89
|
Zarri, G.P., An Outline of the Representation and Use of Temporal Data in the RESEDA System. Inf. Tech.: Res. Dev. Applications, 2(2/3), July 1083, 80-108.
|
CITED BY 4
|
|
Robert W.P. Luk , H. V. Leong , Tharam S. Dillon , Alvin T.S. Chan , W. Bruce Croft , James Allan, A survey in indexing and searching XML documents, Journal of the American Society for Information Science and Technology, v.53 n.6, p.415-437, May, 2002
|
|
|
|
|
|
|
|
|
|
|