ACM Home Page
Please provide us with feedback. Feedback
Indexing and retrieval of scientific literature
Full text PdfPdf (985 KB)
Source Conference on Information and Knowledge Management archive
Proceedings of the eighth international conference on Information and knowledge management table of contents
Kansas City, Missouri, United States
Pages: 139 - 146  
Year of Publication: 1999
ISBN:1-58113-146-1
Authors
Steve Lawrence  NEC Research Institute, 4 Independence Way, Princeton, NJ
Kurt Bollacker  NEC Research Institute, 4 Independence Way, Princeton, NJ
C. Lee Giles  NEC Research Institute, 4 Independence Way, Princeton, NJ
Sponsors
SIGART: ACM Special Interest Group on Artificial Intelligence
SIGIR: ACM Special Interest Group on Information Retrieval
SIGMIS: ACM Special Interest Group on Management Information Systems
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 13,   Downloads (12 Months): 87,   Citation Count: 23
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/319950.319970
What is a DOI?

ABSTRACT

The web has greatly improved access to scientific literature. However, scientific articles on the web are largely disorganized, with research articles being spread across archive sites, institution sites, journal sites, and researcher homepages. No index covers all of the available literature, and the major web search engines typically do not index the content of Postscript/PDF documents at all. This paper discusses the creation of digital libraries of scientific literature on the web, including the efficient location of articles, full-text indexing of the articles, autonomous citation indexing, information extraction, display of query-sensitive summaries and citation context, hubs and authorities computation, similar document detection, user profiling, distributed error correction, graph analysis, and detection of overlapping documents. The software for the system is available at no cost for non-commercial use.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
J.M. Barrie and D.E. Presti. The World Wide Web as an instructional tool. Science, 274:371-372, 1996.
2
3
4
5
 
6
 
7
 
8
 
9
Robert D. Cameron. A universal citation database as a catalyst for reform in scholarly communication. First Monday, 2(4), 1997.
 
10
11
 
12
Eugene Garfield. Citation Indexing: Its Theory and Application in Science, Technology, and Humanities. Wiley, New York, 1979.
13
 
14
15
16
 
17
18
 
19
 
20
Steve Lawrence and C. Lee Giles. Searching the World Wide Web. Science, 280(5360):98-t00, 1998.
 
21
Steve Lawrence and C. Lee Giles. Accessibility of information on the web. Nature, 400(6740):107-109, 1999.
 
22
L. Page, S. Brin, R. Motwani, and T. Winograd. The PageRank citation ranking: Bringing order to the web. 1998.
 
23
 
24
E. Selberg and O. Etzioni. Multi-service search and comparison using the MetaCrawler. In Proceedings of the 1995 World Wide Web Conference, 1995.
 
25
Kristie Seymore, Andrew McCallum, and Roni Rosen~ feld. Learning hidden Markov model structure for information extraction. In AAAI 99 Workshop on Machine Learning for Information Extraction, 1999.
 
26
N. Shivakumar and H. Garcia-Molina. SCAM: A copy detection mechanism for digital documents. In 2nd International Conference on the Theory and Practice of Digital Libraries, 1995.
27
28
 
29
Anastasios Tombros. Reflecting User Information Needs Through Query Biased Summaries. PhD thesis, Department of Computer Science, University of Glasgow, September 1997.
 
30
 
31
I.H. Witten, C.G. Nevill-Manning, and S.J.Cunningham. Building a digital library for computer science research: technical issues. In Proceedings Australasian Computer Science Conference, Melbourne, Australia, January 1996.
 
32
I.H. Witten, C.G. Nevill-Manning, and S.J. Cunningham. Digital libraries based on fulltext retrieval. In Proceedings of WebNet 96, San Francisco, October 1996.

CITED BY  23

Collaborative Colleagues:
Steve Lawrence: colleagues
Kurt Bollacker: colleagues
C. Lee Giles: colleagues