ACM Home Page
Please provide us with feedback. Feedback
Harvesting needed to maintain scientific literature online
Full text PdfPdf (148 KB)
Source
International Conference on Digital Libraries archive
Proceedings of the 8th ACM/IEEE-CS joint conference on Digital libraries table of contents
Pittsburgh PA, PA, USA
POSTER SESSION: Posters table of contents
Pages 460-460  
Year of Publication: 2008
ISBN:978-1-59593-998-2
Authors
Nikolay Nikolov  European Bioinformatics Institute, Cambridge, United Kingdom
Peter Stoehr  European Bioinformatics Institute, Cambridge, United Kingdom
Sponsors
SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web
SIGIR: ACM Special Interest Group on Information Retrieval
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 3,   Downloads (12 Months): 39,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1378889.1379005
What is a DOI?

ABSTRACT

Millions of scientific articles are accessible freely on the web. While some of them are stored in institutional repositories many are made available on personal pages which are exposed to the net's transience. We found that nearly 11% of URLs of PDF documents containing references to life science publications were not accessible within 5 months after being harvested using a search engine's (SE) API. For most of them (8.4%) no SE cache backup could be found. Although we have yet to estimate the exact rate at which the scientific literature disappears and the duration of its disappearance the results so far are a clear indicator that web harvesting is needed to preserve the online scientific literature.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
4
Google blocks academic http://tinyurl.com/2j7alg last retrieved on February 3, 2008


Collaborative Colleagues:
Nikolay Nikolov: colleagues
Peter Stoehr: colleagues