ACM Home Page
Please provide us with feedback. Feedback
Design of a crawler with bounded bandwidth
Full text PdfPdf (52 KB)
Source International World Wide Web Conference archive
Proceedings of the 13th international World Wide Web conference on Alternate track papers & posters table of contents
New York, NY, USA
POSTER SESSION: Posters table of contents
Pages: 292 - 293  
Year of Publication: 2004
ISBN:1-58113-912-8
Authors
Michelangelo Diligenti  Università di Siena Via Roma, Siena, Italy
Marco Maggini  Università di Siena Via Roma, Siena, Italy
Filippo Maria Pucci  Università di Siena Via Roma, Siena, Italy
Franco Scarselli  Università di Siena Via Roma, Siena, Italy
Sponsor
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 4,   Downloads (12 Months): 31,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1013367.1013441
What is a DOI?

ABSTRACT

This paper presents an algorithm to bound the bandwidth of a Web crawler. The crawler collects statistics on the transfer rate of each server to predict the expected bandwidth use for future downloads. The prediction allows us to activate the optimal number of fetcher threads in order to exploit the assigned bandwidth. The experimental results show the effectiveness of the proposed technique.



Collaborative Colleagues:
Michelangelo Diligenti: colleagues
Marco Maggini: colleagues
Filippo Maria Pucci: colleagues
Franco Scarselli: colleagues