|
ABSTRACT
A large scale collection of web pages has been essential for research in information retrieval and related areas. This paper provides an overview of a large web collection used in the SPIRIT project for the design and testing of spatially-aware retrieval systems. Several statistics are derived and presented to show the characteristics of the collection.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Cacheda, F., Plachouras, V. & Ounis:, I. (2004). "Performance Analysis of Distributed Architectures to Index One Terabyte of Text". In: McDonald, S. & Tait, J. (eds.), Advances in Information Retrieval, Proceedings of the 26th European Conference on IR Research, Lecture Notes in Computer Science, Vol. 2997, Sunderland, UK. pp. 394--408. Springer.
|
 |
2
|
C. L. A. Clarke , G. V. Cormack , M. Laszlo , T. R. Lynam , E. L. Terra, The impact of corpus size on question answering performance, Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, August 11-15, 2002, Tampere, Finland
[doi> 10.1145/564376.564448]
|
| |
3
|
Craswell, N., Hawking, D., Wilkinson, R. & Wu, M. (2003). "Overview of the TREC 2003 Web Track". In: Voorheer, E. (ed.), NIST Special Publication 500--255:The Twelfth Text REtrieval Conference (TREC 2003), Gaithersburg, MD. pp. 78--92. NIST.
|
| |
4
|
|
 |
5
|
Christopher B. Jones , R. Purves , A. Ruas , M. Sanderson , M. Sester , M. van Kreveld , R. Weibel, Spatial information retrieval and geographical ontologies an overview of the SPIRIT project, Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, August 11-15, 2002, Tampere, Finland
[doi> 10.1145/564376.564457]
|
CITED BY 8
|
|
|
|
|
|
Jun Jie Foo , Justin Zobel , Ranjan Sinha , S. M. M. Tahaghoghi, Detection of near-duplicate images for web search, Proceedings of the 6th ACM international conference on Image and video retrieval, p.557-564, July 09-11, 2007, Amsterdam, The Netherlands
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Ross S. Purves , Paul Clough , Christopher B. Jones , Avi Arampatzis , Benedicte Bucher , David Finch , Gaihua Fu , Hideo Joho , Awase Khirni Syed , Subodh Vaid , Bisheng Yang, The design and implementation of SPIRIT: a spatially aware search engine for information retrieval on the Internet, International Journal of Geographical Information Science, v.21 n.7, p.717-745, January 2007
|
|