ACM Home Page
Please provide us with feedback. Feedback
Urban web crawling
Full text PdfPdf (651 KB)
Source
ACM International Conference Proceeding Series; Vol. 300 archive
Proceedings of the first international workshop on Location and the web table of contents
Beijing, China
Pages 25-32  
Year of Publication: 2008
ISBN:978-1-60558-160-6
Authors
Dirk Ahlers  OFFIS Institute for Information Technology, Oldenburg, Germany
Susanne Boll  University of Oldenburg, Germany
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 12,   Downloads (12 Months): 126,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1367798.1367803
What is a DOI?

ABSTRACT

Local search is increasingly becoming a major focus point of research interest. It is a widely-recognized speciality search with a large application area. Its data is usually aggregated from a variety of sources. One as yet largely untapped source of location data is the WWW. Today, the Web does not explicitly reveal its location-relation; rather this information is hidden somewhere within pages' contents. To exploit such location information, we need to find, extract and geo-spatially index relevant Web pages. For an effective retrieval of such content, this paper examines the application of focused Web crawling to the geospatial domain. We describe our approach for a geo-aware focused crawling of urban areas and other regions with a high building density. We present our experimental results that give us insight into spatial Web information such as location density and link distance between topical pages. Our crawls and evaluations back our hypothesis that geospatially focused crawling is suitable for the urban geospatial topic.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
D. Ahlers and S. Boll. Geospatially Focused Web Crawling. Datenbank-Spektrum, Special Issue Focused Search, 7(23):3--12, 2007.
 
2
D. Ahlers and S. Boll. Location-based Web search. In A. Scharl and K. Tochterman, editors, The Geospatial Web. Springer, London, 2007.
 
3
D. Ahlers and S. Boll. A Web more Geospatial: Insights into the Location Inside. In Web Science Workshop (WSW2008) at WWW2008, Beijing, China, 2008.
4
 
5
 
6
 
7
W. Cai, S. Wang, and Q. Jiang. Address extraction: Extraction of location-based information from the web. In Y. Zhang, K. Tanaka, J. X. Yu, S. Wang, and M. Li, editors, APWeb 2005. Springer, 2005.
8
 
9
 
10
 
11
 
12
13
 
14
 
15
M. Jakob, M. Gromann, D. Nicklas, and B. Mitschang. DCbot: Finding Spatial Information on the Web. In L. Zhou, B. C. Ooi, and X. Meng, editors, DASFAA 2005, pages 779--790. Springer, 2005.
16
17
 
18
 
19
A. Markowetz, Y.-Y. Chen, T. Suel, X. Long, and B. Seeger. Design and Implementation of a Geographic Search Engine. In WebDB 2005, pages 19--24, 2005.
20
 
21
 
22
23
 
24
M. Sanderson and J. Kohler. Analyzing Geographic Queries. In Proc. of the ACM SIGIR Workshop on Geographic Information Retrieval, Sheffield, UK, 2004.
25
 
26
T. T. Tang, D. Hawking, N. Craswell, and R. S. Sankaranarayana. Focused Crawling in Depression Portal Search: A Feasibility Study. In ADCS 2004, pages 2--9, Melbourne, Australia, 2004.


Collaborative Colleagues:
Dirk Ahlers: colleagues
Susanne Boll: colleagues