ACM Home Page
Please provide us with feedback. Feedback
Exploiting the deep web with DynaBot: matching, probing, and ranking
Full text PdfPdf (107 KB)
Source International World Wide Web Conference archive
Special interest tracks and posters of the 14th international conference on World Wide Web table of contents
Chiba, Japan
POSTER SESSION: Posters table of contents
Pages: 1174 - 1175  
Year of Publication: 2005
ISBN:1-59593-051-5
Authors
Daniel Rocco  University of West Georgia, Carrollton, GA
James Caverlee  Georgia Inst. of Technology, Atlanta, GA
Ling Liu  Georgia Inst. of Technology, Atlanta, GA
Terence Critchlow  Lawrence Livermore Nat'l Lab, Livermore, CA
Sponsor
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 5,   Downloads (12 Months): 48,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1062745.1062925
What is a DOI?

ABSTRACT

We present the design of Dynabot, a guided Deep Web discovery system. Dynabot's modular architecture supports focused crawling of the Deep Web with an emphasis on matching, probing, and ranking discovered sources using two key components: service class descriptions and source-biased analysis. We describe the overall architecture of Dynabot and discuss how these components support effective exploitation of the massive Deep Web data available.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
P. Lyman and H. R. Varian. How much information. www.sims.berkeley.edu/how-much-info-2003, 2003.
 
3
D. Rocco, J. Caverlee, L. Liu, and T. Critchlow. Focused Crawling of the Deep Web using Service Class Descriptions, U. of West Georgia, 2005.


Collaborative Colleagues:
Daniel Rocco: colleagues
James Caverlee: colleagues
Ling Liu: colleagues
Terence Critchlow: colleagues