| Web page classification without the web page |
| Full text |
Pdf
(58 KB)
|
| Source
|
International World Wide Web Conference
archive
Proceedings of the 13th international World Wide Web conference on Alternate track papers & posters
table of contents
New York, NY, USA
POSTER SESSION: Posters
table of contents
Pages: 262 - 263
Year of Publication: 2004
ISBN:1-58113-912-8
|
|
Author
|
|
Min-Yen Kan
|
National University of Singapore, Singapore
|
|
| Sponsor |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 15, Downloads (12 Months): 108, Citation Count: 11
|
|
|
ABSTRACT
Uniform resource locators (URLs), which mark the address of a resource on the World Wide Web, are often human-readable and can hint at the category of the resource. This paper explores the use of URLs for webpage categorization via a two-phase pipeline of word segmentation/expansion and classification. We quantify its performance against document-based methods, which require the retrieval of the source document.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
K. T. Lua and G. W. Gan. An application of information theory in chinese word segmentation. Computer Processing of Chinese and Oriental Languages, 8(1):115--124, 1994.
|
| |
3
|
|
 |
4
|
|
CITED BY 11
|
|
|
|
|
Chuang Wang , Xing Xie , Lee Wang , Yansheng Lu , Wei-Ying Ma, Detecting geographic locations from web resources, Proceedings of the 2005 workshop on Geographic information retrieval, November 04-04, 2005, Bremen, Germany
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
B. Barla Cambazoglu , Evren Karaca , Tayfun Kucukyilmaz , Ata Turk , Cevdet Aykanat, Architecture of a grid-enabled Web search engine, Information Processing and Management: an International Journal, v.43 n.3, p.609-623, May, 2007
|
|
|
|
|
|
|
|
|
Kerstin Bischoff , Claudiu S. Firan , Wolfgang Nejdl , Raluca Paiu, Can all tags be used for search?, Proceeding of the 17th ACM conference on Information and knowledge management, October 26-30, 2008, Napa Valley, California, USA
|
|
|
|
|