ACM Home Page
Please provide us with feedback. Feedback
Web classification using support vector machine
Full text PdfPdf (327 KB)
Source Workshop On Web Information And Data Management archive
Proceedings of the 4th international workshop on Web information and data management table of contents
McLean, Virginia, USA
SESSION: Web mining, tools, and performance evaluation table of contents
Pages: 96 - 99  
Year of Publication: 2002
ISBN:1-58113-593-9
Authors
Aixin Sun  Nanyang Technological University, Singapore
Ee-Peng Lim  Nanyang Technological University, Singapore
Wee-Keong Ng  Nanyang Technological University, Singapore
Sponsors
ACM: Association for Computing Machinery
SIGMIS: ACM Special Interest Group on Management Information Systems
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 17,   Downloads (12 Months): 180,   Citation Count: 24
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/584931.584952
What is a DOI?

ABSTRACT

In web classification, web pages from one or more web sites are assigned to pre-defined categories according to their content. Since web pages are more than just plain text documents, web classification methods have to consider using other context features of web pages, such as hyperlinks and HTML tags. In this paper, we propose the use of Support Vector Machine (SVM) classifiers to classify web pages using both their text and context feature sets. We have experimented our web classification method on the WebKB data set. Compared with earlier Foil-Pilfs method on the same data set, our method has been shown to perform very well. We have also shown that the use of context features especially hyperlinks can improve the classification performance significantly.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
3
4
 
5
 
6
 
7
L. Getoor, E. Segal, B. Taskar, and D. Koller. Probabilistic models of text and link structure for hypertext classification. In Proc. of the Int. Joint Conf. on Artificial intelligence Workshop on Text Learning: Beyond Supervision, Seattle, WA, Aug 2001.
8
 
9
T. Joachims. SVM light, An implementation of Support Vector Machines (SVMs) in C. http://svmlight.joachims.org/.
 
10
 
11
D. D. Lewis. Applying support vector machines to the TREC-2001 batch filtering and routing tasks. In Proc. of the TREC2001, Gaithersburg, Maryland, Nov 2001.
 
12
D. Mladenic. Turning Yahoo to automatic web-page classifier. In Proc. of the 13th European Conf. on Artificial Intelligence, pages 473--474, Brighton, UK, Aug 1998.
 
13
14
15
 
16
 
17
18
 
19

CITED BY  25

Collaborative Colleagues:
Aixin Sun: colleagues
Ee-Peng Lim: colleagues
Wee-Keong Ng: colleagues