ACM Home Page
Please provide us with feedback. Feedback
A clustering method for web data with multi-type interrelated components
Full text PdfPdf (73 KB)
Source
International World Wide Web Conference archive
Proceedings of the 16th international conference on World Wide Web table of contents
Banff, Alberta, Canada
POSTER SESSION: Search table of contents
Pages: 1121 - 1122  
Year of Publication: 2007
ISBN:978-1-59593-654-7
Authors
Levent Bolelli  Pennsylvania State University
Seyda Ertekin  Pennsylvania State University
Ding Zhou  Pennsylvania State University
C. Lee Giles  Pennsylvania State University
Sponsor
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 7,   Downloads (12 Months): 40,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1242572.1242725
What is a DOI?

ABSTRACT

Traditional clustering algorithms work on "flat" data, making the assumption that the data instances can only be represented by a set of homogeneous and uniform features. Many real world data, however, is heterogeneous in nature, comprising of multiple types of interrelated components. We present a clustering algorithm, K-SVMeans, that integrates the well known K-Means clustering with the highly popular Support Vector Machines(SVM) in order to utilize the richness of data. Our experimental results on authorship analysis of scientific publications show that K-SVMeans achieves better clustering performance than homogeneous data clustering.



Collaborative Colleagues:
Levent Bolelli: colleagues
Seyda Ertekin: colleagues
Ding Zhou: colleagues
C. Lee Giles: colleagues