ACM Home Page
Please provide us with feedback. Feedback
Large scale multi-label classification via metalabeler
Full text PdfPdf (569 KB)
Source
International World Wide Web Conference archive
Proceedings of the 18th international conference on World wide web table of contents
Madrid, Spain
SESSION: Data mining/session: learning table of contents
Pages 211-220  
Year of Publication: 2009
ISBN:978-1-60558-487-4
Authors
Lei Tang  Arizona State University, Tempe, AZ, USA
Suju Rajan  Yahoo! Inc., Sunnyvale, CA, USA
Vijay K. Narayanan  Yahoo! Inc., Sunnyvale, CA, USA
Sponsor
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 57,   Downloads (12 Months): 222,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1526709.1526738
What is a DOI?

ABSTRACT

The explosion of online content has made the management of such content non-trivial. Web-related tasks such as web page categorization, news filtering, query categorization, tag recommendation, etc. often involve the construction of multi-label categorization systems on a large scale. Existing multi-label classification methods either do not scale or have unsatisfactory performance. In this work, we propose MetaLabeler to automatically determine the relevant set of labels for each instance without intensive human involvement or expensive cross-validation. Extensive experiments conducted on benchmark data show that the MetaLabeler tends to outperform existing methods. Moreover, MetaLabeler scales to millions of multi-labeled instances and can be deployed easily. This enables us to apply the MetaLabeler to a large scale query categorization problem in Yahoo!, yielding a significant improvement in performance.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
2
 
3
4
 
5
 
6
 
7
R.-E. Fan and C.-J. Lin. A study on threshold selection for multi-label classication. 2007.
8
 
9
10
 
11
 
12
I. Katakis, G. Tsoumakas, and I. Vlahavas. Multilabel text classification for automated tag suggestion. In Proceedings of the ECML/PKDD 2008 Discovery Challenge, 2008.
13
 
14
 
15
16
17
 
18
19
 
20
 
21
 
22
23
24
25
 
26
G. Tsoumakas and K. Ioannis. Multi label classification: An overview. International Journal of Data Warehousing and Mining, 3:1--13, 2007.
 
27
 
28
N. Ueda and K. Saito. Parametric mixture models for multi-labeled text. In NIPS, pages 721--728, 2002.
29
30
31
32
 
33
34


Collaborative Colleagues:
Lei Tang: colleagues
Suju Rajan: colleagues
Vijay K. Narayanan: colleagues