ACM Home Page
Please provide us with feedback. Feedback
Representative entry selection for profiling blogs
Full text PdfPdf (117 KB)
Source
Conference on Information and Knowledge Management archive
Proceeding of the 17th ACM conference on Information and knowledge management table of contents
Napa Valley, California, USA
POSTER SESSION: Poster session 1/knowledge management table of contents
Pages 1387-1388  
Year of Publication: 2008
ISBN:978-1-59593-991-3
Authors
Jinfeng Zhuang  Nanyang Technological University, Singapore, Singapore
Steven C.H. Hoi  Nanyang Technological University, Singapore, Singapore
Aixin Sun  Nanyang Technological University, Singapore, Singapore
Rong Jin  Michigan State University, East Lansing, MI, USA
Sponsors
ACM: Association for Computing Machinery
SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 9,   Downloads (12 Months): 79,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1458082.1458293
What is a DOI?

ABSTRACT

Many applications on blog search and mining often meet the challenge of handling huge volume of blog data, in which one single blog could contain hundreds or even thousands of entries. We investigate novel techniques for profiling blogs by selecting a subset of representative entries for each blog. We propose two principles for guiding the entry selection task: representativeness and diversity. Further, we formulate the entry selection task into a combinatorial optimization problem and propose a greedy yet effective algorithm for finding a good approximate solution by exploiting the theory of submodular functions. We suggest blog classification for judging the performance of the proposed entry selection techniques and evaluate their performance on a real blog dataset, in which encouraging results were obtained.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
G. Nemhauser, L. Wolsey, and M. Fisher. An analysis of the approximations for maximizing submodular set functions. Mathematical Programming, pages 265--294, 1978.
3

Collaborative Colleagues:
Jinfeng Zhuang: colleagues
Steven C.H. Hoi: colleagues
Aixin Sun: colleagues
Rong Jin: colleagues