| HyperSum: hypergraph based semi-supervised sentence ranking for query-oriented summarization |
| Full text |
Pdf
(337 KB)
|
Source
|
Conference on Information and Knowledge Management
archive
Proceeding of the 18th ACM conference on Information and knowledge management
table of contents
Hong Kong, China
POSTER SESSION: Poster session 6: IR track
table of contents
Pages: 1855-1858
Year of Publication: 2009
ISBN:978-1-60558-512-3
|
|
Authors
|
|
Wei Wang
|
Key Laboratory of Computational Linguistics (Peking University), Ministry of Education, China, Beijing, China
|
|
Furu Wei
|
IBM China Research Laboratory, Beijing, China, Beijing, China
|
|
Wenjie Li
|
Deparment of Computing, the Hong Kong Polytechnic University, Hong Kong, Hong Kong
|
|
Sujian Li
|
Key Laboratory of Computational Linguistics (Peking University), Ministry of Education, China, Beijing, China
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 29, Downloads (12 Months): 50, Citation Count: 0
|
|
|
ABSTRACT
Graph based sentence ranking algorithms such as PageRank and HITS have been successfully used in query-oriented summarization. With these algorithms, the documents to be summarized are often modeled as a text graph where nodes represent sentences and edges represent pairwise similarity relationships between two sentences. A deficiency of conventional graph modeling is its incapability of naturally and effectively representing complex group relationships shared among multiple objects. Simply squeezing complex relationships into pairwise ones will inevitably lead to loss of information which can be useful for ranking and learning. In this paper, we propose to take advantage of hypergraph, i.e. a generalization of graph, to remedy this defect. In a text hypergraph, nodes still represent sentences, yet hyperedges are allowed to connect more than two sentences. With a text hypergraph, we are thus able to integrate both group relationships formulated among multiple sentences and pairwise relationships formulated between two sentences in a unified framework. As essential work, it is first addressed in the paper that how a text hypergraph can be built for summarization by applying clustering techniques. Then, a hypergraph based semi-supervised sentence ranking algorithm is developed for query-oriented extractive summarization, where the influence of query is propagated to sentences through the structure of the constructed text hypergraph. When evaluated on DUC data sets, performance of the proposed approach is remarkable.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
Jahna Otterbacher , Güneş Erkan , Dragomir R. Radev, Using random walks for question-focused sentence retrieval, Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing, p.915-922, October 06-08, 2005, Vancouver, British Columbia, Canada
[doi> 10.3115/1220575.1220690]
|
 |
3
|
|
| |
4
|
D. Zhou, O. Bousquet, T.N. Lal, J. Weston and B. Schölkopf. 2004. Learning with Local and Global Consistency. 16th NIPS, Cambridge, MA, 2004. MIT Press.
|
| |
5
|
D. Zhou, J. Huang and B. Schölkopf. 2005. Beyond Pairwise Classification and Clustering Using Hypergraphs. MPI Technical Report (143), Tübingen, Germany 2005
|
|