Novelty and redundancy detection in adaptive filtering
Source
Annual ACM Conference on Research and Development in Information Retrieval
Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
Tampere, Finland
SESSION: Filtering
Pages: 81 - 88
Year of Publication: 2002
ISBN: 1-58113-561-0
Authors
Yi Zhang, Carnegie Mellon University, Pittsburgh, PA
Jamie Callan, Carnegie Mellon University, Pittsburgh, PA
Thomas Minka, Carnegie Mellon University, Pittsburgh, PA
Bibliometrics
Downloads (6 Weeks): 20, Downloads (12 Months): 173, Citation Count: 55
ABSTRACT
This paper addresses the problem of extending an adaptive information filtering system to make decisions about the novelty and redundancy of relevant documents. It argues that relevance and redundancy should each be modelled explicitly and separately. Five redundancy measures are proposed and evaluated in experiments with and without redundancy thresholds. The experimental results demonstrate that the cosine similarity metric and a redundancy measure based on a mixture of language models are both effective for identifying redundant documents.
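The abstract names cosine similarity and redundancy thresholds but does not give formulas, so the following is only a minimal sketch of how such a measure might be applied to a stream of documents already judged relevant. The whitespace tokenizer, the use of raw term counts, taking the maximum similarity over previously delivered documents, and the threshold value of 0.8 are all assumptions made for illustration, and the function names are hypothetical rather than taken from the paper.

```python
import math
from collections import Counter


def term_vector(text):
    """Lowercased whitespace tokens mapped to raw counts (an assumed representation)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def redundancy_score(doc, delivered):
    """Highest similarity to any previously delivered document (0.0 if there are none).

    Using the maximum as the aggregate is an assumption of this sketch.
    """
    vec = term_vector(doc)
    return max((cosine(vec, term_vector(d)) for d in delivered), default=0.0)


def filter_stream(relevant_docs, threshold=0.8):
    """Deliver each relevant document unless its redundancy score exceeds the threshold.

    The threshold of 0.8 is a hypothetical value, not one reported in the source.
    """
    delivered = []
    for doc in relevant_docs:
        if redundancy_score(doc, delivered) <= threshold:
            delivered.append(doc)
    return delivered
```

Relevance is decided before this step and redundancy is scored separately, which mirrors the abstract's argument that the two should be modelled independently. In practice the threshold would be tuned on held-out judgments rather than fixed in advance.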
CITED BY 55
Matthias Bender, Sebastian Michel, Peter Triantafillou, Gerhard Weikum, Christian Zimmer, Improving collection selection with overlap awareness in P2P search engines, Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval, August 15-19, 2005, Salvador, Brazil
Yiming Yang, Abhimanyu Lad, Ni Lao, Abhay Harpale, Bryan Kisiel, Monica Rogati, Utility-based information distillation over temporally sequenced documents, Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval, July 23-27, 2007, Amsterdam, The Netherlands
K. Selçuk Candan, Mehmet E. Dönderler, Terri Hedgpeth, Jong Wook Kim, Qing Li, Maria Luisa Sapino, SEA: Segment-enrich-annotate paradigm for adapting dialog-based content for improved accessibility, ACM Transactions on Information Systems (TOIS), v.27 n.3, p.1-45, May 2009
Biplav Srivastava, Tuan A. Nguyen, Alfonso Gerevini, Subbarao Kambhampati, Minh Binh Do, Ivan Serina, Domain independent approaches for finding diverse plans, Proceedings of the 20th international joint conference on Artificial intelligence, p.2016-2022, January 06-12, 2007, Hyderabad, India