ACM Home Page
Please provide us with feedback. Feedback
Utility-based information distillation over temporally sequenced documents
Full text PdfPdf (259 KB)
Source
Annual ACM Conference on Research and Development in Information Retrieval archive
Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval table of contents
Amsterdam, The Netherlands
SESSION: Routing and filtering table of contents
Pages: 31 - 38  
Year of Publication: 2007
ISBN:978-1-59593-597-7
Authors
Yiming Yang  Carnegie Mellon University
Abhimanyu Lad  Carnegie Mellon University
Ni Lao  Carnegie Mellon University
Abhay Harpale  Carnegie Mellon University
Bryan Kisiel  Carnegie Mellon University
Monica Rogati  Carnegie Mellon University
Sponsors
ACM: Association for Computing Machinery
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 6,   Downloads (12 Months): 85,   Citation Count: 4
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1277741.1277750
What is a DOI?

ABSTRACT

This paper examines a new approach to information distillation over temporally ordered documents, and proposes a novel evaluation scheme for such a framework. It combines the strengths of and extends beyond conventional adaptive filtering, novelty detection and non-redundant passage ranking with respect to long-lasting information needs ("tasks" with multiple queries). Our approach supports fine-grained user feedback via highlighting of arbitrary spans of text, and leverages such information for utility optimization in adaptive settings. For our experiments, we defined hypothetical tasks based on news events in the TDT4 corpus, with multiple queries per task. Answer keys (nuggets) were generated for each query and a semi-automatic procedure was used for acquiring rules that allow automatically matching nuggets against system responses. We also propose an extension of the NDCG metric for assessing the utility of ranked passages as a combination of relevance and novelty. Our results show encouraging utility enhancements using the new approach, compared to the baseline systems without incremental learning or the novelty detection components.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
2
 
3
C. Buckley, G. Salton, and J. Allan. Automatic Retrieval with Locality Information using SMART. NIST special publication, (500207):59--72, 1993.
4
5
 
6
E. Efthimiadis. Query Expansion. Annual Review of Information Science and Technology (ARIST), 31:p 121--87, 1996.
 
7
 
8
R. Florian, H. Hassan, A. Ittycheriah, H. Jing, N. Kambhatla, X. Luo, N. Nicolov, and S. Roukos. A Statistical Model for Multilingual Entity Detection and Tracking. NAACL/HLT, 2004.
9
 
10
 
11
 
12
 
13
 
14
E. Riloff. Automatically Constructing a Dictionary for Information Extraction Tasks. Proceedings of the Eleventh National Conference on Artificial Intelligence, pages 811--816, 1993.
 
15
S. Robertson and S. Walker. Microsoft Cambridge at TREC-9: Filtering track. The Ninth Text REtrieval Conference (TREC-9), pages 361--368.
16
 
17
T. Strohman, D. Metzler, H. Turtle, and W. Croft. Indri: A Language Model-based Search Engine for Complex Queries. Proceedings of the International Conference on Intelligence Analysis, 2004.
 
18
The Linguistic Data Consortium. http://www.ldc.upenn.edu/.
 
19
E. Voorhees. Overview of the TREC 2003 Question Answering Track. Proceedings of the Twelfth Text REtrieval Conference (TREC 2003), 2003.
20
21
22
23
24
25


Collaborative Colleagues:
Yiming Yang: colleagues
Abhimanyu Lad: colleagues
Ni Lao: colleagues
Abhay Harpale: colleagues
Bryan Kisiel: colleagues
Monica Rogati: colleagues