|
ABSTRACT
Documents with timestamps, such as email and news, can be placed along a timeline. The timeline for a set of documents returned in response to a query gives an indication of how documents relevant to that query are distributed in time. Examining the timeline of a query result set allows us to characterize both how temporally dependent the topic is, as well as how relevant the results are likely to be. We outline characteristic patterns in query result set timelines, and show experimentally that we can automatically classify documents into these classes. We also show that properties of the query result set timeline can help predict the mean average precision of a query. These results show that meta-features associated with a query can be combined with text retrieval techniques to improve our understanding and treatment of text search on documents with timestamps.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Allan, J., Callan, J., Collins-Thompson, K., Croft, B., Feng, F., Fisher, D., Lafferty, J., Larkey, L., Truong, T. N., Ogilvie, P., Si, L., Strohman, T., Turtle, H., and Zhai, C. 2003. The lemur toolkit for language modeling and information retrieval. http://www-2.cs.cmu.edu/~lemur/.
|
 |
2
|
|
 |
3
|
|
| |
4
|
|
 |
5
|
|
 |
6
|
|
| |
7
|
He, B. and Ounis, I. 2004. Inferring query performance using pre-retrieval predictors. In Proceedings of the 11th Symposium on String Processing and Information Retrieval (SPIRE 2004) (Padova, Italy). Lecture Notes in Computer Science, Springer-Verlag, New York.
|
 |
8
|
|
 |
9
|
|
 |
10
|
|
 |
11
|
|
| |
12
|
|
| |
13
|
|
| |
14
|
|
| |
15
|
|
| |
16
|
Swan, R. and Jensen, D. 2000. TimeMines: Constructing timelines with statistical models of word usage. In Proceedings of the 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2000). ACM, New York, 73--80.
|
| |
17
|
|
| |
18
|
|
| |
19
|
|
CITED BY 5
|
|
Paolo Boldi , Francesco Bonchi , Carlos Castillo , Debora Donato , Aristides Gionis , Sebastiano Vigna, The query-flow graph: model and applications, Proceeding of the 17th ACM conference on Information and knowledge management, October 26-30, 2008, Napa Valley, California, USA
|
|
|
Fernando Mourão , Leonardo Rocha , Renata Araújo , Thierson Couto , Marcos Gonçalves , Wagner Meira, Jr., Understanding temporal aspects in document classification, Proceedings of the international conference on Web search and web data mining, February 11-12, 2008, Palo Alto, California, USA
|
|
|
|
|
|
Leonardo Rocha , Fernando Mourão , Adriano Pereira , Marcos André Gonçalves , Wagner Meira, Jr., Exploiting temporal contexts in text classification, Proceeding of the 17th ACM conference on Information and knowledge management, October 26-30, 2008, Napa Valley, California, USA
|
|
|
|
|