Supporting analysis of future-related information in news archives and the web
International Conference on Digital Libraries
Proceedings of the 9th ACM/IEEE-CS joint conference on Digital libraries
Austin, TX, USA
Pages 115-124
Year of Publication: 2009
ISBN: 978-1-60558-322-8
ABSTRACT
A large amount of future-related information is available in news articles and on Web pages. This information can, however, differ to a large extent and may fluctuate over time. It is therefore difficult for users to manually compare and aggregate it, and to reconstruct the most probable course of future events. In this paper we address the problem of automatically generating summaries of future events related to queries, using data obtained from news archive collections or from the Web. We propose two methods: explicit and implicit future-related information detection. The former is based on analyzing the context of future temporal expressions in documents, while the latter relies on detecting periodic patterns in historical document collections. We present a graph-based visualization of future-related information and demonstrate its usefulness through several examples.
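The explicit detection idea can be illustrated with a minimal sketch. It is not the authors' method, whose details the abstract does not give. It assumes that a "future temporal expression" is simply a four-digit year later than the document's publication year, and that the "context" is the sentence containing it. The function name `future_mentions` and the parameter `published_year` are hypothetical.

```python
import re
from collections import defaultdict

# Assumption: a future temporal expression is a bare four-digit year (1900-2099).
YEAR_PATTERN = re.compile(r"\b(?:19|20)\d{2}\b")


def future_mentions(text, published_year):
    """Group sentences by the years they mention that lie after published_year."""
    by_year = defaultdict(list)
    # Naive sentence split: break after '.', '!' or '?' followed by whitespace.
    for sentence in re.split(r"(?<=[.!?])\s+", text.strip()):
        # Use a set so a year repeated within one sentence is counted once.
        years = {int(match) for match in YEAR_PATTERN.findall(sentence)}
        for year in years:
            if year > published_year:
                by_year[year].append(sentence)
    return dict(by_year)


if __name__ == "__main__":
    sample = (
        "The stadium opened in 2005. "
        "Officials expect the new line to open by 2012. "
        "Demand may double by 2012 or 2015."
    )
    for year, sentences in sorted(future_mentions(sample, 2009).items()):
        print(year, sentences)
```

Grouping sentences by the year they refer to gives a crude per-year context that could then be compared or aggregated across documents. Real temporal expressions ("next spring", "within five years") would need a proper temporal tagger.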