| Automatic generation of overview timelines |
| Full text |
Pdf
(1.13 MB)
|
| Source
|
Annual ACM Conference on Research and Development in Information Retrieval
archive
Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
table of contents
Athens, Greece
Pages: 49 - 56
Year of Publication: 2000
ISBN:1-58113-226-3
|
|
Authors
|
|
Russell Swan
|
Center for Intelligent Information Retrieval, Department of Computer Science, University of Massachusetts, Amherst, Massachusetts
|
|
James Allan
|
Center for Intelligent Information Retrieval, Department of Computer Science, University of Massachusetts, Amherst, Massachusetts
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 20, Downloads (12 Months): 168, Citation Count: 32
|
|
|
ABSTRACT
We present a statistical model of feature occurrence over time, and develop tests based on classical hypothesis testing for significance of term appearance on a given date. Using additional classical hypothesis testing we are able to combine these terms to generate “topics” as defined by the Topic Detection and Tracking study. The groupings of terms obtained can be used to automatically generate an interactive timeline displaying the major events and topics covered by the corpus. To test the validity of our technique we extracted a large number of these topics from a test corpus and had human evaluators judge how well the selected features captured the gist of the topics, and how they overlapped with a set of known topics from the corpus. The resulting topics were highly rated by evaluators who compared them to known topics.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
R. B. Allen. Timelines as information system interfaces. In Proceedings International Symposium on Digital Libraries, pages 175-180, Tsukuba, Japan, 1995.
|
| |
3
|
Daniel M. Bikel , Scott Miller , Richard Schwartz , Ralph Weischedel, Nymble: a high-performance learning name-finder, Proceedings of the fifth conference on Applied natural language processing, p.194-201, March 31-April 03, 1997, Washington, DC
[doi> 10.3115/974557.974586]
|
| |
4
|
|
| |
5
|
Ido Dagan and Ronen Feldman. Keyword-based browsing and analysis of large document sets. In Proceedings of the Symposium on Document Analysis and Information Retrieval (SDAIR-96), Las Vegas, Nevada, 1996.
|
| |
6
|
DARPA, editor. Proceedings of the DARPA Broadcast news Workshop, Herndon, Virginia, February 1999.
|
| |
7
|
B. S. Everitt. The Analysis of Contingency Tables. Chapman and Hall, London, 1977.
|
| |
8
|
Ronen Feldman and Ido Dagan. Knowledge discovery in textual databases (kdt). In Proceedings of the First International Conference on Knowledge Discovery (KDD-95). ACM, August 1995.
|
 |
9
|
Scott Fertig , Eric Freeman , David Gelernter, Lifestreams: an alternative to the desktop metaphor, Conference companion on Human factors in computing systems: common ground, p.410-411, April 13-18, 1996, Vancouver, British Columbia, Canada
[doi> 10.1145/257089.257404]
|
 |
10
|
Dan Ingalls , Ted Kaehler , John Maloney , Scott Wallace , Alan Kay, Back to the future: the story of Squeak, a practical Smalltalk written in itself, Proceedings of the 12th ACM SIGPLAN conference on Object-oriented programming, systems, languages, and applications, p.318-326, October 05-09, 1997, Atlanta, Georgia, United States
|
 |
11
|
Vijay Kumar , Richard Furuta , Robert B. Allen, Metadata visualization for digital libraries: interactive timeline editing and review, Proceedings of the third ACM conference on Digital libraries, p.126-133, June 23-26, 1998, Pittsburgh, Pennsylvania, United States
[doi> 10.1145/276675.276689]
|
 |
12
|
Catherine Plaisant , Brett Milash , Anne Rose , Seth Widoff , Ben Shneiderman, LifeLines: visualizing personal histories, Proceedings of the SIGCHI conference on Human factors in computing systems: common ground, p.221-ff., April 13-18, 1996, Vancouver, British Columbia, Canada
[doi> 10.1145/238386.238493]
|
 |
13
|
|
 |
14
|
|
 |
15
|
|
| |
16
|
|
| |
17
|
James A. Wise , James J. Thomas , Kelly Pennock , David Lantrip , Marc Pottier , Anne Schur , Vern Crow, Visualizing the non-visual: spatial analysis and interaction with information for text documents, Readings in information visualization: using vision to think, Morgan Kaufmann Publishers Inc., San Francisco, CA, 1999
|
 |
18
|
Xia Lin , Dagobert Soergel , Gary Marchionini, A self-organizing semantic map for information retrieval, Proceedings of the 14th annual international ACM SIGIR conference on Research and development in information retrieval, p.262-269, October 13-16, 1991, Chicago, Illinois, United States
[doi> 10.1145/122860.122887]
|
| |
19
|
|
 |
20
|
|
CITED BY 32
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
David Frey , Rahul Gupta , Vikas Khandelwal , Victor Lavrenko , Anton Leuski , James Allan, Monitoring the news: a TDT demonstration system, Proceedings of the first international conference on Human language technology research, p.1-5, March 18-21, 2001, San Diego
|
|
|
|
|
|
R. Guha , Ravi Kumar , D. Sivakumar , Ravi Sundaram, Unweaving a web of documents, Proceeding of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining, August 21-24, 2005, Chicago, Illinois, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Xuanhui Wang , ChengXiang Zhai , Xiao Hu , Richard Sproat, Mining correlated bursty topic patterns from coordinated text streams, Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining, August 12-15, 2007, San Jose, California, USA
|
|
|
Gabriel Pui Cheong Fung , Jeffrey Xu Yu , Huan Liu , Philip S. Yu, Time-dependent event hierarchy construction, Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining, August 12-15, 2007, San Jose, California, USA
|
|
|
|
|
|
Fabian Mörchen , Mathäus Dejori , Dmitriy Fradkin , Julien Etienne , Bernd Wachmann , Markus Bundschus, Anticipating annotations and emerging trends in biomedical literature, Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, August 24-27, 2008, Las Vegas, Nevada, USA
|
|
|
|
|
|
Canhui Wang , Min Zhang , Liyun Ru , Shaoping Ma, Automatic online news topic ranking using media focus and user attention based on aging theory, Proceeding of the 17th ACM conference on Information and knowledge management, October 26-30, 2008, Napa Valley, California, USA
|
|
|
|
|