|
ABSTRACT
This paper proposes a method for event tracking on broadcast news stories based on distinction between a topic and an event. A topic and an event are identified using a simple criterion called domain dependency of words: how greatly a word features a given set of data. The method was tested on the TDT corpus which has been developed by the TDT Pilot Study and the result can be regarded as promising the usefulness of the method.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
J. Allan. J. Carbonell, G. Doddington. J. Yamron, and Y. Yang. Topic Detection and Tracking Pilot Study: Final Report. In Proceedings of the DARPA Broadcast News Transcription and Understanding Workshop, 1998.
|
| |
2
|
J. Allan et al. The Topic Detection and Tracking (TDT) Pilot Study Evaluation Plan, Version 2.8. pages 1-8, 1997.
|
 |
3
|
|
 |
4
|
|
| |
5
|
|
| |
6
|
J. Carbonell, Y. Yang, J. Lafferty, R. D. Brown, T. Pierce, and X. Liu. CMU Report on TDT-2: Segmentation, Detection and Tracking. In Proceedings of the DARPA Broadcast News Transcription and Understanding Workshop, 1999.
|
 |
7
|
Douglass R. Cutting , David R. Karger , Jan O. Pedersen , John W. Tukey, Scatter/Gather: a cluster-based approach to browsing large document collections, Proceedings of the 15th annual international ACM SIGIR conference on Research and development in information retrieval, p.318-329, June 21-24, 1992, Copenhagen, Denmark
[doi> 10.1145/133160.133214]
|
| |
8
|
|
 |
9
|
Julian Kupiec , Jan Pedersen , Francine Chen, A trainable document summarizer, Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval, p.68-73, July 09-13, 1995, Seattle, Washington, United States
[doi> 10.1145/215206.215333]
|
 |
10
|
W. Lam , S. Mukhopadhyay , J. Mostafa , M. Palakal, Detection of shifts in user interests for personalized information filtering, Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval, p.317-325, August 18-22, 1996, Zurich, Switzerland
[doi> 10.1145/243199.243279]
|
| |
11
|
C. Leacock and M. Chodorow. Combining Local Context and WordNet Similarity for Word Sense Identification, WordNet. An Electronic Lezical Database. MIT Press, 1988.
|
 |
12
|
|
 |
13
|
David D. Lewis , Robert E. Schapire , James P. Callan , Ron Papka, Training algorithms for linear text classifiers, Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval, p.298-306, August 18-22, 1996, Zurich, Switzerland
[doi> 10.1145/243199.243277]
|
| |
14
|
S. A. Lowe. The BetaBinomial Mixture Model and its Application to TDT Tracking and Detection. In Proceedings of the DARPA Broadcast News Transcription and Understanding Workshop, 1999.
|
| |
15
|
H. P. Luhn. A Statistical Approach to Mechanized Encoding and Searching of Literary Information. IBM journal, 1(4):307-319, 1957.
|
| |
16
|
II. P. Luhn. The Automatic Creation of Literature Abstracts. IBM journal, 2(1):159-165, 1958.
|
| |
17
|
|
| |
18
|
G. A. Miller. Nouns in WordNet, An Electronic Lexical Database. MIT Press, 1998.
|
| |
19
|
R. Papka, J. Allan, and V. Lavrenko. UMASS Approaches to Detection and Tracking at TDT2. In Proceedings of the DARPA Broadcast News Transcription and Understanding Workshop, 1999.
|
| |
20
|
P. Resnik. Using Information Content to Evaluate Semantic Similarity in a Taxonomy. In Proceedings of 15th International Joint Conference on Artificial Intelligence, pages 448-453, 1995.
|
| |
21
|
|
| |
22
|
|
| |
23
|
S. Sekine and R. Grishman. A Corpus-based Probabilistic Grammar with Only Two Non-Terminals. In Proceedings of the Fourth International Workshop on Parsing Technology, pages 216-223, 1995.
|
| |
24
|
T. Strzalkowski, G. C. Stein. and G. B. Wise. GE.Tracker: A Robust, Lightweight Topic Tracking System. In Proceedings of the DARPA Broadcast News Transcmption and Understanding Workshop, 1999.
|
| |
25
|
Y. Watanabe, Y. Okaxta, K. Kaneji, and Y. Sakamoto. Multiple Media Database System for TV Newscasts and Newspapers(in 3apanese). In Technical Report of IEIGE. NLG*97-69, PRMU97- 257, pages 47-54. 1998.
|
| |
26
|
J. P. Yamron, I. Carp, L. Gillick. S. Lowe, and P. van Mulbregt. Topic Tracking in a News Stream. In Proceedmgs of the DARPA Broadcast News Transcmplion and Understandtng Workshop, 1999.
|
| |
27
|
|
| |
28
|
Y. Yang J. G. Carbonell, J. Allan, and J. Ymron. Topic Detection and Tracking: Detection-Task. In Proceedings of the Workshop of Topic Detection and Tracking, 1997.
|
 |
29
|
|
|