| Combining semantic and syntactic document classifiers to improve first story detection |
| Full text |
Pdf
(149 KB)
|
| Source
|
Annual ACM Conference on Research and Development in Information Retrieval
archive
Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
table of contents
New Orleans, Louisiana, United States
Pages: 424 - 425
Year of Publication: 2001
ISBN:1-58113-331-6
|
|
Authors
|
|
| Sponsor |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 6, Downloads (12 Months): 60, Citation Count: 13
|
|
|
ABSTRACT
In this paper we describe a type of data fusion involving the combination of evidence derived from multiple document representations. Our aim is to investigate if a composite representation can improve the online detection of novel events in a stream of broadcast news stories. This classification process otherwise known as first story detection FSD (or in the Topic Detection and Tracking pilot study as online new event detection [1]), is one of three main classification tasks defined by the TDT initiative. Our composite document representation consists of a semantic representation (based on the lexical chains derived from a text) and a syntactic representation (using proper nouns). Using the TDT1 evaluation methodology, we evaluate a number of document representation combinations using these document classifiers.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Ron Papka, James Allan, Topic Detection and Tracking: Event Clustering as a basis for first story detection, Kluwer Academic Publishers, 4:97-126, 2000.
|
| |
2
|
|
| |
3
|
Christiane Fellbaum, WordNet: An Electronic Lexical Database, MIT Press, 1998.
|
| |
4
|
Nicola Stokes, Paula Hatch, Joe Carthy, Topic Detection, a new application for lexical chaining?, In the Proceedings of BCS IRSG Colloquium 2000, pp. 94-103, 2000.
|
| |
5
|
W. B. Croft, Combining Approaches to information retrieval, Advances in Information Retrieval, 1:1-36 Kluwer Academic Publishers, 2000.
|
CITED BY 13
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Canhui Wang , Min Zhang , Liyun Ru , Shaoping Ma, Automatic online news topic ranking using media focus and user attention based on aging theory, Proceeding of the 17th ACM conference on Information and knowledge management, October 26-30, 2008, Napa Valley, California, USA
|
|
|
|
|
|
|
|