| Hierarchical rule generalisation for speaker identification in fiction books |
| Full text |
Pdf
(687 KB)
|
| Source
|
SAICSIT; Vol. 204
archive
Proceedings of the 2006 annual research conference of the South African institute of computer scientists and information technologists on IT research in developing countries
table of contents
Somerset West, South Africa
Pages: 31 - 40
Year of Publication: 2006
ISBN:1-59593-567-3
|
|
Authors
|
|
| Publisher |
South African Institute for Computer Scientists and Information Technologists
, Republic of South Africa
|
| Bibliometrics |
Downloads (6 Weeks): 2, Downloads (12 Months): 12, Citation Count: 2
|
|
|
ABSTRACT
This paper presents a hierarchical pattern matching and generalisation technique which is applied to the problem of locating the correct speaker of quoted speech found in fiction books. Patterns from a training set are generalised to create a small number of rules, which can be used to identify items of interest within the text. The pattern matching technique is applied to finding the Speech-Verb, Actor and Speaker of quotes found in fiction books. The technique performs well over the training data, resulting in rule-sets many times smaller than the training set, but providing very high accuracy. While the rule-set generalised from one book is less effective when applied to different books than an approach based on hand coded heuristics, performance is comparable when testing on data closely related to the training set.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
|
| |
3
|
Ciravegna, F. 2001. Adaptive information extraction from text by rule induction and generalisation. In 17th International Joint Conference on Artificial Intelligence (IJCAI). Seattle.
|
 |
4
|
|
| |
5
|
|
| |
6
|
Downey, D., Soderland, O. E. S., and Weld, D. 2004. Learning text patterns for web information extraction and assessment. In AAAI 2004 workshop on Adaptive Text Extraction and Mining. San Jose, CA.
|
| |
7
|
|
| |
8
|
|
| |
9
|
Hobbs, J. R. 1978. Resolving pronoun references. Lingua 44, 311--338.
|
| |
10
|
|
| |
11
|
|
| |
12
|
|
| |
13
|
MUC-7. 1998. MUC-7 test scores introduction. In Proceedings of 7th Message Understanding Conference. Fairfax, Virginia.
|
 |
14
|
|
| |
15
|
|
| |
16
|
Tapanainen, P. 1999. Parsing in two frameworks: finite-state and functional dependency grammar. Ph.D. thesis, University of Helsinki.
|
| |
17
|
|
| |
18
|
|
| |
19
|
Zhang, J., Black, A., and Sproat, R. 2003. Identifying speakers in children's stories for speech synthesis. In Proceedings of EUROSPEECH 2003. Geneva.
|
CITED BY 2
|
|
|
|
|
Kevin Glass , Shaun Bangay , Bruce Alcock, Mechanisms for multimodality: taking fiction to another dimension, Proceedings of the 5th international conference on Computer graphics, virtual reality, visualisation and interaction in Africa, October 29-31, 2007, Grahamstown, South Africa
|
|