ACM Home Page
Please provide us with feedback. Feedback
Hierarchical rule generalisation for speaker identification in fiction books
Full text PdfPdf (687 KB)
Source SAICSIT; Vol. 204 archive
Proceedings of the 2006 annual research conference of the South African institute of computer scientists and information technologists on IT research in developing countries table of contents
Somerset West, South Africa
Pages: 31 - 40  
Year of Publication: 2006
ISBN:1-59593-567-3
Authors
Kevin Glass  Rhodes University, Grahamstown, South Africa
Shaun Bangay  Rhodes University, Grahamstown, South Africa
Publisher
South African Institute for Computer Scientists and Information Technologists  , Republic of South Africa
Bibliometrics
Downloads (6 Weeks): 2,   Downloads (12 Months): 12,   Citation Count: 2
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1216262.1216266
What is a DOI?

ABSTRACT

This paper presents a hierarchical pattern matching and generalisation technique which is applied to the problem of locating the correct speaker of quoted speech found in fiction books. Patterns from a training set are generalised to create a small number of rules, which can be used to identify items of interest within the text. The pattern matching technique is applied to finding the Speech-Verb, Actor and Speaker of quotes found in fiction books. The technique performs well over the training data, resulting in rule-sets many times smaller than the training set, but providing very high accuracy. While the rule-set generalised from one book is less effective when applied to different books than an approach based on hand coded heuristics, performance is comparable when testing on data closely related to the training set.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
 
3
Ciravegna, F. 2001. Adaptive information extraction from text by rule induction and generalisation. In 17th International Joint Conference on Artificial Intelligence (IJCAI). Seattle.
4
 
5
 
6
Downey, D., Soderland, O. E. S., and Weld, D. 2004. Learning text patterns for web information extraction and assessment. In AAAI 2004 workshop on Adaptive Text Extraction and Mining. San Jose, CA.
 
7
 
8
 
9
Hobbs, J. R. 1978. Resolving pronoun references. Lingua 44, 311--338.
 
10
 
11
 
12
 
13
MUC-7. 1998. MUC-7 test scores introduction. In Proceedings of 7th Message Understanding Conference. Fairfax, Virginia.
14
 
15
 
16
Tapanainen, P. 1999. Parsing in two frameworks: finite-state and functional dependency grammar. Ph.D. thesis, University of Helsinki.
 
17
 
18
 
19
Zhang, J., Black, A., and Sproat, R. 2003. Identifying speakers in children's stories for speech synthesis. In Proceedings of EUROSPEECH 2003. Geneva.


Collaborative Colleagues:
Kevin Glass: colleagues
Shaun Bangay: colleagues