ACM Home Page
Please provide us with feedback. Feedback
A bootstrapping approach for identifying stakeholders in public-comment corpora
Full text PdfPdf (872 KB)
Source
dg.o; Vol. 228 archive
Proceedings of the 8th annual international conference on Digital government research: bridging disciplines & domains table of contents
Philadelphia, Pennsylvania
SESSION: Advances in technology table of contents
Pages: 92 - 101  
Year of Publication: 2007
ISBN:1-59593-599-1
Authors
Jaime Arguello  Carnegie Mellon University, Pittsburgh, PA
Jamie Callan  Carnegie Mellon University, Pittsburgh, PA
Sponsors
: Center for Technology in Government
: CISCO
: Center for Statistical Ecology and Environmental Statistics
: CIMIC
Publisher
Bibliometrics
Downloads (6 Weeks): 1,   Downloads (12 Months): 40,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  

ABSTRACT

A stakeholder is an individual, group, organization, or community that has an interest or stake in a consensus-building process. The goal of stakeholder identification is identifying stakeholder mentions in natural language text. We present novel work in using a bootstrapping approach for the identification of stakeholders in public comment corpora. We refine the definition of a stakeholder by categorizing stakeholders into 2 distinct stakeholder types and experiment with automatically identifying one of these two types: instances where the author identifies him/herself as a member of a particular group. An existing bootstrapping information extraction algorithm is combined individually with 3 distinct extraction pattern templates. Results show that this stakeholder group can be learned in a minimally supervised bootstrapping framework using 2 of the 3 extraction pattern templates. An experimental analysis explores the challenges in applying the third extraction pattern template to this problem. Results on all 3 extraction pattern templates provide insight on the unique and novel challenge of identifying stakeholders.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
Collins, M. and Singer. Y. Unsupervised models for named entity Classification. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP-99). 1999.
3
 
4
Hearst, M. Automatic Discovery of WordNet Relations. In WordNet: An Electronic Lexical Database and Some of its Applications. Christiane Fellbaum, MIT Press, 1998.
5
 
6
Nobata, C. and Sekine S. Towards Automatic Acquisition of Patterns for Information Extraction. In Proceedings of the International Conference of Computer Processing of Oriental Languages. 1999.
 
7
Patwardhan S. and Riloff, E. Learning Domain Specific Information Extraction Patterns from the Web. In Proceedings of the Workshop on Information Extraction beyond the Document (ACL-06). 2006.
 
8
 
9
Riloff, E. and Phillips, W. An Introduction to the Sundance and Autoslog Systems. University of Utah Technical Report #UUCS-04-015. 2004
 
10
 
11
 
12
13
14


Collaborative Colleagues:
Jaime Arguello: colleagues
Jamie Callan: colleagues