ACM Home Page
Please provide us with feedback. Feedback
Incorporation of corpus-specific semantic information into question answering context
Full text PdfPdf (157 KB)
Source
Conference on Information and Knowledge Management archive
Proceeding of the 2nd international workshop on Ontologies and nformation systems for the semantic web table of contents
Napa Valley, California, USA
SESSION: Session 2 table of contents
Pages 25-30  
Year of Publication: 2008
ISBN:978-1-60558-255-9
Authors
Protima Banerjee  Drexel University, Philadelphia, PA, USA
Hyoil Han  Drexel University, Philadelphia, PA, USA
Sponsors
ACM: Association for Computing Machinery
SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 6,   Downloads (12 Months): 65,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1458484.1458497
What is a DOI?

ABSTRACT

In today's environment of information overload, Question Answering (QA) is a critically important research area for the Semantic Web. In order for humans to make effective use of the expansive information sources available to us, we require automated tools to help us make sense of large amounts of data. Within this framework, Question Context plays an important role. We define Question Context to be an semantic structure that can be used to enrich queries so that the user's information need is better represented. This paper describes the theoretical foundations of a novel approach that uses statistical language modeling techniques to create Question Context and to then integrate it into the Information Retrieval stage of QA. We base our approach on two established language modeling methods - the Aspect Model, which is the basis of Probabilistic Latent Semantic Analysis (PLSA) and Relevance-Based Language Models. Our approach proposes an Aspect-Based Relevance Language Model as the Question Context Model, and our methodology incorporates corpus-specific semantic concepts into the QA process. Words from the most heavily relevant aspects are then incorporated into the query. We present some interesting preliminary qualitative results that show the potential usefulness of the Question Context Model to both the first (IR) and second (Intelligent Information Processing) stages of QA.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Dempster, A. P., Laird, N. M., and Rubin, D. B., "Maximum Likelihood from Incomplete Data via the EM Algorithm," Journal of the Royal Statistical Society, vol. 39, pp. 1--38, 1977.
 
2
3
4
 
5
 
6
Strohman, T., Metzler, D., Turtle, H., and Croft, W. B., "Indri: A language model-based search engine for complex queries," in presented as a poster at the International Conference on Intelligence Analysis McLean, VA, 2005.
 
7
 
8
Voorhees, E. M., "Overview of the TREC 2006 Question Answering Track," in Online proceedings of 2006 Text Retrieval Conference, 2006.

Collaborative Colleagues:
Protima Banerjee: colleagues
Hyoil Han: colleagues