ACM Home Page
Please provide us with feedback. Feedback
Statistical entity-topic models
Full text PdfPdf (744 KB)
Source International Conference on Knowledge Discovery and Data Mining archive
Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining table of contents
Philadelphia, PA, USA
POSTER SESSION: Research track posters table of contents
Pages: 680 - 686  
Year of Publication: 2006
ISBN:1-59593-339-5
Authors
David Newman  Univ. of California, Irvine, CA
Chaitanya Chemudugunta  Univ. of California, Irvine, CA
Padhraic Smyth  Univ. of California, Irvine, CA
Sponsors
ACM: Association for Computing Machinery
SIGKDD: ACM Special Interest Group on Knowledge Discovery in Data
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 16,   Downloads (12 Months): 109,   Citation Count: 7
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1150402.1150487
What is a DOI?

ABSTRACT

The primary purpose of news articles is to convey information about who, what, when and where. But learning and summarizing these relationships for collections of thousands to millions of articles is difficult. While statistical topic models have been highly successful at topically summarizing huge collections of text documents, they do not explicitly address the textual interactions between who/where, i.e. named entities (persons, organizations, locations) and what, i.e. the topics. We present new graphical models that directly learn the relationship between topics discussed in news articles and entities mentioned in each article. We show how these entity-topic models, through a better understanding of the entity-topic relationships, are better at making predictions about entities.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
D. Blei and J. Lafferty. Correlated topic models. In Neural Information Processing Systems, volume 18, 2006.
 
3
 
4
 
5
 
6
D. Cohn and T. Hofmann. The missing link - a probabilistic model of document content and hypertext connectivity. In Advances in Neural Information Processing Systems 13, pages 430--436. MIT Press, 2001.
 
7
E. Erosheva, S. Fienberg, and J. Lafferty. Mixed-membership models of scientific publications. Proceedings of the National Academy of Sciences, 101:5220--5227, 2004.
 
8
T. L. Griffiths and M. Steyvers. Finding scientific topics. Proceedings of the National Academy of Sciences of the United States of America, 101:5228--5235, 2004.
 
9
T. L. Griffiths, M. Steyvers, D. Blei, and J. B. Tenenbaum. Integrating topics and syntax. In Advances in Neural Information Processing Systems 17. MIT Press, Cambridge, MA, 2005.
10
 
11
A. McCallum, A. Corrada Emmanuel, and X. Wang. The author-recipient-topic model for topic and role discovery in social networks. Technical Report UM-CS-2004-096, Department of Computer Science, University of Massachusetts, 2004.
 
12
A. McCallum and B. Wellner. Conditional models of identity uncertainty with applications to noun coreference. In Neural Information Processing Systems, 2004.
13
 
14
J. Zhu, A. Goncalves, and V. Uren. Adaptive named entity recognition for social network analysis and domain ontology maintenance. In Proceedings of 3rd Professional Knowledge Management Conference, Springer, LNAI, 2005.

CITED BY  7

Collaborative Colleagues:
David Newman: colleagues
Chaitanya Chemudugunta: colleagues
Padhraic Smyth: colleagues