ACM Home Page
Please provide us with feedback. Feedback
Enabling enterprise mashups over unstructured text feeds with InfoSphere MashupHub and SystemT
Full text PdfPdf (1.84 MB)
Source
International Conference on Management of Data archive
Proceedings of the 35th SIGMOD international conference on Management of data table of contents
Providence, Rhode Island, USA
DEMONSTRATION SESSION: Demonstration session: group D table of contents
Pages 1123-1126  
Year of Publication: 2009
ISBN:978-1-60558-551-2
Authors
David E. Simmen  IBM Almaden Research Center, San Jose, CA, USA
Frederick Reiss  IBM Almaden Research Center, San Jose , CA, USA
Yunyao Li  IBM Almaden Research Center, San Jose , CA, USA
Suresh Thalamati  IBM Almaden Research Center, San Jose , CA, USA
Sponsors
ACM: Association for Computing Machinery
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 37,   Downloads (12 Months): 117,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1559845.1559999
What is a DOI?

ABSTRACT

Enterprise mashup scenarios often involve feeds derived from data created primarily for eye consumption, such as email, news, calendars, blogs, and web feeds. These data sources can test the capabilities of current data mashup products, as the attributes needed to perform join, aggregation, and other operations are often buried within unstructured feed text. Information extraction technology is a key enabler in such scenarios, using annotators to convert unstructured text into structured information that can facilitate mashup operations.

Our demo presents the integration of SystemT, an information extraction system from IBM Research, with IBM's InfoSphere MashupHub. We show how to build domain-specific annotators with SystemT's declarative rule language, AQL, and how to use these annotators to combine structured and unstructured information in an enterprise mashup.



Collaborative Colleagues:
David E. Simmen: colleagues
Frederick Reiss: colleagues
Yunyao Li: colleagues
Suresh Thalamati: colleagues