| A method for online analytical processing of text data |
| Full text |
Pdf
(463 KB)
|
Source
|
Conference on Information and Knowledge Management
archive
Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
table of contents
Lisbon, Portugal
SESSION: OLAP and multi-dimensional databases (DB)
table of contents
Pages 455-464
Year of Publication: 2007
ISBN:978-1-59593-803-9
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 12, Downloads (12 Months): 142, Citation Count: 0
|
|
|
ABSTRACT
There are increasingly visible demands for structured/ unstructured information integration and advanced analytics. However, conventional database technology has not been able to present a robust and practical implementation of a truly integrated architecture for such purposes. After working on several industrial applications (in particular, in the healthcare and life sciences area), we have identified fundamental issues and technical approaches to tackle the issues. In this paper, we propose data representations and algebraic operations for integrating semantic information (e.g., ontologies) into OLAP systems, which allow us to analyze a huge set of textual documents with their underlying semantic information. The performance of the prototype implementation has been evaluated using real world datasets, and the high scalability and flexibility of our approach have been confirmed with respect to the computation time.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Sameet Agarwal , Rakesh Agrawal , Prasad Deshpande , Ashish Gupta , Jeffrey F. Naughton , Raghu Ramakrishnan , Sunita Sarawagi, On the Computation of Multidimensional Aggregates, Proceedings of the 22th International Conference on Very Large Data Bases, p.506-521, September 03-06, 1996
|
| |
2
|
|
| |
3
|
P. Bonatti et al. An Ontology-extended Relational Algebra. Proc. of Int'l Conf. on Information Reuse and Integration, pp. 192--199, 2003.
|
| |
4
|
|
| |
5
|
|
 |
6
|
|
 |
7
|
|
 |
8
|
Ronald Fagin , R. Guha , Ravi Kumar , Jasmine Novak , D. Sivakumar , Andrew Tomkins, Multi-structural databases, Proceedings of the twenty-fourth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, June 13-15, 2005, Baltimore, Maryland
[doi> 10.1145/1065167.1065191]
|
| |
9
|
|
 |
10
|
|
 |
11
|
|
| |
12
|
|
| |
13
|
|
| |
14
|
|
 |
15
|
Yong Kyu Lee , Seong-Joon Yoo , Kyoungro Yoon , P. Bruce Berra, Index structures for structured documents, Proceedings of the first ACM international conference on Digital libraries, p.91-99, March 20-23, 1996, Bethesda, Maryland, United States
[doi> 10.1145/226931.226950]
|
| |
16
|
|
 |
17
|
M. Catherine McCabe , Jinho Lee , Abdur Chowdhury , David Grossman , Ophir Frieder, On the design and evaluation of a multi-dimensional approach to information retrieval (poster session), Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval, p.363-365, July 24-28, 2000, Athens, Greece
[doi> 10.1145/345508.345656]
|
| |
18
|
|
| |
19
|
|
| |
20
|
|
| |
21
|
T. Niemi et al. Logical Multidimensional Database Design for Ragged and Unbalanced Aggregation. Proc. of Int'l Wks. on Design and Management of Data Warehouses, pp. 7, 2001.
|
| |
22
|
|
| |
23
|
|
 |
24
|
|
| |
25
|
J. Pérez et al. IR and OLAP in XML Document Warehouses. Proc. of European Conf. on IR Research, pp. 536--539, 2005.
|
 |
26
|
Igor Tatarinov , Stratis D. Viglas , Kevin Beyer , Jayavel Shanmugasundaram , Eugene Shekita , Chun Zhang, Storing and querying ordered XML using a relational database system, Proceedings of the 2002 ACM SIGMOD international conference on Management of data, June 03-06, 2002, Madison, Wisconsin
[doi> 10.1145/564691.564715]
|
| |
27
|
N. Uramoto , H. Matsuzawa , T. Nagano , A. Murakami , H. Takeuchi , K. Takeda, A text-mining system for knowledge discovery from biomedical documents, IBM Systems Journal, v.43 n.3, p.516-533, July 2004
|
| |
28
|
|
|