|
ABSTRACT
The need to manage diverse information sources has triggered the rise of very loosely structured data models, known as "dataspace models." Such information management systems must allow querying in simple ways, mostly by a form of searching. Motivated by these developments, we propose a theory of search queries in a general model of dataspaces. In this model, a dataspace is a collection of data objects, where each data object is a collection of data items. Basic search queries are expressed using filters on data items, following the basic model of boolean search in information retrieval. We characterise semantically the class of queries that can be expressed by searching. We apply our theory to classical relational databases, where we connect search queries to the known class of fully generic queries, and to dataspaces where data items are formed by attribute--value pairs. We also extend our theory to a more powerful, associative form of searching where one can ask for objects that are similar to objects satisfying given search conditions. Such associative search queries are shown to correspond to a very limited kind of joins. Specifically, we show that the basic search language extended with associative search can define exactly the queries definable in a restricted fragment of the semijoin algebra working on an explicit relational representation of the dataspace.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
RDF primer. W3C Recommendation, February 2004.
|
| |
2
|
SPARQL query language for RDF. W3C Recommendation, January 2008.
|
| |
3
|
|
 |
4
|
|
| |
5
|
|
 |
6
|
Catriel Beeri , Tova Milo , Paula Ta-Shma, On genericity and parametricity (extended abstract), Proceedings of the fifteenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems, p.104-116, June 04-06, 1996, Montreal, Quebec, Canada
[doi> 10.1145/237661.237689]
|
| |
7
|
|
| |
8
|
|
| |
9
|
P. Blackburn, J. van Benthem, and F. Wolter, editors. Handbook of Modal Logic. Elsevier, 2007.
|
| |
10
|
A. Chandra and D. Harel. Computable queries for relational data bases. Journal of Computer and System Sciences, 21(2):156--178, 1980.
|
| |
11
|
|
 |
12
|
|
| |
13
|
G. H. L. Fletcher. An algebra for basic graph patterns. Presented at the workshop on Logic in Databases, Rome, Italy, May 2008.
|
 |
14
|
|
| |
15
|
V. Goranko and M. Otto. Model theory of modal logic. In Blackburn et al. {9}, chapter 5.
|
 |
16
|
|
 |
17
|
Alon Halevy , Michael Franklin , David Maier, Principles of dataspace systems, Proceedings of the twenty-fifth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, p.1-9, June 26-28, 2006, Chicago, IL, USA
[doi> 10.1145/1142351.1142352]
|
 |
18
|
H. V. Jagadish , Adriane Chapman , Aaron Elkiss , Magesh Jayapandian , Yunyao Li , Arnab Nandi , Cong Yu, Making database systems usable, Proceedings of the 2007 ACM SIGMOD international conference on Management of data, June 11-14, 2007, Beijing, China
[doi> 10.1145/1247480.1247483]
|
| |
19
|
M. Jain, A. Mendhekar, and D. Van Gucht. A uniform data model for relational data and meta-data query processing. In Advances in Data Management '95, pages 146--165. Tata McGraw-Hill, 1995.
|
| |
20
|
|
| |
21
|
|
| |
22
|
J. Pérez, M. Arenas, and C. Gutierrez. Semantics and complexity of SPARQL. In The Semantic Web: Proceedings ISWC, volume 4273 of Lecture Notes in Computer Science, pages 30--43. Springer, 2006.
|
| |
23
|
K. A. Ross and A. Janevski. Querying faceted databases. In Proceedings 2nd International Workshop on Semantic Web and Databases, volume 3372 of Lecture Notes in Computer Science, pages 199--218. Springer, 2005.
|
| |
24
|
|
|