ACM Home Page
Please provide us with feedback. Feedback
MauveDB: supporting model-based user views in database systems
Full text PdfPdf (589 KB)
Source International Conference on Management of Data archive
Proceedings of the 2006 ACM SIGMOD international conference on Management of data table of contents
Chicago, IL, USA
SESSION: Database technology for novel applications table of contents
Pages: 73 - 84  
Year of Publication: 2006
ISBN:1-59593-434-0
Authors
Amol Deshpande  University of Maryland
Samuel Madden  MIT
Sponsors
ACM: Association for Computing Machinery
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 15,   Downloads (12 Months): 122,   Citation Count: 16
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1142473.1142483
What is a DOI?

ABSTRACT

Real-world data --- especially when generated by distributed measurement infrastructures such as sensor networks --- tends to be incomplete, imprecise, and erroneous, making it impossible to present it to users or feed it directly into applications. The traditional approach to dealing with this problem is to first process the data using statistical or probabilistic models that can provide more robust interpretations of the data. Current database systems, however, do not provide adequate support for applying models to such data, especially when those models need to be frequently updated as new data arrives in the system. Hence, most scientists and engineers who depend on models for managing their data do not use database systems for archival or querying at all; at best, databases serve as a persistent raw data store.In this paper we define a new abstraction called model-based views and present the architecture of MauveDB, the system we are building to support such views. Just as traditional database views provide logical data independence, model-based views provide independence from the details of the underlying data generating mechanism and hide the irregularities of the data by using models to present a consistent view to the users. MauveDB supports a declarative language for defining model-based views, allows declarative querying over such views using SQL, and supports several different materialization strategies and techniques to efficiently maintain them in the face of frequent updates. We have implemented a prototype system that currently supports views based on regression and interpolation, using the Apache Derby open source DBMS, and we present results that show the utility and performance benefits that can be obtained by supporting several different types of model-based views in a database system.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
 
3
The Apache Derby Project. Web Site. http://db.apache.org/derby/.
 
4
5
6
 
7
8
 
9
M. Chu, H. Haussecker, and F. Zhao. Scalable information-driven sensor querying and routing for ad hoc heterogeneous sensor networks. In Intl Journal of High Performance Computing Applications, 2002.
 
10
Nilesh N. Dalvi and Dan Suciu. Efficient query evaluation on probabilistic databases. In VLDB, 2004.
 
11
 
12
Amol Deshpande, Carlos Guestrin, Sam Madden, Joe Hellerstein, and Wei Hong. Model-driven data acquisition in sensor networks. In VLDB, 2004.
13
 
14
G. Golub and C. Van Loan. Matrix Computations. Johns Hopkins, 1989.
15
 
16
17
 
18
 
19
 
20
DB2 Intelligent Miner. Web Site. http://www-306.ibm.com/software/data/iminer/.
21
22
23
24
 
25
 
26
L. Liao, D. Fox, and H. Kautz. Location-based activity recognition using relational markov networks. In IJCAI, 2005.
 
27
Sam Madden. Intel lab data, 2004. http://berkeley.intel-research.net/labdata.
 
28
Samuel Madden, Wei Hong, Joseph M. Hellerstein, and Michael Franklin. TinyDB web page. http://telegraph.cs.berkeley.edu/tinydb.
29
 
30
Erin McKean, editor. The Oxford English Dictionary (2nd Edition). Oxford Univeristy Press, 2005.
31
 
32
George M. Phillips. Interpolation and Approximation by Polynomials. Springer-Verlag, 2003.
 
33
PMML 3.0 Specification. Web Site. http://www.dmg.org/v3-0/GeneralStructure.html.
34
 
35
Business Analytics Software Solutions (SAS). Web Site. http://www.sas.com/technologies/analytics.
 
36
J. Widom. Trio: A system for integrated management of data, accuracy, and lineage. In CIDR, 2005.
37
 
38
Y. Yao and J. Gehrke. Query processing in sensor networks. In CIDR, 2003.

CITED BY  17

Collaborative Colleagues:
Amol Deshpande: colleagues
Samuel Madden: colleagues