ACM Home Page
Please provide us with feedback. Feedback
Why not?
Full text PdfPdf (538 KB)
Source
International Conference on Management of Data archive
Proceedings of the 35th SIGMOD international conference on Management of data table of contents
Providence, Rhode Island, USA
SESSION: Research session 14: understanding data and queries table of contents
Pages 523-534  
Year of Publication: 2009
ISBN:978-1-60558-551-2
Authors
Adriane Chapman  The MITRE Corporation, McLean, VA, USA
H. V. Jagadish  University of Michigan, Ann Arbor, MI, USA
Sponsors
ACM: Association for Computing Machinery
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 82,   Downloads (12 Months): 335,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1559845.1559901
What is a DOI?

ABSTRACT

As humans, we have expectations for the results of any action, e.g. we expect at least one student to be returned when we query a university database for student records. When these expectations are not met, traditional database users often explore datasets via a series of slightly altered SQL queries. Yet most database access is via limited interfaces that deprive end users of the ability to alter their query in any way to garner better understanding of the dataset and result set. Users are unable to question why a particular data item is Not in the result set of a given query. In this work, we develop a model for answers to WHY NOT? queries. We show through a user study the usefulness of our answers, and describe two algorithms for finding the manipulation that discarded the data item of interest. Moreover, we work through two different methods for tracing the discarded data item that can be used with either algorithm. Using our algorithms, it is feasible for users to find the manipulation that excluded the data item of interest, and can eliminate the need for exhausting debugging.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
 
3
Shawn Bowers, Timothy McPhillips, Martin Wu, and Bertram LudÃd'scher. Project histories: Managing data provenance across collection-oriented scientific workflow runs. In DILS, pages 27--29, 2007.
4
 
5
6
 
7
8
 
9
 
10
 
11
Susan Davidson, Sarah Cohen-Boulakia, Anat Eyal, Bertram Ludascher, Timothy McPhillips, Shawn Bowers, and Juliana Freire. Provenance in scientific workflow systems. IEEE Data Engineering Bulletin, 32(4):44--50, 2007.
 
12
 
13
 
14
 
15
Magesh Jayapandian, Adriane Chapman, et al. Michigan Molecular Interactions (MiMI): Putting the jigsaw puzzle together. Nucleic Acids Research, pages D566--D571, Jan 2007.
16
 
17
 
18
 
19
Michi Mutsuzaki, Martin Theobald, et al. Trio-One: Layering uncertainty and lineage on a conventional DBMS. In CIDR, pages 269--274, 2007.
20
 
21
22
 
23
D. De Roure and C. Goble. myExperiment - a web 2.0 virtual research environment. In International Workshop on Virtual Research Environments and Collaborative Work Environments, 2007.
 
24
Jennifer Widom. Trio: A system for integrated management of data, accuracy, and lineage. In CIDR, 2005.
 
25

Collaborative Colleagues:
Adriane Chapman: colleagues
H. V. Jagadish: colleagues