ACM Home Page
Please provide us with feedback. Feedback
Connections: using context to enhance file search
Full text PdfPdf (361 KB)
Source ACM Symposium on Operating Systems Principles archive
Proceedings of the twentieth ACM symposium on Operating systems principles table of contents
Brighton, United Kingdom
SESSION: History and context table of contents
Pages: 119 - 132  
Year of Publication: 2005
ISBN:1-59593-079-5
Also published in ...
Authors
Craig A. N. Soules  Carnegie Mellon University
Gregory R. Ganger  Carnegie Mellon University
Sponsors
ACM: Association for Computing Machinery
SIGOPS: ACM Special Interest Group on Operating Systems
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 10,   Downloads (12 Months): 149,   Citation Count: 16
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1095810.1095822
What is a DOI?

ABSTRACT

Connections is a file system search tool that combines traditional content-based search with context information gathered from user activity. By tracing file system calls, Connections can identify temporal relationships between files and use them to expand and reorder traditional content search results. Doing so improves both recall (reducing false-positives) and precision (reducing false-negatives). For example, Connections improves the average recall (from 13% to 22%) and precision (from 23% to 29%) on the first ten results. When averaged across all recall levels, Connections improves precision from 17% to 28%. Connections provides these benefits with only modest increases in average query time (2 seconds), indexing time (23 seconds daily), and index size(under 1% of the user's data set).


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
N. Abdul-Jaleel, A. Corrada-Emmanuel, Q. Li, X. Liu, C. Wade, and J. Allan. UMass at TREC 2003: HARD and QA. Text Retrieval Conference, pages 715--725, 2003.
 
2
A. Amer, D. Long, J.-F. Paris, and R. Burns. File access prediction with adjustable accuracy. International Performance Conference on Computers and Communication. IEEE, 2002.
 
3
 
4
 
5
6
7
 
8
9
 
10
Google, http://www.google.com/.
 
11
Google Desktop, http://desktop.google.com/.
 
12
 
13
J. Griffioen and R. Appleton. Reducing file system latency using a predictive approach. Summer USENIX Technical Conference, pages 197--207. USENIX Association, 1994.
 
14
Grokker, http://www.grokker.com/.
 
15
T. H. Haveliwala. Topic-sensitive PageRank: a context-sensitive ranking algorithm for web search. IEEE Transactions on Knowledge and Data Engineering, 15(4):784--796. IEEE, August 2003.
 
16
B. Hayes. Terabyte territory. American Scientist, 90(3):212--216, May--June 2002.
 
17
E. Horvitz, J. Breese, D. Heckerman, D. Hovel, and K. Rommelse. The Lumiere project: bayesian user modeling for inferring the goals and needs of software users. Conference on Uncertainty in Artificial Intelligence, pages 256--265, 1998.
18
19
20
 
21
T. M. Kroeger and D. D. E. Long. Predicting file system actions from prior events. USENIX Annual Technical Conference, pages 319--328. USENIX Association, 1996.
 
22
G. H. Kuenning. Seer: predictive file hoarding for disconnected mobile operation. Technical Report UCLA--CSD--970015. University of California, Los Angeles, May 1997.
 
23
L. S. Larkey, J. Allan, M. E. Connell, A. Bolivar, and C. Wade. UMass at TREC 2002: cross language and novelty tracks. Text Retrieval Conference, 2002.
 
24
H. Lei and D. Duchamp. An analytical approach to file prefetching. USENIX Annual Technical Conference. USENIX Association,, 1997.
 
25
The Lemur Toolkit, http://www.lemurproject.org/.
 
26
Lycos, http://www.lycos.com/.
 
27
 
28
U. Manber, M. Smith, and B. Gopal. WebGlimpse: combining browsing and searching. USENIX Annual Technical Conference. USENIX Association, 1997.
 
29
U. Manber and S. Wu. GLIMPSE: a tool to search through entire file systems. Winter USENIX Technical Conference, pages 23--32. USENIX Association, 1994.
30
 
31
 
32
D. Metzler, T. Strohman, H. Turtle, and W. B. Croft. Indri at TREC 2004: terabyte track. Text Retrieval Conference, 2004.
 
33
M. A. Olson, K. Bostic, and M. Seltzer. Berkeley DB. Summer USENIX Technical Conference. USENIX Association, 1999.
 
34
D. Quan, D. Huynh, and D. R. Karger. Haystack: a platform for authoring end user semantic web applications. International Semantic Web Conference, 2003.
 
35
 
36
B. Rhodes and T. Starner. The Remembrance Agent: a continuously running automated information retrieval system. International Conference on The Practical Application of Intelligent Agents and Multi Agent Technology, pages 487--495, 1996.
 
37
S. Sechrest and M. McClennen. Blending hierarchical and attribute-based file naming. International Conference on Distributed Computing Systems, pages 572--580, 1992.
38
39
 
40
Terrier Information Retrieval Platform, http://ir.dcs.gla.ac.uk/terrier/.
41
 
42
E. M. Voorhees. Overview of TREC 2003. Text Retrieval Conference. NIST, 2003.
 
43
Merriam-Webster OnLine, http://www.m-w.com/.
 
44
X1 Desktop Search, http://www.x1.com/.
 
45
Yahoo!, http://www.yahoo.com/.

CITED BY  17

Collaborative Colleagues:
Craig A. N. Soules: colleagues
Gregory R. Ganger: colleagues