ACM Home Page
Please provide us with feedback. Feedback
Query length impact on misuse detection in information retrieval systems
Full text PdfPdf (154 KB)
Source Symposium on Applied Computing archive
Proceedings of the 2005 ACM symposium on Applied computing table of contents
Santa Fe, New Mexico
SESSION: Information access and retrieval (IAR) table of contents
Pages: 1070 - 1075  
Year of Publication: 2005
ISBN:1-58113-964-0
Authors
Ling Ma  Illinois Institute of Technology
Nazli Goharian  Illinois Institute of Technology
Sponsor
SIGAPP: ACM Special Interest Group on Applied Computing
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 10,   Downloads (12 Months): 46,   Citation Count: 3
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1066677.1066921
What is a DOI?

ABSTRACT

Misuse is the abuse of privileges by an authorized user and is the second most common form of computer crime after viruses. Earlier we proposed a misuse detection approach for information retrieval systems that relied on relevance feedback. The central idea focused on the building of a user profile containing both query and feedback terms from prior queries. Our algorithm matched new activities to existing profiles and assigned a likelihood of misuse to an activity. Only initial evaluation was provided.We now expand and evaluate our system using both short and long queries noting the effect of query length in the accuracy of the detection. The results indicate an overall precision of 83.9% when short queries are used, and 82.2% for long queries. The rate of the undetected misuse for short queries is less than 2% and for long queries less than 6%. Although higher precision score configurations result in a lower false alarm rate, unfortunately, they increase the rate of undetected misuse both for short and long queries. Given this tradeoff, for any particular application constraint, system behavior can be tuned to minimize either false alarms or undetected misuse.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
Press Release, Computer Crime and Intellectual Property section of the Criminal Division of US Dept. of Justice, 2001. http://www.usdoj.gov/criminal/cyber crime/turnerPlea.htm
 
3
Press Release, Computer Crime and Intellectual Property section of the Criminal Division of US Dept. of Justice, 2001. http://www.usdoj.gov/criminal/cybercrime/Osowski_TangSent.htm
 
4
Press Release, United State Attorney's Office Northern District of Texas, US Department of Justice, November 5, 2003. http://www.usdoj.gov/usao/txn/PressRel03/fudge_ind_pr.html
 
5
J. Marin, D. Ragsdale, and J. Surdu, A hybrid approach to the profile creation and intrusion detection, DARPA Info. Surv. Conf. and Expo. 2001.
 
6
C. Ling, J. Gao, H. Zhang, W. Qian, H. Zhang, Improving encarta search engine performance by mining user logs, Int. Journal of Pattern Recognition and Artificial Intelligence, 2002.
 
7
8
9
 
10
 
11
National Institute of Standards and Technology, Text retrieval conference(trec), December 2002, http://trec.nist.gov/.
12
13
 
14
Dwi H. Widyantoro and John Yen, Learning User Interest Dynamics with Three-Descriptor Representation, JASIS 2000.
15