ACM Home Page
Please provide us with feedback. Feedback
YAP3: improved detection of similarities in computer program and other texts
Full text PdfPdf (488 KB)
Source Technical Symposium on Computer Science Education archive
Proceedings of the twenty-seventh SIGCSE technical symposium on Computer science education table of contents
Philadelphia, Pennsylvania, United States
Pages: 130 - 134  
Year of Publication: 1996
ISBN:0-89791-757-X
Also published in ...
Author
Michael J. Wise  Department of Computer Science, University of Sydney, Australia
Sponsor
SIGCSE: ACM Special Interest Group on Computer Science Education
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 18,   Downloads (12 Months): 104,   Citation Count: 16
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/236452.236525
What is a DOI?

ABSTRACT

In spite of years of effort, plagiarism in student assignment submissions still causes considerable difficulties for course designers; if students' work is not their own, how can anyone be certain they have learnt anything? YAP is a system for detecting suspected plagiarism in computer programs and other texts submitted by students. The paper reviews YAP3, the third version of YAP, focusing on its novel underlying algorithm - Running-Karp-Rabin Greedy-String-Tiling (or RKS-GST), whose development arose from the observation with YAP and other systems that students shuffle independent code segments. YAP3 is able to detect transposed subsequences, and is less perturbed by spurious additional statements. The paper concludes with a discussion of recent extension of YAP to English texts, further illustrating the flexibility of the YAP approach.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
3
4
 
5
 
6
KRUSKAL, JOSEPH B., "An Overview of Sequence Comparison", Time Warps, String Edits and Macromolecules: The'Theory and Practice of Sequence Comparison, ed. David Sankoff and Joseph B. Kruskal, pp. 1--44, Addison Wesley (I983) (Chapter 1).
 
7
VERCO, KRISTINA L. AND MICHAEL J. WISE, "A Comparison of Structure-Metric and Counting Metric Plagiarism Detection Systems", Twenty-Seventh SIGCSE Technical Symposium, Philadelphia, U.S.A. (February 15-17, 1996) (Submitted to conference).
 
8
9
 
10
WISE, MICHAEL J, "Running Karp-Rabin Matching and Greedy String Tiling", Basser Department of Computer Science Technical Report, Sydney University(1994)(ftp://ftp.cs.su. oz.au/michaelw/rkr_gst.ps Revises Basset Technical Report 463, March 1993).
 
11
WISE, MICHAEL J, "Neweye~'. A System for Comparing Biological Sequences Using the Running Karp-Rabin Greedy String-Tiling Algorithm", Third hzternational Conference on Intelligent Systems for Molecular Biology, ambridge,England., pp. 393-401 (July 16-19, 1995).

CITED BY  16