|
ABSTRACT
In spite of years of effort, plagiarism in student assignment submissions still causes considerable difficulties for course designers; if students' work is not their own, how can anyone be certain they have learnt anything? YAP is a system for detecting suspected plagiarism in computer programs and other texts submitted by students. The paper reviews YAP3, the third version of YAP, focusing on its novel underlying algorithm - Running-Karp-Rabin Greedy-String-Tiling (or RKS-GST), whose development arose from the observation with YAP and other systems that students shuffle independent code segments. YAP3 is able to detect transposed subsequences, and is less perturbed by spurious additional statements. The paper concludes with a discussion of recent extension of YAP to English texts, further illustrating the flexibility of the YAP approach.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
|
 |
3
|
|
 |
4
|
|
| |
5
|
|
| |
6
|
KRUSKAL, JOSEPH B., "An Overview of Sequence Comparison", Time Warps, String Edits and Macromolecules: The'Theory and Practice of Sequence Comparison, ed. David Sankoff and Joseph B. Kruskal, pp. 1--44, Addison Wesley (I983) (Chapter 1).
|
| |
7
|
VERCO, KRISTINA L. AND MICHAEL J. WISE, "A Comparison of Structure-Metric and Counting Metric Plagiarism Detection Systems", Twenty-Seventh SIGCSE Technical Symposium, Philadelphia, U.S.A. (February 15-17, 1996) (Submitted to conference).
|
| |
8
|
|
 |
9
|
|
| |
10
|
WISE, MICHAEL J, "Running Karp-Rabin Matching and Greedy String Tiling", Basser Department of Computer Science Technical Report, Sydney University(1994)(ftp://ftp.cs.su. oz.au/michaelw/rkr_gst.ps Revises Basset Technical Report 463, March 1993).
|
| |
11
|
WISE, MICHAEL J, "Neweye~'. A System for Comparing Biological Sequences Using the Running Karp-Rabin Greedy String-Tiling Algorithm", Third hzternational Conference on Intelligent Systems for Molecular Biology, ambridge,England., pp. 393-401 (July 16-19, 1995).
|
CITED BY 14
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Robert N. Lass , Christopher D. Cera , Nathaniel T. Bomberger , Bruce Char , Jeffrey L. Popyack , Nira Herrmann , Paul Zoski, Tools and techniques for large scale grading using Web-based commercial off-the-shelf software, ACM SIGCSE Bulletin, v.35 n.3, September 2003
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Peer to Peer - Readers of this Article have also read:
-
Data structures for quadtree approximation and compression
Communications of the ACM
28, 9
Hanan Samet
-
A hierarchical single-key-lock access control using the Chinese remainder theorem
Proceedings of the 1992 ACM/SIGAPP Symposium on Applied computing
Kim S. Lee
, Huizhu Lu
, D. D. Fisher
-
The GemStone object database management system
Communications of the ACM
34, 10
Paul Butterworth
, Allen Otis
, Jacob Stein
-
Putting innovation to work: adoption strategies for multimedia communication systems
Communications of the ACM
34, 12
Ellen Francik
, Susan Ehrlich Rudman
, Donna Cooper
, Stephen Levine
-
An intelligent component database for behavioral synthesis
Proceedings of the 27th ACM/IEEE Design Automation Conference on
Gwo-Dong Chen
, Daniel D. Gajski
|