Multi-Agent based Sequence Algorithm for Detecting Plagiarism and Clones in Java Source Code using Abstract Syntax Tree
D. Poongodi,G. Tholkkappia Arasu +1 more
TL;DR: This paper describes method and statement based source code similarity detection, which detects the simple plagiarized code like exact match, near exact match and longest common sequence using multi-agent based detection which will perform the detection automatically.
read more
Abstract: Plagiarism and clone detection plays an important role in software security protection, software maintenance and license issues. Source-code similarity detection method can be classified as string-based, token-based, parse-tree-based and program-dependency-based. All of these approaches have certain limitations and can not meet the requirements when the source code is large and may produce false positives. But, parse-tree based detection improves the detection ability and efficiency. This paper describes method and statement based source code similarity detection, which detects the simple plagiarized code like exact match, near exact match and longest common sequence using multi-agent based detection which will perform the detection automatically. Automatic plagiarism detection will be helpful for code clone detection in software industry and plagiarism detection in projects.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Detecting source code plagiarism on introductory programming course assignments using a bytecode approach
Oscar Karnalim
- 12 Oct 2016
TL;DR: Based on evaluation, it can be concluded that the source code plagiarism detection approach is more effective to detect most plagiarism attack types than raw source code approach on introductory programming course.
50
References
Clone detection using abstract syntax trees
Ira D. Baxter,A. Yahin,Leonardo de Moura,Marcelo Sant'Anna,L. Bier +4 more
- 16 Mar 1998
TL;DR: The paper presents simple and practical methods for detecting exact and near miss clones over arbitrary program fragments in program source code by using abstract syntax trees and suggests that clone detection could be useful in producing more structured code, and in reverse engineering to discover domain concepts and their implementations.
DECKARD: Scalable and Accurate Tree-Based Detection of Code Clones
Lingxiao Jiang,Ghassan Misherghi,Zhendong Su,Stéphane Glondu +3 more
- 24 May 2007
TL;DR: This paper presents an efficient algorithm for identifying similar subtrees and apply it to tree representations of source code and implemented this algorithm as a clone detection tool called DECKARD and evaluated it on large code bases written in C and Java including the Linux kernel and JDK.
A Survey on Software Clone Detection Research
Chanchal K. Roy,James R. Cordy +1 more
- 01 Jan 2007
TL;DR: The state of the art in clone detection research is surveyed, the clone terms commonly used in the literature are described along with their corresponding mappings to the commonly used clone types and several open problems related to clone detectionResearch are pointed out.
Clone Detection Using Abstract Syntax Suffix Trees
Rainer Koschke,Raimar Falke,Pierre Frenzel +2 more
- 23 Oct 2006
TL;DR: This paper describes how to make use of suffix trees to find clones in abstract syntax trees and empirically compares the new technique to other techniques using the Bellon benchmark for clone detectors.
Understanding source code evolution using abstract syntax tree matching
Iulian Neamtiu,Jeffrey S. Foster,Michael Hicks +2 more
- 17 May 2005
TL;DR: A tool for quickly comparing the source code of different versions of a C program based on partial abstract syntax tree matching is presented, and can track simple changes to global variables, types and functions.