Journal Article10.1142/S0218348X9800016X
Correlations in computer programs
3
TL;DR: The results of the application of the Pearson and Spearman correlation methods to the source code are coupled with the random walk model applied to the binary code to identify cases of plagiarism.
read more
Abstract: Numerical and statistical methods are used to analyze and classify computer programs. Both computer source code and object files are examined. The results of the application of the Pearson and Spearman correlation methods to the source code are coupled with the random walk model applied to the binary code. One of the practical consequences of the analysis is the ability to quantify the degree of similarity between different computer programs and, hence, identify cases of plagiarism.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Source-code Similarity Detection and Detection Tools Used in Academia: A Systematic Review
TL;DR: This review gives an overview of definitions of plagiarism, plagiarism detection tools, comparison metrics, obfuscation methods, datasets used for comparison, and algorithm types and identifies interesting insights about metrics and datasets for quantitative tool comparison and categorisation of detection algorithms.
115
Copy detection for intellectual property protection of VLSI designs
Andrew B. Kahng,Darko Kirovski,Stefanus Mantik,Miodrag Potkonjak,Jennifer L. Wong +4 more
- 07 Nov 1999
TL;DR: This work proposes a generic methodology for copy detection based on determining basic elements within structural representations of solutions (IPs), calculating (context-independent) signatures for such elements, and performing fast comparisons to identify potential violators of IP rights.
(max, +) Dynamics Systems for Modeling Traffic with Long-Range Dependence
TL;DR: A simple first-order recurrence in a (max, +) dynamic system is numerically investigated and shown to exhibit statistical long-range dependence, characterized by slowly decaying aggregated variances and power-law evolutions of the autocorrelation and spectrum.
References
Long range correlation in human writings
TL;DR: In this paper, the authors study long range correlation in various human writings by mapping them into a simple 1d random walk model, which allows them to obtain better quality scaling data than the traditional power spectrum methods.
131
Language and codification dependence of long-range correlations in texts
TL;DR: The authors studied the long-range correlations in various translations of the Bible by mapping them onto a one dimensional random walk model and showed that the exponent α which characterizes the algebraic decay law of these correlations depends on both the language and the codification.
46
Numerical Analysis of Word Frequencies in Artificial and Natural Language Texts
TL;DR: The authors performed a numerical study of the statistical properties of natural texts written in English and of two types of artificial texts and found that the analysis of vocabulary growth showed the most striking difference between natural and artificial texts.