Clone-advisor: recommending code tokens and clone methods with deep learning and information retrieval

doi:10.7717/PEERJ-CS.737

Open AccessJournal Article10.7717/PEERJ-CS.737

Clone-advisor: recommending code tokens and clone methods with deep learning and information retrieval

Muhammad Hammad, +4 more

- 09 Nov 2021

- PeerJ

- Vol. 7

5

TL;DR: Zhang et al. as discussed by the authors proposed a deep learning approach for modeling code clones along with non-cloned code to predict the next set of tokens (possibly a complete clone method body) based on the code written so far.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.1016/j.jss.2023.111934

A survey on machine learning techniques applied to source code

Tushar Sharma, +6 more

- 01 Dec 2023

- Journal of Systems and Software

TL;DR: This paper surveys 494 studies on machine learning techniques applied to source code analysis, summarizing 12 software engineering tasks, tools, and datasets, and highlighting increasing use, challenges, and a comprehensive list of available resources.

...read moreread less

15

Journal Article•10.1145/3674805.3690757

Code Clone Configuration as a Multi-Objective Search Problem

Denis Sousa, +3 more

- 15 Oct 2024

1

•Journal Article•10.1016/j.simpa.2022.100323

Clone-Writer: An effective editor for developing code by using code clones

Muhammad Hammad, +3 more

- 01 Aug 2022

- Software impacts

TL;DR: Clone-Writer as discussed by the authors is an automated software development tool that recommends code clones on the basis of code written so far, and developers can perform code clone search based on a search query written either as source code terms, or as natural language.

...read moreread less

1

•Journal Article•10.1109/access.2022.3145686

Clone-Seeker: Effective Code Clone Search Using Annotations

01 Jan 2022

- IEEE Access

TL;DR: Zhang et al. as mentioned in this paper proposed a novel approach called clone-Seeker that focuses on utilizing clone class features in retrieving code clones, which can help developers to perform code clone search based on a search query written either as source code terms, or as natural language.

...read moreread less

10.48550/arxiv.2110.09610

A Survey on Machine Learning Techniques for Source Code Analysis

Tushar Sharma, +6 more

TL;DR: This survey of 479 studies (2011-2021) on machine learning for source code analysis identifies increasing adoption, synthesizes workflows and techniques, and highlights challenges in standard datasets, reproducibility, and hardware resources for software engineering tasks.

...read moreread less

References

Journal Article•10.1162/NECO.1997.9.8.1735

Long short-term memory

Sepp Hochreiter, +1 more

- 01 Nov 1997

- Neural Computation

TL;DR: A novel, efficient, gradient based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units.

...read moreread less

99K

Journal Article•10.1002/J.1538-7305.1948.TB01338.X

A mathematical theory of communication

Claude E. Shannon

- 01 Jul 1948

- Bell System Technical Journal

TL;DR: This final installment of the paper considers the case where the signals or the messages or both are continuously variable, in contrast with the discrete nature assumed until now.

...read moreread less

74.4K

Proceedings Article•10.3115/V1/D14-1162

Glove: Global Vectors for Word Representation

Jeffrey Pennington, +2 more

- 01 Oct 2014

TL;DR: A new global logbilinear regression model that combines the advantages of the two major model families in the literature: global matrix factorization and local context window methods and produces a vector space with meaningful substructure.

...read moreread less

41.6K

•Proceedings Article•10.3115/V1/D14-1179

Learning Phrase Representations using RNN Encoder--Decoder for Statistical Machine Translation

Kyunghyun Cho, +8 more

- 01 Jan 2014

TL;DR: In this paper, the encoder and decoder of the RNN Encoder-Decoder model are jointly trained to maximize the conditional probability of a target sequence given a source sequence.

...read moreread less

28.6K

•Proceedings Article

Efficient Estimation of Word Representations in Vector Space

Tomas Mikolov, +3 more

- 16 Jan 2013

TL;DR: Two novel model architectures for computing continuous vector representations of words from very large data sets are proposed and it is shown that these vectors provide state-of-the-art performance on the authors' test set for measuring syntactic and semantic word similarities.

...read moreread less

27.5K