Intelligent code completion

Topic Tools

Papers

Proceedings Article•10.1145/1595696.1595728•

Learning from examples to improve code completion systems

[...]

Marcel Bruch¹, Martin Monperrus¹, Mira Mezini¹•Institutions (1)

24 Aug 2009

TL;DR: Evidence is given that intelligent code completion systems which learn from examples significantly outperform mainstream codepletion systems in terms of the relevance of their suggestions and thus have the potential to enhance developers' productivity.

...read moreread less

Abstract: The suggestions made by current IDE's code completion features are based exclusively on static type system of the programming language. As a result, often proposals are made which are irrelevant for a particular working context. Also, these suggestions are ordered alphabetically rather than by their relevance in a particular context. In this paper, we present intelligent code completion systems that learn from existing code repositories. We have implemented three such systems, each using the information contained in repositories in a different way. We perform a large-scale quantitative evaluation of these systems, integrate the best performing one into Eclipse, and evaluate the latter also by a user study. Our experiments give evidence that intelligent code completion systems which learn from examples significantly outperform mainstream code completion systems in terms of the relevance of their suggestions and thus have the potential to enhance developers' productivity.

...read moreread less

457 citations

Proceedings Article•10.24963/IJCAI.2018/578•

Code Completion with Neural Attention and Pointer Networks.

[...]

Jian Li, Yue Wang, Irwin King, Michael R. Lyu

27 Nov 2017-arXiv: Computation and Language

TL;DR: Zhang et al. as discussed by the authors proposed a pointer mixture network to generate within-vocabulary words through an RNN component, or regenerate an OoV word from local context through a pointer component.

...read moreread less

Abstract: Intelligent code completion has become an essential research task to accelerate modern software development. To facilitate effective code completion for dynamically-typed programming languages, we apply neural language models by learning from large codebases, and develop a tailored attention mechanism for code completion. However, standard neural language models even with attention mechanism cannot correctly predict the out-of-vocabulary (OoV) words that restrict the code completion performance. In this paper, inspired by the prevalence of locally repeated terms in program source code, and the recently proposed pointer copy mechanism, we propose a pointer mixture network for better predicting OoV words in code completion. Based on the context, the pointer mixture network learns to either generate a within-vocabulary word through an RNN component, or regenerate an OoV word from local context through a pointer component. Experiments on two benchmarked datasets demonstrate the effectiveness of our attention mechanism and pointer mixture network on the code completion task.

...read moreread less

192 citations

Journal Article•10.1145/2744200•

Intelligent Code Completion with Bayesian Networks

[...]

Sebastian Proksch¹, Johannes Lerch¹, Mira Mezini¹•Institutions (1)

Technische Universität Darmstadt¹

02 Dec 2015-ACM Transactions on Software Engineering and Methodology

TL;DR: This work compares the new approach, Pattern-based Bayesian Networks (PBN), to the existing BMN algorithm, and shows that PBN can obtain comparable prediction quality to BMN, while model size and inference speed scale better with large input sizes.

...read moreread less

Abstract: Code completion is an integral part of modern Integrated Development Environments (IDEs). Developers often use it to explore Application Programming Interfaces (APIs). It is also useful to reduce the required amount of typing and to help avoid typos. Traditional code completion systems propose all type-correct methods to the developer. Such a list is often very long with many irrelevant items. More intelligent code completion systems have been proposed in prior work to reduce the list of proposed methods to relevant items.This work extends one of these existing approaches, the Best Matching Neighbor (BMN) algorithm. We introduce Bayesian networks as an alternative underlying model, use additional context information for more precise recommendations, and apply clustering techniques to improve model sizes. We compare our new approach, Pattern-based Bayesian Networks (PBN), to the existing BMN algorithm. We extend previously used evaluation methodologies and, in addition to prediction quality, we also evaluate model size and inference speed.Our results show that the additional context information we collect improves prediction quality, especially for queries that do not contain method calls. We also show that PBN can obtain comparable prediction quality to BMN, while model size and inference speed scale better with large input sizes.

...read moreread less

83 citations

Posted Content•

Code Completion by Modeling Flattened Abstract Syntax Trees as Graphs

[...]

Yanlin Wang¹, Hui Li²•Institutions (2)

Microsoft¹, Xiamen University²

17 Mar 2021-arXiv: Software Engineering

TL;DR: CCAG as mentioned in this paper models the flattened sequence of a partial AST as an AST graph and uses graph attention block to capture different dependencies in the AST graph for representation learning in code completion.

...read moreread less

Abstract: Code completion has become an essential component of integrated development environments. Contemporary code completion methods rely on the abstract syntax tree (AST) to generate syntactically correct code. However, they cannot fully capture the sequential and repetitive patterns of writing code and the structural information of the AST. To alleviate these problems, we propose a new code completion approach named CCAG, which models the flattened sequence of a partial AST as an AST graph. CCAG uses our proposed AST Graph Attention Block to capture different dependencies in the AST graph for representation learning in code completion. The sub-tasks of code completion are optimized via multi-task learning in CCAG, and the task balance is automatically achieved using uncertainty without the need to tune task weights. The experimental results show that CCAG has superior performance than state-of-the-art approaches and it is able to provide intelligent code completion.

...read moreread less

52 citations

Proceedings Article•10.1109/SCAM.2016.22•

Collective Intelligence for Smarter API Recommendations in Python

[...]

Andrea Renika D'Souza¹, Di Yang², Cristina V. Lopes²•Institutions (2)

National Institute of Technology, Karnataka¹, University of California, Irvine²

1 Oct 2016

TL;DR: PyReco is presented, an intelligent code completion system for Python which uses the mined API usages from open source repositories to order the results based on relevance rather than the conventional alphabetic order, and outperforms the alphabetically ordered API recommendation systems.

...read moreread less

Abstract: Software developers use Application Programming Interfaces (APIs) of libraries and frameworks extensively while writing programs. In this context, the recommendations provided in code completion pop-ups help developers choose the desired methods. The candidate lists recommended by these tools, however, tend to be large, ordered alphabetically and sometimes even incomplete. A fair amount of work has been done recently to improve the relevance of these code completion results, especially for statically typed languages like Java. However, these proposed techniques rely on the static type of the object and are therefore inapplicable for a dynamically typed language like Python. In this paper, we present PyReco, an intelligent code completion system for Python which uses the mined API usages from open source repositories to order the results based on relevance rather than the conventional alphabetic order. To recommend suggestions that are relevant for a working context, a nearest neighbor classifier is used to identify the best matching usage among all the extracted usage patterns. To evaluate the effectiveness of our system, the code completion queries are automatically extracted from projects and tested quantitatively using a ten-fold cross validation technique. The evaluation shows that our approach outperforms the alphabetically ordered API recommendation systems in recommending APIs for standard, as well as, third-party libraries.

...read moreread less

21 citations

Topic Tools

Papers

Learning from examples to improve code completion systems

Code Completion with Neural Attention and Pointer Networks.

Intelligent Code Completion with Bayesian Networks

Code Completion by Modeling Flattened Abstract Syntax Trees as Graphs

Collective Intelligence for Smarter API Recommendations in Python

Related Topics (5)

Performance Metrics

No. of papers in the topic in previous years
Year	Papers
2021	4
2019	1
2017	1
2016	4
2015	2
2009	1