Bridging (programming)

Topic Tools

Papers published on a yearly basis

Papers

Posted Content•

Julia: A Fresh Approach to Numerical Computing

[...]

Jeff Bezanson, Alan Edelman, Stefan Karpinski, Viral B. Shah

06 Nov 2014-arXiv: Mathematical Software

TL;DR: The Julia programming language as discussed by the authors combines expertise from the diverse fields of computer science and computational science to create a new approach to numerical computing, which is designed to be easy and fast.

...read moreread less

Abstract: Bridging cultures that have often been distant, Julia combines expertise from the diverse fields of computer science and computational science to create a new approach to numerical computing. Julia is designed to be easy and fast. Julia questions notions generally held as "laws of nature" by practitioners of numerical computing: 1. High-level dynamic programs have to be slow. 2. One must prototype in one language and then rewrite in another language for speed or deployment, and 3. There are parts of a system for the programmer, and other parts best left untouched as they are built by the experts. We introduce the Julia programming language and its design --- a dance between specialization and abstraction. Specialization allows for custom treatment. Multiple dispatch, a technique from computer science, picks the right algorithm for the right circumstance. Abstraction, what good computation is really about, recognizes what remains the same after differences are stripped away. Abstractions in mathematics are captured as code through another technique from computer science, generic programming. Julia shows that one can have machine performance without sacrificing human convenience.

...read moreread less

3,300 citations

Posted Content•

CodeSearchNet Challenge: Evaluating the State of Semantic Code Search.

[...]

Hamel Husain, Ho-Hsiang Wu, Tiferet Ahavah Gazit, Miltiadis Allamanis¹, Marc Brockschmidt¹ - Show less +1 more•Institutions (1)

Microsoft¹

20 Sep 2019-arXiv: Learning

TL;DR: The methodology used to obtain the corpus and expert labels, as well as a number of simple baseline solutions for the task are described.

...read moreread less

Abstract: Semantic code search is the task of retrieving relevant code given a natural language query. While related to other information retrieval tasks, it requires bridging the gap between the language used in code (often abbreviated and highly technical) and natural language more suitable to describe vague concepts and ideas. To enable evaluation of progress on code search, we are releasing the CodeSearchNet Corpus and are presenting the CodeSearchNet Challenge, which consists of 99 natural language queries with about 4k expert relevance annotations of likely results from CodeSearchNet Corpus. The corpus contains about 6 million functions from open-source code spanning six programming languages (Go, Java, JavaScript, PHP, Python, and Ruby). The CodeSearchNet Corpus also contains automatically generated query-like natural language for 2 million functions, obtained from mechanically scraping and preprocessing associated function documentation. In this article, we describe the methodology used to obtain the corpus and expert labels, as well as a number of simple baseline solutions for the task. We hope that CodeSearchNet Challenge encourages researchers and practitioners to study this interesting task further and will host a competition and leaderboard to track the progress on the challenge. We are also keen on extending CodeSearchNet Challenge to more queries and programming languages in the future.

...read moreread less

799 citations

Proceedings Article•10.1145/2884781.2884862•

From word embeddings to document similarities for improved information retrieval in software engineering

[...]

Xin Ye¹, Hui Shen¹, Xiao Ma¹, Razvan Bunescu¹, Chang Liu¹ - Show less +1 more•Institutions (1)

Ohio University¹

14 May 2016

TL;DR: This paper proposes bridging the lexical gap by projecting natural language statements and code snippets as meaning vectors in a shared representation space and shows that the learned vector space embeddings lead to improvements in a previously explored bug localization task and a newly introduced task of linking API documents to computer programming questions.

...read moreread less

Abstract: The application of information retrieval techniques to search tasks in software engineering is made difficult by the lexical gap between search queries, usually expressed in natural language (eg English), and retrieved documents, usually expressed in code (eg programming languages) This is often the case in bug and feature location, community question answering, or more generally the communication between technical personnel and non-technical stake holders in a software project In this paper, we propose bridging the lexical gap by projecting natural language statements and code snippets as meaning vectors in a shared representation space In the proposed architecture, word embeddings are first trained on API documents, tutorials, and reference documents, and then aggregated in order to estimate semantic similarities between documents Empirical evaluations show that the learned vector space embeddings lead to improvements in a previously explored bug localization task and a newly defined task of linking API documents to computer programming questions

...read moreread less

345 citations

Proceedings Article•10.1145/3238147.3238191•

API method recommendation without worrying about the task-API knowledge gap

[...]

Qiao Huang¹, Xin Xia², Zhenchang Xing³, David Lo⁴, Xinyu Wang¹ - Show less +1 more•Institutions (4)

Zhejiang University¹, Monash University², Australian National University³, Singapore Management University⁴

3 Sep 2018

TL;DR: An API recommendation approach named BIKER (Bi-Information source based KnowledgE Recommendation) to tackle the lexical gap and knowledge gap between the natural language description of the programming task and the API description in API documentation is proposed.

...read moreread less

Abstract: Developers often need to search for appropriate APIs for their programming tasks. Although most libraries have API reference documentation, it is not easy to find appropriate APIs due to the lexical gap and knowledge gap between the natural language description of the programming task and the API description in API documentation. Here, the lexical gap refers to the fact that the same semantic meaning can be expressed by different words, and the knowledge gap refers to the fact that API documentation mainly describes API functionality and structure but lacks other types of information like concepts and purposes, which are usually the key information in the task description. In this paper, we propose an API recommendation approach named BIKER (Bi-Information source based KnowledgE Recommendation) to tackle these two gaps. To bridge the lexical gap, BIKER uses word embedding technique to calculate the similarity score between two text descriptions. Inspired by our survey findings that developers incorporate Stack Overflow posts and API documentation for bridging the knowledge gap, BIKER leverages Stack Overflow posts to extract candidate APIs for a program task, and ranks candidate APIs by considering the query’s similarity with both Stack Overflow posts and API documentation. It also summarizes supplementary information (e.g., API description, code examples in Stack Overflow posts) for each API to help developers select the APIs that are most relevant to their tasks. Our evaluation with 413 API-related questions confirms the effectiveness of BIKER for both class- and method-level API recommendation, compared with state-of-the-art baselines. Our user study with 28 Java developers further demonstrates the practicality of BIKER for API search.

...read moreread less

227 citations

Journal Issue•10.1002/ASI.V58:3•

Process-aware information systems: Bridging people and software through process technology: Book Reviews

[...]

Hongyan Ma¹•Institutions (1)

University of California, Los Angeles¹

01 Feb 2007-Journal of the Association for Information Science and Technology

TL;DR: An analogy is established between the syntagm and paradigm from Saussurean linguistics and the message and messages for selection from the information theory initiated by Claude Shannon, and its analytic value in understanding patterns of retrieval from full-text systems.

...read moreread less

Abstract: This article describes a unique educational project that was implemented in the undergraduate study of computer science in 2002. Nesna University College has been using the example of sexual abuse of children in case study teaching in social informatics, in order to create an environment for intrinsically motivated learning. The project also gave the students a unique opportunity to get involved both emotionally and practically in the field of social informatics. The project is run in cooperation with Save the Children Norway and the Norwegian National Crime Squad. Nesna University College has the only computer science program in the world that has sexual abuse of children as the main topic on the curriculum. The computer science students provide both the Save the Children Norway and the National Criminal Investigation Service with reports on various topics such as secure chat, camera phones and possible abuse, Freenet as a tool for sexual abuse, etc. This exceptional cooperation between higher education and public and private organizations in this field makes the project not only unique, but might also be a major factor in both the willingness of students to learn social informatics and their development of skills in the various topics of social informatics. © 2007 Wiley Periodicals, Inc.

...read moreread less

220 citations

...

Expand

No. of papers in the topic in previous years
Year	Papers
2021	8
2020	2
2019	18
2018	7
2017	19
2016	19

Topic Tools

Papers published on a yearly basis

Papers

Julia: A Fresh Approach to Numerical Computing

CodeSearchNet Challenge: Evaluating the State of Semantic Code Search.

From word embeddings to document similarities for improved information retrieval in software engineering

API method recommendation without worrying about the task-API knowledge gap

Process-aware information systems: Bridging people and software through process technology: Book Reviews

Performance Metrics