Top 188 papers published in the topic of Program analysis in 2013

Showing papers on "Program analysis published in 2013"

Journal Article•10.13189/UJCMJ.2013.010301•

Axiomatic Basis for Computer Programming

[...]

Lauretta O. Osho, Francisca Ogwueleka, Oluwafemi Osho¹•Institutions (1)

1 Oct 2013

TL;DR: Using the axiomatic basis the completeness of two variants of integer multiplication program is proved and results show that computer programs can actually be verified sufficiently for correctness without necessarily testing them, or more practically put, to complement their testing.

...read moreread less

Abstract: This paper considers a formal method, known as axiomatic semantics, used to prove the correctness of a computer program. This formal method extracts, using some proof rules, the mathematical verification conditions from a computer program. The axioms of program flow, including, sequential flow, iteration, and alternation flows are presented. Using the axiomatic basis the completeness of two variants of integer multiplication program is proved. Results show that computer programs can actually be verified sufficiently for correctness without necessarily testing them, or more practically put, to complement their testing.

...read moreread less

1,152 citations

Proceedings Article•10.1109/FMCAD.2013.6679385•

Syntax-guided synthesis

[...]

Rajeev Alur¹, Rastislav Bodik², Garvit Juniwal², Milo M. K. Martin¹, Mukund Raghothaman¹, Sanjit A. Seshia², Rishabh Singh³, Armando Solar-Lezama³, Emina Torlak², Abhishek Udupa¹ - Show less +6 more•Institutions (3)

University of Pennsylvania¹, University of California, Berkeley², Massachusetts Institute of Technology³

11 Dec 2013

TL;DR: This work describes three different instantiations of the counter-example-guided-inductive-synthesis (CEGIS) strategy for solving the synthesis problem, reports on prototype implementations, and presents experimental results on an initial set of benchmarks.

...read moreread less

Abstract: The classical formulation of the program-synthesis problem is to find a program that meets a correctness specification given as a logical formula. Recent work on program synthesis and program optimization illustrates many potential benefits of allowing the user to supplement the logical specification with a syntactic template that constrains the space of allowed implementations. Our goal is to identify the core computational problem common to these proposals in a logical framework. The input to the syntax-guided synthesis problem (SyGuS) consists of a background theory, a semantic correctness specification for the desired program given by a logical formula, and a syntactic set of candidate implementations given by a grammar. The computational problem then is to find an implementation from the set of candidate expressions so that it satisfies the specification in the given theory. We describe three different instantiations of the counter-example-guided-inductive-synthesis (CEGIS) strategy for solving the synthesis problem, report on prototype implementations, and present experimental results on an initial set of benchmarks.

...read moreread less

541 citations

Proceedings Article•10.1109/ASE.2013.6693094•

Leveraging program equivalence for adaptive program repair: models and first results

[...]

Westley Weimer¹, Zachary P. Fry¹, Stephanie Forrest²•Institutions (2)

University of Virginia¹, University of New Mexico²

11 Nov 2013

TL;DR: Generate-and-validate program repair is shown to be a dual of mutation testing, suggesting several possible cross-fertilizations and a novel deterministic repair algorithm that computes a patch quotient space with respect to an approximate semantic equivalence relation.

...read moreread less

Abstract: Software bugs remain a compelling problem. Automated program repair is a promising approach for reducing cost, and many methods have recently demonstrated positive results. However, success on any particular bug is variable, as is the cost to find a repair. This paper focuses on generate-and-validate repair methods that enumerate candidate repairs and use test cases to define correct behavior. We formalize repair cost in terms of test executions, which dominate most test-based repair algorithms. Insights from this model lead to a novel deterministic repair algorithm that computes a patch quotient space with respect to an approximate semantic equivalence relation. This allows syntactic and dataflow analysis techniques to dramatically reduce the repair search space. Generate-and-validate program repair is shown to be a dual of mutation testing, suggesting several possible cross-fertilizations. Evaluating on 105 real-world bugs in programs totaling 5MLOC and involving 10,000 tests, our new algorithm requires an order-of-magnitude fewer test evaluations than the previous state-of-the-art and is over three times more efficient monetarily.

...read moreread less

322 citations

Proceedings Article•10.5555/2486788.2486801•

Estimating mobile application energy consumption using program analysis

[...]

Shuai Hao¹, Ding Li¹, William G. J. Halfond¹, Ramesh Govindan¹•Institutions (1)

University of Southern California¹

18 May 2013

TL;DR: This paper proposes a new approach that is both lightweight in terms of its developer requirements and provides fine-grained estimates of energy consumption at the code level using a novel combination of program analysis and per-instruction energy modeling.

...read moreread less

Abstract: Optimizing the energy efficiency of mobile applications can greatly increase user satisfaction. However, developers lack viable techniques for estimating the energy consumption of their applications. This paper proposes a new approach that is both lightweight in terms of its developer requirements and provides fine-grained estimates of energy consumption at the code level. It achieves this using a novel combination of program analysis and per-instruction energy modeling. In evaluation, our approach is able to estimate energy consumption to within 10% of the ground truth for a set of mobile applications from the Google Play store. Additionally, it provides useful and meaningful feedback to developers that helps them to understand application energy consumption behavior.

...read moreread less

314 citations

Proceedings Article•10.1145/2503210.2503296•

Precimonious: tuning assistant for floating-point precision

[...]

Cindy Rubio-González¹, Cuong Nguyen¹, Hong Diep Nguyen¹, James Demmel¹, William Kahan¹, Koushik Sen¹, David H. Bailey², Costin Iancu², David Hough³ - Show less +5 more•Institutions (3)

University of California, Berkeley¹, Lawrence Berkeley National Laboratory², Oracle Corporation³

17 Nov 2013

TL;DR: Premonious is a dynamic program analysis tool to assist developers in tuning the precision of floating-point programs and recommends a type instantiation that uses lower precision while producing an accurate enough answer without causing exceptions.

...read moreread less

Abstract: Given the variety of numerical errors that can occur, floating-point programs are difficult to write, test and debug. One common practice employed by developers without an advanced background in numerical analysis is using the highest available precision. While more robust, this can degrade program performance significantly. In this paper we present Precimonious, a dynamic program analysis tool to assist developers in tuning the precision of floating-point programs. Precimonious performs a search on the types of the floating-point program variables trying to lower their precision subject to accuracy constraints and performance goals. Our tool recommends a type instantiation that uses lower precision while producing an accurate enough answer without causing exceptions. We evaluate Precimonious on several widely used functions from the GNU Scientific Library, two NAS Parallel Benchmarks, and three other numerical programs. For most of the programs analyzed, Precimonious reduces precision, which results in performance improvements as high as 41%.

...read moreread less

310 citations

Proceedings Article•

Dowsing for overflows: a guided fuzzer to find buffer boundary violations

[...]

Istvan Haller¹, Asia Slowinska¹, Matthias Neugschwandtner², Herbert Bos¹•Institutions (2)

VU University Amsterdam¹, Vienna University of Technology²

14 Aug 2013

TL;DR: Dowser is a 'guided' fuzzer that combines taint tracking, program analysis and symbolic execution to find buffer overflow and underflow vulnerabilities buried deep in a program's logic.

...read moreread less

Abstract: Dowser is a 'guided' fuzzer that combines taint tracking, program analysis and symbolic execution to find buffer overflow and underflow vulnerabilities buried deep in a program's logic. The key idea is that analysis of a program lets us pinpoint the right areas in the program code to probe and the appropriate inputs to do so. Intuitively, for typical buffer overflows, we need consider only the code that accesses an array in a loop, rather than all possible instructions in the program. After finding all such candidate sets of instructions, we rank them according to an estimation of how likely they are to contain interesting vulnerabilities. We then subject the most promising sets to further testing. Specifically, we first use taint analysis to determine which input bytes influence the array index and then execute the program symbolically, making only this set of inputs symbolic. By constantly steering the symbolic execution along branch outcomes most likely to lead to overflows, we were able to detect deep bugs in real programs (like the nginx webserver, the inspircd IRC server, and the ffmpeg videoplayer). Two of the bugs we found were previously undocumented buffer overflows in ffmpeg and the poppler PDF rendering library.

...read moreread less

245 citations

Journal Article•10.1007/S10009-012-0223-4•

Template-based program verification and program synthesis

[...]

Saurabh Srivastava¹, Sumit Gulwani², Jeffrey S. Foster³•Institutions (3)

University of California¹, Microsoft², University of Maryland, College Park³

01 Oct 2013-International Journal on Software Tools for Technology Transfer

TL;DR: This work shows how to reduce the template-based search problem to satisfiability solving, which permits the use of off-the-shelf solvers to efficiently explore the search space.

...read moreread less

Abstract: Program verification is the task of automatically generating proofs for a program’s compliance with a given specification. Program synthesis is the task of automatically generating a program that meets a given specification. Both program verification and program synthesis can be viewed as search problems, for proofs and programs, respectively. For these search problems, we present approaches based on user-provided insights in the form of templates. Templates are hints about the syntactic forms of the invariants and programs, and help guide the search for solutions. We show how to reduce the template-based search problem to satisfiability solving, which permits the use of off-the-shelf solvers to efficiently explore the search space. Template-based approaches have allowed us to verify and synthesize programs outside the abilities of previous verifiers and synthesizers. Our approach can verify and synthesize difficult algorithmic textbook programs (e.g., sorting and dynamic programming-based algorithms) and difficult arithmetic programs.

...read moreread less

114 citations

Proceedings Article•10.1145/2451116.2451131•

Using likely invariants for automated software fault localization

[...]

Swarup Kumar Sahoo¹, John Criswell¹, Chase Geigle¹, Vikram Adve¹•Institutions (1)

University of Illinois at Urbana–Champaign¹

16 Mar 2013

TL;DR: An automatic diagnosis technique for isolating the root cause(s) of software failures by using likely program invariants, automatically generated using correct inputs that are close to the fault-triggering input, to select a set of candidate program locations which are possible root causes.

...read moreread less

Abstract: We propose an automatic diagnosis technique for isolating the root cause(s) of software failures. We use likely program invariants, automatically generated using correct inputs that are close to the fault-triggering input, to select a set of candidate program locations which are possible root causes. We then trim the set of candidate root causes using software-implemented dynamic backwards slicing, plus two new filtering heuristics: dependence filtering, and filtering via multiple failing inputs that are also close to the failing input. Experimental results on reported software bugs of three large open-source servers show that we are able to narrow down the number of candidate bug locations to between 5 and 17 program expressions, even in programs that are hundreds of thousands of lines long.

...read moreread less

112 citations

Proceedings Article•10.5555/2486788.2486873•

Mining SQL injection and cross site scripting vulnerabilities using hybrid program analysis

[...]

Lwin Khin Shar¹, Hee Beng Kuan Tan¹, Lionel C. Briand²•Institutions (2)

Nanyang Technological University¹, University of Luxembourg²

18 May 2013

TL;DR: This paper presents prediction models that are based on both classification and clustering in order to predict vulnerabilities, working in the presence or absence of labeled training data, respectively.

...read moreread less

Abstract: In previous work, we proposed a set of static attributes that characterize input validation and input sanitization code patterns. We showed that some of the proposed static attributes are significant predictors of SQL injection and cross site scripting vulnerabilities. Static attributes have the advantage of reflecting general properties of a program. Yet, dynamic attributes collected from execution traces may reflect more specific code characteristics that are complementary to static attributes. Hence, to improve our initial work, in this paper, we propose the use of dynamic attributes to complement static attributes in vulnerability prediction. Furthermore, since existing work relies on supervised learning, it is dependent on the availability of training data labeled with known vulnerabilities. This paper presents prediction models that are based on both classification and clustering in order to predict vulnerabilities, working in the presence or absence of labeled training data, respectively. In our experiments across six applications, our new supervised vulnerability predictors based on hybrid (static and dynamic) attributes achieved, on average, 90% recall and 85% precision, that is a sharp increase in recall when compared to static analysis-based predictions. Though not nearly as accurate, our unsupervised predictors based on clustering achieved, on average, 76% recall and 39% precision, thus suggesting they can be useful in the absence of labeled training data.

...read moreread less

96 citations

Proceedings Article•10.1145/2483760.2483788•

Practical blended taint analysis for JavaScript

[...]

Shiyi Wei¹, Barbara G. Ryder¹•Institutions (1)

Virginia Tech¹

15 Jul 2013

TL;DR: Blended taint analysis is presented, an instantiation of the general-purpose analysis framework for JavaScript, to illustrate how a combined dynamic/static analysis approach can deal with dynamic features by collecting generated code and other information at runtime.

...read moreread less

Abstract: JavaScript is widely used in Web applications because of its flexibility and dynamic features. However, the latter pose challenges to static analyses aimed at finding security vulnerabilities, (e.g., taint analysis). We present blended taint analysis, an instantiation of our general-purpose analysis framework for JavaScript, to illustrate how a combined dynamic/static analysis approach can deal with dynamic features by collecting generated code and other information at runtime. In empirical comparisons with two pure static taint analyses, we show blended taint analysis to be both more scalable and precise on JavaScript benchmark codes extracted from 12 popular websites at alexa. Our results show that blended taint analysis discovered 13 unique violations in 6 of the websites. In contrast, each of the static analyses identified less than half of these violations. Moreover, given a reasonable time budget of 10 minutes, both static analyses encountered webpages they could not analyze, sometimes significantly many such pages. Case studies demonstrate the quality of the blended taint analysis solution in comparison to that of pure static analysis.

...read moreread less

87 citations

Book Chapter•10.1007/978-3-642-39799-8_3•

Multi-solver support in symbolic execution

[...]

Hristina Palikareva¹, Cristian Cadar¹•Institutions (1)

Imperial College London¹

13 Jul 2013

TL;DR: It is argued that symbolic execution tools can, and should, make use of multiple constraint solvers, and the importance of constraint caching and counterexample values on the (relative) performance of KLEE configured to use different SMT solvers is shown.

...read moreread less

Abstract: One of the main challenges of dynamic symbolic execution--an automated program analysis technique which has been successfully employed to test a variety of software--is constraint solving. A key decision in the design of a symbolic execution tool is the choice of a constraint solver. While different solvers have different strengths, for most queries, it is not possible to tell in advance which solver will perform better. In this paper, we argue that symbolic execution tools can, and should, make use of multiple constraint solvers. These solvers can be run competitively in parallel, with the symbolic execution engine using the result from the best-performing solver. We present empirical data obtained by running the symbolic execution engine KLEE on a set of real programs, and use it to highlight several important characteristics of the constraint solving queries generated during symbolic execution. In particular, we show the importance of constraint caching and counterexample values on the (relative) performance of KLEE configured to use different SMT solvers. We have implemented multi-solver support in KLEE, using the metaSMT framework, and explored how different state-of-the-art solvers compare on a large set of constraint-solving queries. We also report on our ongoing experience building a parallel portfolio solver in KLEE.

...read moreread less

Proceedings Article•10.1145/2485922.2485966•

Criticality stacks: identifying critical threads in parallel programs using synchronization behavior

[...]

Kristof Du Bois¹, Stijn Eyerman¹, Jennifer B. Sartor¹, Lieven Eeckhout¹•Institutions (1)

Ghent University¹

23 Jun 2013

TL;DR: This paper proposes a new metric for assessing thread criticality, which combines both how much time a thread is performing useful work and how many co-running threads are waiting and uses it to create criticality stacks that break total execution time into each thread's criticality component, allowing for easy visual analysis of parallel imbalance.

...read moreread less

Abstract: Analyzing multi-threaded programs is quite challenging, but is necessary to obtain good multicore performance while saving energy. Due to synchronization, certain threads make others wait, because they hold a lock or have yet to reach a barrier. We call these critical threads, i.e., threads whose performance is determinative of program performance as a whole. Identifying these threads can reveal numerous optimization opportunities, for the software developer and for hardware.In this paper, we propose a new metric for assessing thread criticality, which combines both how much time a thread is performing useful work and how many co-running threads are waiting. We show how thread criticality can be calculated online with modest hardware additions and with low overhead. We use our metric to create criticality stacks that break total execution time into each thread's criticality component, allowing for easy visual analysis of parallel imbalance.To validate our criticality metric, and demonstrate it is better than previous metrics, we scale the frequency of the most critical thread and show it achieves the largest performance improvement. We then demonstrate the broad applicability of criticality stacks by using them to perform three types of optimizations: (1) program analysis to remove parallel bottlenecks, (2) dynamically identifying the most critical thread and accelerating it using frequency scaling to improve performance, and (3) showing that accelerating only the most critical thread allows for targeted energy reduction.

...read moreread less

Book Chapter•10.1007/978-3-642-40885-4_24•

Term Rewriting with Logical Constraints

[...]

Cynthia Kop¹, Naoki Nishida²•Institutions (2)

University of Innsbruck¹, Nagoya University²

18 Sep 2013

TL;DR: This paper seeks to unify and generalise several definitions which combine rewriting with logical constraints, or with separate rules for integer functions, that have been proposed in recent works on program analysis.

...read moreread less

Abstract: In recent works on program analysis, transformations of various programming languages to term rewriting are used. In this setting, constraints appear naturally. Several definitions which combine rewriting with logical constraints, or with separate rules for integer functions, have been proposed. This paper seeks to unify and generalise these proposals.

...read moreread less

Book Chapter•10.1007/978-3-319-03542-0_7•

Separation Logic Modulo Theories

[...]

Juan Antonio Navarro Pérez¹, Andrey Rybalchenko²•Institutions (2)

University College London¹, Technische Universität München²

9 Dec 2013

TL;DR: In this article, the authors present an approach that integrates separation logic and existing SMT solving technology for reasoning about linked data structures on the heap and provide an efficient decision procedure for discharging verification conditions in program analysis and verification.

...read moreread less

Abstract: Logical reasoning about program behaviours often requires dealing with heap structures as well as scalar data types. Advances in Satisfiability Modulo Theories (SMT) offer efficient procedures for dealing with scalar values, yet they lack expressive support for dealing with heap structures. In this paper, we present an approach that integrates separation logica prominent logic for reasoning about linked data structures on the heapand existing SMT solving technology. Our model-based approach communicates heap aliasing information between theory and separation logic reasoning, providing an efficient decision procedure for discharging verification conditions in program analysis and verification.

...read moreread less

Patent•

Method and system for automated improvement of parallelism in program compilation

[...]

Loring G. Craymer

27 Jun 2013

TL;DR: In this paper, a method of program compilation to improve parallelism during the linking of the program by a compiler is presented. But this method requires the compiler to convert the program to canonical form and construct a traversable representation, such as an Abstract Syntax Tree (AST), for each procedure in the program.

...read moreread less

Abstract: A method of program compilation to improve parallelism during the linking of the program by a compiler. The method includes converting statements of the program to canonical form, constructing a traversable representation, such as an abstract syntax tree (AST), for each procedure in the program, and traversing the program to construct a graph by making each non-control flow statement and each control structure into at least one node of the graph.

...read moreread less

Coupling Profile and Historical Methods to Predict Execution Time of Parallel Applications

[...]

Thierry Monteil

1 Jul 2013

TL;DR: A hybrid method of time prediction that is both profile-based and historic-based is defined, which is achieved by combining a program structure analysis with an instance-based learning method.

...read moreread less

Abstract: This article describes some work in the domain of application execution time prediction, which is always necessary for schedulers. We define a hybrid method of time prediction that is both profile-based and historic-based. This prediction is achieved by combining a program structure analysis with an instance-based learning method. We demonstrate that taking account of an application's profile improves predictions compared with classical historic-based prediction methods.

...read moreread less

Proceedings Article•10.1145/2516760.2516769•

Sound and precise malware analysis for android via pushdown reachability and entry-point saturation

[...]

Shuying Liang¹, Andrew W. Keep¹, Matthew Might¹, Steven Lyde¹, Thomas Gilray¹, Petey Aldous¹, David Van Horn² - Show less +3 more•Institutions (2)

University of Utah¹, Northeastern University²

8 Nov 2013

TL;DR: Anadroid as discussed by the authors is a static malware analysis framework for Android apps that uses a pushdown system to precisely model dynamically dispatched interprocedural and exception-driven control-flow.

...read moreread less

Abstract: Sound malware analysis of Android applications is challenging. First, object-oriented programs exhibit highly interprocedural, dynamically dispatched control structure. Second, the Android programming paradigm relies heavily on the asynchronous execution of multiple entry points. Existing analysis techniques focus more on the second challenge, while relying on traditional analytic techniques that suffer from inherent imprecision or unsoundness to solve the first.We present Anadroid, a static malware analysis framework for Android apps. Anadroid exploits two techniques to soundly raise precision: (1) it uses a pushdown system to precisely model dynamically dispatched interprocedural and exception-driven control-flow; (2) it uses Entry-Point Saturation (EPS) to soundly approximate all possible interleavings of asynchronous entry points in Android applications. (It also integrates static taint-flow analysis and least permissions analysis to expand the class of malicious behaviors which it can catch.) Anadroid provides rich user interface support for human analysts which must ultimately rule on the "maliciousness" of a behavior.To demonstrate the effectiveness of Anadroid's malware analysis, we had teams of analysts analyze a challenge suite of 52 Android applications released as part of the Automated Program Analysis for Cybersecurity (APAC) DARPA program. The first team analyzed the apps using a version of Anadroid that uses traditional (finite-state-machine-based) control-flow-analysis found in existing malware analysis tools; the second team analyzed the apps using a version of Anadroid that uses our enhanced pushdown-based control-flow-analysis. We measured machine analysis time, human analyst time, and their accuracy in flagging malicious applications. With pushdown analysis, we found statistically significant (p

...read moreread less

Proceedings Article•10.5555/2555754.2555767•

On composing and proving the correctness of reactive behavior

[...]

David Harel¹, Amir Kantor¹, Guy Katz¹, Assaf Marron¹, Lior Mizrahi², Gera Weiss² - Show less +2 more•Institutions (2)

Weizmann Institute of Science¹, Ben-Gurion University of the Negev²

29 Sep 2013

TL;DR: It is shown how program verification can be automated and streamlined by combining properties of individual modules, specified and verified separately, with application-independent specifications both of the BP semantics and of general theories.

...read moreread less

Abstract: We present a method and a tool for composing a reactive system and for accompanying the development and documentation process with a proof of its correctness. The approach is based on behavioral programming (BP) and the Z3 SMT solver. We show how program verification can be automated and streamlined by combining properties of individual modules, specified and verified separately, with application-independent specifications both of the BP semantics and of general theories. The method may yield an exponential acceleration of the verification process when compared with model-checking the composite application. We show that formalization of properties of independent modules in preparation for the correctness proofs can be useful as documentation for future development. We view this work as a further step towards making formal correctness proofs standard practice in the development of reactive systems, and carried out by programmers at large.

...read moreread less

Book Chapter•10.1007/978-3-642-36742-7_45•

CPACHECKER with sequential combination of explicit-state analysis and predicate analysis

[...]

Philipp Wendler¹•Institutions (1)

University of Passau¹

16 Mar 2013

TL;DR: A CPAchecker configuration that uses a sequential combination of two approaches that starts with an explicit-state analysis, and, if no answer can be found within some time, switches to a predicate analysis with adjustable-block encoding and CEGAR.

...read moreread less

Abstract: CPAchecker is an open-source framework for software verification, based on the concepts of Configurable Program Analysis (CPA). We submit a CPAchecker configuration that uses a sequential combination of two approaches. It starts with an explicit-state analysis, and, if no answer can be found within some time, switches to a predicate analysis with adjustable-block encoding and CEGAR.

...read moreread less

Journal Article•10.1007/S10619-013-7130-X•

Automatic optimization of stream programs via source program operator graph transformations

[...]

Miyuru Dayarathna¹, Toyotaro Suzumura¹•Institutions (1)

Tokyo Institute of Technology¹

01 Dec 2013-Distributed and Parallel Databases

TL;DR: An automated process for stream program performance optimization that uses semantic preserving automatic code transformation to improve stream processing job performance and was able to identify a 31.1 times higher performance version of the CDR application within seven minutes time on a cluster of 4 nodes.

...read moreread less

Abstract: Distributed data stream processing is a data analysis paradigm where massive amounts of data produced by various sources are analyzed online within real-time constraints. Execution performance of a stream program/query executed on such middleware is largely dependent on the ability of the programmer to fine tune the program to match the topology of the stream processing system. However, manual fine tuning of a stream program is a very difficult, error prone process that demands huge amounts of programmer time and expertise which are expensive to obtain. We describe an automated process for stream program performance optimization that uses semantic preserving automatic code transformation to improve stream processing job performance. We first identify the structure of the input program and represent the program structure in a Directed Acyclic Graph. We transform the graph using the concepts of Tri-OP Transformation and Bi-Op Transformation. The resulting sample program space is pruned using both empirical as well as profiling information to obtain a ranked list of sample programs which have higher performance compared to their parent program. We successfully implemented this methodology on a prototype stream program performance optimization mechanism called Hirundo. The mechanism has been developed for optimizing SPADE programs which run on System S stream processing run-time. Using five real world applications (called VWAP, CDR, Twitter, Apnoea, and Bargain) we show the effectiveness of our approach. Hirundo was able to identify a 31.1 times higher performance version of the CDR application within seven minutes time on a cluster of 4 nodes.

...read moreread less

Book Chapter•10.1007/978-3-642-38856-9_3•

Abstract Interpretation over Non-lattice Abstract Domains

[...]

Graeme Gange¹, Jorge A. Navas¹, Peter Schachte¹, Harald Søndergaard¹, Peter J. Stuckey¹ - Show less +1 more•Institutions (1)

University of Melbourne¹

20 Jun 2013

TL;DR: The lack of domain structure has negative consequences, both for the precision of program analysis and for the termination of standard Kleene iteration.

...read moreread less

Abstract: The classical theoretical framework for static analysis of programs is abstract interpretation. Much of the power and elegance of that framework rests on the assumption that an abstract domain is a lattice. Nonetheless, and for good reason, the literature on program analysis provides many examples of non-lattice domains, including non-convex numeric domains. The lack of domain structure, however, has negative consequences, both for the precision of program analysis and for the termination of standard Kleene iteration. In this paper we explore these consequences and present general remedies.

...read moreread less

Proceedings Article•10.1109/HIPC.2013.6799106•

MIL: A language to build program analysis tools through static binary instrumentation

[...]

Andres S. Charif-Rubial, Denis Barthou¹, Cédric Valensi, Sameer Shende², Allen D. Malony², William Jalby - Show less +2 more•Institutions (2)

University of Bordeaux¹, University of Oregon²

18 Dec 2013

TL;DR: This paper proposes a language, MIL, for the development of program analysis tools based on static binary instrumentation to ease the integration of static, global program analysis with instrumentation and shows how this enables both a precise targeting of the code regions to analyze and a better understanding of the optimized program behavior.

...read moreread less

Abstract: As software complexity increases, the analysis of code behavior during its execution is becoming more important. Instrumentation techniques, through the insertion of code directly into binaries, are essential for program analyses used in debugging, runtime profiling, and performance evaluation. In the context of high-performance parallel applications, building an instrumentation framework is quite challenging. One of the difficulties is due to the necessity to capture both coarse-grain behavior, such as the execution time of different functions, as well as finer-grain actions, in order to pinpoint performance issues. In this paper, we propose a language, MIL, for the development of program analysis tools based on static binary instrumentation. The key feature of MIL is to ease the integration of static, global program analysis with instrumentation. We will show how this enables both a precise targeting of the code regions to analyze and a better understanding of the optimized program behavior.

...read moreread less

Proceedings Article•10.1145/2505879.2505884•

Eventual Linear Ranking Functions

[...]

Roberto Bagnara¹, Fred Mesnard²•Institutions (2)

University of Parma¹, University of La Réunion²

08 Jun 2013-arXiv: Programming Languages

TL;DR: In this article, an algorithm based on Farkas' Lemma for discovering linear ranking functions proving termination of a certain class of loops is presented. But it is not shown that the algorithm can be extended to prove the existence of an eventual linear ranking function after a finite unrolling of the loop.

...read moreread less

Abstract: Program termination is a hot research topic in program analysis. The last few years have witnessed the development of termination analyzers for programming languages such as C and Java with remarkable precision and performance. These systems are largely based on techniques and tools coming from the field of declarative constraint programming. In this paper, we first recall an algorithm based on Farkas' Lemma for discovering linear ranking functions proving termination of a certain class of loops. Then we propose an extension of this method for showing the existence of eventual linear ranking functions, i.e., linear functions that become ranking functions after a finite unrolling of the loop. We show correctness and completeness of this algorithm.

...read moreread less

Book•

Systematic Program Design: From Clarity to Efficiency

[...]

Yanhong A. Liu¹•Institutions (1)

State University of New York System¹

1 May 2013

TL;DR: A method that starts with a clear specification of a computation and derives an efficient implementation by step-wise program analysis and transformations is described, which applies to problems specified in imperative, database, functional, logic, and object-oriented programming languages with different data, control, and module abstractions.

...read moreread less

Abstract: A systematic program design method can help developers ensure the correctness and performance of programs while minimizing the development cost. This book describes a method that starts with a clear specification of a computation and derives an efficient implementation by step-wise program analysis and transformations. The method applies to problems specified in imperative, database, functional, logic, and object-oriented programming languages with different data, control, and module abstractions. Designed for courses or self-study, this book includes numerous exercises and examples that require minimal computer science background, making it accessible to novices. Experienced practitioners and researchers will appreciate the detailed examples in a wide range of application areas including hardware design, image processing, access control, query optimization, and program analysis. The last section of the book points out directions for future studies.

...read moreread less

Proceedings Article•10.5555/2555754.2555779•

DIME: time-aware dynamic binary instrumentation using rate-based resource allocation

[...]

Pansy Arafa¹, Hany Kashif¹, Sebastian Fischmeister¹•Institutions (1)

University of Waterloo¹

29 Sep 2013

TL;DR: DIME, a time-aware dynamic binary instrumentation technique that adds an adjustable bound on the timing overhead to the program under analysis, is proposed and implemented using the dynamic instrumentation framework, Pin.

...read moreread less

Abstract: Program analysis tools are essential for understanding programs, analyzing performance, and optimizing code. Some of these tools use code instrumentation to extract information at runtime. The instrumentation process can alter program behavior such as timing behavior and memory consumption. Time-sensitive programs, however, must meet specific timing constraints and thus require that the instrumentation process, for instance, bounds the timing overhead. Time-aware instrumentation techniques try to honor the timing constraints of such programs. All previous techniques, however, support only static source-code instrumentation methods. Hence, they become impractical beyond microcontroller code for instrumenting large programs along with all their library dependencies. In this work, we propose DIME, a time-aware dynamic binary instrumentation technique that adds an adjustable bound on the timing overhead to the program under analysis. We implement DIME using the dynamic instrumentation framework, Pin. Quantitative evaluation of the three implementation alternatives shows an average reduction of the instrumentation overhead by 12, 7, and 3 folds compared to native Pin. Instrumenting the VLC media player and a laser beam stabilization experiment demonstrate the practicality and scalability of DIME.

...read moreread less

Proceedings Article•10.1109/ICSESS.2013.6615286•

Security assurance with program path analysis and metamorphic testing

[...]

Guowei Dong, Tao Guo, Puhan Zhang

23 May 2013

TL;DR: The Path-Combination-Based MT method is provided, which mines the relationships among inputs that could execute different paths and their corresponding outputs based on the analysis of program structure, and then tests the program with these relationships.

...read moreread less

Abstract: The correctness of mission-critical software is an important part of information security and oracle problem[1] is often a great constraint for their testing. Metamorphic testing(MT) is practical for oracle problem, but calls for more executions and only focuses on program's mathematics properties in most situations. This article provides the Path-Combination-Based MT method, which mines the relationships among inputs that could execute different paths and their corresponding outputs based on the analysis of program structure, and then tests the program with these relationships. The experimental results prove its efficiency.

...read moreread less

Proceedings Article•10.1145/2509578.2509591•

Conversational programming: exploring interactive program analysis

[...]

Alexander Repenning¹•Institutions (1)

University of Colorado Boulder¹

29 Oct 2013

TL;DR: This paper describes conversational programming through design principles and use cases as a way to harness the computing power to inspect program meaning through a combination of partial program execution and semantic program annotation.

...read moreread less

Abstract: Our powerful computers help very little in debugging the program we have so we can change it into the program we want. We introduce Conversational Programming as a way to harness our computing power to inspect program meaning through a combination of partial program execution and semantic program annotation. A programmer in our approach interactively selects highly autonomous "agents" in a program world as conversation topics and then changes the world to explore the potential behaviors of a selected agent in different scenarios. In this way, the programmer proactively knows how their code affects program execution as they explore various contexts. This paper describes conversational programming through design principles and use cases.

...read moreread less

Book Chapter•10.1007/978-3-642-39634-2_16•

Program analysis and verification based on kleene algebra in Isabelle/HOL

[...]

Alasdair Armstrong¹, Georg Struth¹, Tjark Weber²•Institutions (2)

University of Sheffield¹, Uppsala University²

22 Jul 2013

TL;DR: This work formalises SKAT in Isabelle/HOL, using the quotient type package to reason equationally in this algebra and extends SKAT with assertion statements and derive the inference rules of Hoare logic.

...read moreread less

Abstract: Schematic Kleene algebra with tests (SKAT) supports the equational verification of flowchart scheme equivalence and captures simple while-programs with assignment statements. We formalise SKAT in Isabelle/HOL, using the quotient type package to reason equationally in this algebra. We apply this formalisation to a complex flowchart transformation proof from the literature. We extend SKAT with assertion statements and derive the inference rules of Hoare logic. We apply this extension in simple program verification examples and the derivation of additional Hoare-style rules. This shows that algebra can provide an abstract semantic layer from which different program analysis and verification tasks can be implemented in a simple lightweight way.

...read moreread less

Proceedings Article•10.1109/QSIC.2013.50•

Static Slicing for Python First-Class Objects

[...]

Zhaogui Xu¹, Ju Qian¹, Lin Chen¹, Zhifei Chen¹, Baowen Xu¹ - Show less +1 more•Institutions (1)

Nanjing University¹

29 Jul 2013

TL;DR: This paper proposes a static slicing method for Python first-class objects by adding all the definitions of first- class objects into the dependence model and uniformly constructing the program dependence graphs for all the functions, classes, methods, and modules.

...read moreread less

Abstract: Program slicing is an important program analysis technique and now has been used in many fields of software engineering. However, most existing program slicing methods focus on static programming languages such as C/C++ and Java, and methods on dynamic languages like Python are rarely seen. Python, a typical dynamic object-oriented language, has been more and more widely used now. In Python, everything is a first-class object, including functions, classes, methods, and modules. Existing slicing methods cannot handle the issue of these first-class objects. Therefore, this paper proposes a static slicing method for Python first-class objects. By adding all the definitions of first-class objects into the dependence model and uniformly constructing the program dependence graphs for all the functions, classes, methods, and modules, this method can effectively solve the slicing problems caused by arbitrary definitions and uses of first-class objects in Python.

...read moreread less

Book Chapter•10.1007/978-3-642-39742-4_13•

Predicting Regression Test Failures Using Genetic Algorithm-Selected Dynamic Performance Analysis Metrics

[...]

Michael Mayo¹, Simon Spacey¹•Institutions (1)

University of Waikato¹

24 Aug 2013

TL;DR: It is shown that if a program is modified and some tests subsequently fail, then it is possible to predict with considerable accuracy which of the remaining tests will also fail which can be used to help prioritise tests in time constrained testing environments.

...read moreread less

Abstract: A novel framework for predicting regression test failures is proposed. The basic principle embodied in the framework is to use performance analysis tools to capture the runtime behaviour of a program as it executes each test in a regression suite. The performance information is then used to build a dynamically predictive model of test outcomes. Our framework is evaluated using a genetic algorithm for dynamic metric selection in combination with state-of-the-art machine learning classifiers. We show that if a program is modified and some tests subsequently fail, then it is possible to predict with considerable accuracy which of the remaining tests will also fail which can be used to help prioritise tests in time constrained testing environments.

...read moreread less

...

Expand