Dynamic compilation

Papers published on a yearly basis

Papers

Journal Article•10.1145/1064978.1065034•

Pin: building customized program analysis tools with dynamic instrumentation

Chi-Keung Luk¹, Robert Cohn¹, Robert Muth¹, Harish Patil¹, Artur Klauser¹, Geoff Lowney¹, Steven Wallace¹, Vijay Janapa Reddi², Kim Hazelwood¹•Institutions (2)

Intel¹, University of Colorado Boulder²

12 Jun 2005

TL;DR: Pin is a dynamic instrumentation system whose goals are easy-to-use, portable, transparent, and efficient instrumentation. To illustrate Pin's versatility, the paper describes two Pintools in daily use for analyzing production software.

Abstract: Robust and powerful software instrumentation tools are essential for program analysis tasks such as profiling, performance evaluation, and bug detection. To meet this need, we have developed a new instrumentation system called Pin. Our goals are to provide easy-to-use, portable, transparent, and efficient instrumentation. Instrumentation tools (called Pintools) are written in C/C++ using Pin's rich API. Pin follows the model of ATOM, allowing the tool writer to analyze an application at the instruction level without the need for detailed knowledge of the underlying instruction set. The API is designed to be architecture independent whenever possible, making Pintools source compatible across different architectures. However, a Pintool can access architecture-specific details when necessary. Instrumentation with Pin is mostly transparent as the application and Pintool observe the application's original, uninstrumented behavior. Pin uses dynamic compilation to instrument executables while they are running. For efficiency, Pin uses several techniques, including inlining, register re-allocation, liveness analysis, and instruction scheduling to optimize instrumentation. This fully automated approach delivers significantly better instrumentation performance than similar tools. For example, Pin is 3.3x faster than Valgrind and 2x faster than DynamoRIO for basic-block counting. To illustrate Pin's versatility, we describe two Pintools in daily use to analyze production software. Pin is publicly available for Linux platforms on four architectures: IA32 (32-bit x86), EM64T (64-bit x86), Itanium®, and ARM. In the ten months since Pin 2 was released in July 2004, there have been over 3000 downloads from its website.

4,491 citations
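
The Pin abstract above mentions basic-block counting performed by instrumenting code as it runs. The following is a toy sketch of that idea only, written in Python rather than against Pin's C/C++ API. The mini-program format, `PROGRAM`, `translate`, and `run_instrumented` are hypothetical names invented for this illustration: each block is "translated" the first time it is reached, a counting call is placed at its head, and the result is kept in a code cache for reuse.

```python
from collections import Counter

# Hypothetical toy program: label -> (statements, function choosing the next block).
PROGRAM = {
    "entry": (["i = 0", "total = 0"], lambda env: "loop"),
    "loop": (["total += i", "i += 1"],
             lambda env: "loop" if env["i"] < 5 else "exit"),
    "exit": ([], lambda env: None),
}


def run_instrumented(program, start="entry"):
    """Run the toy program, counting how often each basic block executes."""
    counts = Counter()   # analysis data gathered by the inserted calls
    code_cache = {}      # label -> instrumented block, translated once
    env = {}             # the toy program's variables

    def translate(label):
        # "Compile" the block on first use and put a counter update at its head.
        body, successor = program[label]
        compiled = compile("\n".join(body) or "pass", f"<block {label}>", "exec")

        def instrumented(env):
            counts[label] += 1          # inserted analysis call
            exec(compiled, {}, env)     # original block behavior
            return successor(env)

        return instrumented

    label = start
    while label is not None:
        if label not in code_cache:     # translate lazily, then reuse
            code_cache[label] = translate(label)
        label = code_cache[label](env)
    return counts, env


counts, env = run_instrumented(PROGRAM)
print(dict(counts), env["total"])
```

The program's own result is unchanged by the counting call, which is the sense of "transparent" this sketch tries to convey; a real instrumentation system must also preserve addresses, registers, and timing-sensitive behavior, none of which is modeled here.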

Proceedings Article•10.1145/1167473.1167488•

The DaCapo benchmarks: Java benchmarking development and analysis

Stephen M. Blackburn¹, Robin Garner¹, Chris Hoffmann², Asjad M. Khang², Kathryn S. McKinley³, Rotem Bentzur⁴, Amer Diwan⁵, Daniel Feinberg⁴, Daniel Frampton¹, Samuel Z. Guyer⁶, Martin Hirzel⁷, Antony L. Hosking⁸, Maria Jump³, Han Lee⁹, J. Eliot B. Moss², Aashish Phansalkar³, Darko Stefanovic⁴, Thomas VanDrunen¹⁰, Daniel von Dincklage⁵, Ben Wiedermann³•Institutions (10)

Australian National University¹, University of Massachusetts Amherst², University of Texas at Austin³, University of New Mexico⁴, University of Colorado Boulder⁵, Tufts University⁶, IBM⁷, Purdue University⁸, Intel⁹, Wheaton College (Illinois)¹⁰

16 Oct 2006

TL;DR: This paper recommends benchmarking selection and evaluation methodologies, and introduces the DaCapo benchmarks, a set of open source, client-side Java benchmarks that improve over SPEC Java in a variety of ways, including more complex code, richer object behaviors, and more demanding memory system requirements.

Abstract: Since benchmarks drive computer science research and industry product development, which ones we use and how we evaluate them are key questions for the community. Despite complex runtime tradeoffs due to dynamic compilation and garbage collection required for Java programs, many evaluations still use methodologies developed for C, C++, and Fortran. SPEC, the dominant purveyor of benchmarks, compounded this problem by institutionalizing these methodologies for their Java benchmark suite. This paper recommends benchmarking selection and evaluation methodologies, and introduces the DaCapo benchmarks, a set of open source, client-side Java benchmarks. We demonstrate that the complex interactions of (1) architecture, (2) compiler, (3) virtual machine, (4) memory management, and (5) application require more extensive evaluation than C, C++, and Fortran which stress (4) much less, and do not require (3). We use and introduce new value, time-series, and statistical metrics for static and dynamic properties such as code complexity, code size, heap composition, and pointer mutations. No benchmark suite is definitive, but these metrics show that DaCapo improves over SPEC Java in a variety of ways, including more complex code, richer object behaviors, and more demanding memory system requirements. This paper takes a step towards improving methodologies for choosing and evaluating benchmarks to foster innovation in system design and implementation for Java and other managed languages.

1,686 citations

Proceedings Article•10.1145/800017.800542•

Efficient implementation of the Smalltalk-80 system

L. Peter Deutsch¹, Allan M. Schiffman²•Institutions (2)

PARC¹, Fairchild Semiconductor International, Inc.²

15 Jan 1984

TL;DR: This paper discusses the most significant optimization techniques developed for a new implementation of the Smalltalk-80 programming system, many of which are applicable to other languages.

Abstract: The Smalltalk-80 programming language includes dynamic storage allocation, full upward funargs, and universally polymorphic procedures; the Smalltalk-80 programming system features interactive execution with incremental compilation, and implementation portability. These features of modern programming systems are among the most difficult to implement efficiently, even individually. A new implementation of the Smalltalk-80 system, hosted on a small microprocessor-based computer, achieves high performance while retaining complete (object code) compatibility with existing implementations. This paper discusses the most significant optimization techniques developed over the course of the project, many of which are applicable to other languages. The key idea is to represent certain runtime state (both code and data) in more than one form, and to convert between forms when needed.

659 citations
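
The key idea stated in the Smalltalk-80 abstract above, keeping runtime state in more than one form and converting on demand, can be sketched for code as follows. This is a minimal illustration, not the paper's implementation: the tiny stack bytecode, the `Method` class, and the `translate`, `call`, and `flush` helpers are all hypothetical, and a Python closure stands in for generated machine code.

```python
class Method:
    """A method held in two forms: portable bytecode and a cached translation."""

    def __init__(self, bytecode):
        self.bytecode = bytecode   # canonical, portable form
        self.translated = None     # derived fast form, built only when needed
        self.translations = 0      # how many times the conversion has run


def translate(bytecode):
    """Convert stack bytecode into a Python closure (stand-in for native code)."""
    steps = []
    for op, arg in bytecode:
        if op == "push":
            steps.append(lambda stack, args, v=arg: stack.append(v))
        elif op == "arg":
            steps.append(lambda stack, args, i=arg: stack.append(args[i]))
        elif op == "add":
            steps.append(lambda stack, args: stack.append(stack.pop() + stack.pop()))
        elif op == "mul":
            steps.append(lambda stack, args: stack.append(stack.pop() * stack.pop()))
        else:
            raise ValueError(f"unknown opcode: {op}")

    def run(*args):
        stack = []
        for step in steps:
            step(stack, args)
        return stack.pop()

    return run


def call(method, *args):
    """Invoke a method, translating its bytecode on first use."""
    if method.translated is None:
        method.translated = translate(method.bytecode)
        method.translations += 1
    return method.translated(*args)


def flush(method):
    """Discard the derived form; the bytecode remains the source of truth."""
    method.translated = None


# (a + b) * 2 expressed in the toy bytecode.
double_sum = Method([("arg", 0), ("arg", 1), ("add", None),
                     ("push", 2), ("mul", None)])
print(call(double_sum, 3, 4), call(double_sum, 10, 1), double_sum.translations)
```

Because the translated form can always be rebuilt from the bytecode, it can be thrown away under memory pressure and regenerated later, which is the property `flush` is meant to show.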

Journal Article•10.1613/JAIR.989•

A knowledge compilation map

Adnan Darwiche¹, Pierre Marquis²•Institutions (2)

University of California, Los Angeles¹, Artois University²

01 Jul 2002 - Journal of Artificial Intelligence Research

TL;DR: In this article, the authors propose a perspective on knowledge compilation which calls for analyzing different compilation approaches according to two key dimensions: the succinctness of the target compilation language, and the class of queries and transformations that the language supports in polytime.

Abstract: We propose a perspective on knowledge compilation which calls for analyzing different compilation approaches according to two key dimensions: the succinctness of the target compilation language, and the class of queries and transformations that the language supports in polytime. We then provide a knowledge compilation map, which analyzes a large number of existing target compilation languages according to their succinctness and their polytime transformations and queries. We argue that such analysis is necessary for placing new compilation approaches within the context of existing ones. We also go beyond classical, flat target compilation languages based on CNF and DNF, and consider a richer, nested class based on directed acyclic graphs (such as OBDDs), which we show to include a relatively large number of target compilation languages.

617 citations
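
To make the trade-off in the knowledge compilation abstract above concrete, here is a small sketch that compiles a CNF formula into a reduced decision diagram in the style of an OBDD and then answers a model-counting query by visiting each distinct node once. It is an illustrative construction under simple assumptions, not an algorithm taken from the paper: `compile_obdd` and `count_models` are hypothetical names, literals follow the DIMACS convention (`-v` means "not v"), and `order` must list every variable that appears in the clauses.

```python
def compile_obdd(clauses, order):
    """Compile CNF clauses into a reduced decision diagram.

    Terminals are True/False; internal nodes are (var, low, high) tuples.
    """
    unique = {}   # (var, low, high) -> shared node
    cache = {}    # (level, remaining clauses) -> node

    def condition(cls, lit):
        # Assume `lit` is true: drop satisfied clauses, shrink the others.
        out = set()
        for clause in cls:
            if lit in clause:
                continue
            reduced = clause - {-lit}
            if not reduced:
                return None          # empty clause: contradiction
            out.add(reduced)
        return frozenset(out)

    def build(level, cls):
        if cls is None:
            return False
        if not cls:
            return True
        key = (level, cls)
        if key not in cache:
            var = order[level]
            low = build(level + 1, condition(cls, -var))
            high = build(level + 1, condition(cls, var))
            if low == high:          # redundant test: skip this variable
                cache[key] = low
            else:
                cache[key] = unique.setdefault((var, low, high), (var, low, high))
        return cache[key]

    return build(0, frozenset(frozenset(c) for c in clauses))


def count_models(root, order):
    """Count satisfying assignments over all variables in `order`."""
    index = {v: i for i, v in enumerate(order)}
    memo = {}

    def level_of(node):
        return len(order) if isinstance(node, bool) else index[node[0]]

    def count(node):
        # Models over the variables from this node's level to the end.
        if isinstance(node, bool):
            return 1 if node else 0
        if node not in memo:
            var, low, high = node
            lvl = index[var]
            # Variables skipped between a node and its child are free: factor 2 each.
            memo[node] = (count(low) * 2 ** (level_of(low) - lvl - 1)
                          + count(high) * 2 ** (level_of(high) - lvl - 1))
        return memo[node]

    return count(root) * 2 ** level_of(root)


# (x1 or x2) and (not x1 or x3)
root = compile_obdd([{1, 2}, {-1, 3}], order=[1, 2, 3])
print(root is not False, count_models(root, [1, 2, 3]))
```

The compilation step may take exponential time and space in the worst case, while the queries on the compiled form (satisfiability as `root is not False`, and model counting) are cheap. That asymmetry between a possibly large target representation and the queries it supports efficiently is what the map in the paper is organized around.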

Proceedings Article•10.1145/1669112.1669121•

Qilin: exploiting parallelism on heterogeneous multiprocessors with adaptive mapping

Chi-Keung Luk¹, Sunpyo Hong², Hyesoon Kim²•Institutions (2)

Intel¹, Georgia Institute of Technology²

12 Dec 2009

TL;DR: The paper proposes adaptive mapping, a fully automatic technique for mapping computations to processing elements on a CPU+GPU machine. By judiciously distributing work over the CPU and GPU, adaptive mapping achieves, on average, a 25% reduction in execution time and a 20% reduction in energy consumption relative to static mappings for a set of important computation benchmarks.

Abstract: Heterogeneous multiprocessors are increasingly important in the multi-core era due to their potential for high performance and energy efficiency. In order for software to fully realize this potential, the step that maps computations to processing elements must be as automated as possible. However, the state-of-the-art approach is to rely on the programmer to specify this mapping manually and statically. This approach is not only labor intensive but also not adaptable to changes in runtime environments like problem sizes and hardware/software configurations. In this study, we propose adaptive mapping, a fully automatic technique to map computations to processing elements on a CPU+GPU machine. We have implemented it in our experimental heterogeneous programming system called Qilin. Our results show that, by judiciously distributing works over the CPU and GPU, automatic adaptive mapping achieves a 25% reduction in execution time and a 20% reduction in energy consumption than static mappings on average for a set of important computation benchmarks. We also demonstrate that our technique is able to adapt to changes in the input problem size and system configuration.

585 citations
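
One simple way to picture the adaptive mapping described in the Qilin abstract above is to fit a per-device execution-time model from a few training runs and then pick the CPU/GPU work split that minimizes the predicted finishing time. The sketch below rests on assumptions that are not stated in the abstract: execution time is modeled as linear in problem size, a device given no work costs nothing, and the names `fit_line`, `predict`, and `choose_split` as well as all timing numbers are made up for illustration.

```python
def fit_line(samples):
    """Least-squares fit of time = intercept + slope * size."""
    n = len(samples)
    mean_x = sum(x for x, _ in samples) / n
    mean_y = sum(y for _, y in samples) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in samples)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in samples)
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope


def predict(model, items):
    """Predicted time for `items` work items; an idle device costs nothing."""
    intercept, slope = model
    return 0.0 if items == 0 else intercept + slope * items


def choose_split(cpu_model, gpu_model, n):
    """Return the fraction of n items to give the CPU (the rest go to the GPU).

    Assumes n > 0 and positive slopes for both models.
    """
    (a_c, b_c), (a_g, b_g) = cpu_model, gpu_model
    # Point where both devices are predicted to finish at the same time.
    crossing = (a_g - a_c + b_g * n) / ((b_c + b_g) * n)
    candidates = sorted({0.0, 1.0, min(1.0, max(0.0, crossing))})

    def makespan(beta):
        cpu_items = round(beta * n)
        return max(predict(cpu_model, cpu_items),
                   predict(gpu_model, n - cpu_items))

    return min(candidates, key=makespan)


# Made-up training measurements: (problem size, seconds).
CPU_MODEL = fit_line([(1000, 5.0), (2000, 9.0), (4000, 17.0)])
GPU_MODEL = fit_line([(1000, 6.0), (2000, 7.0), (4000, 9.0)])

print(choose_split(CPU_MODEL, GPU_MODEL, 10000))  # large input: work is shared
print(choose_split(CPU_MODEL, GPU_MODEL, 100))    # small input: GPU start-up cost dominates
```

With these invented numbers the GPU has a high fixed cost and a low per-item cost, so the chosen split shifts with the input size. This mirrors the abstract's point that a static mapping cannot adapt to changes in problem size or system configuration.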

Year	Papers
2025	1
2022	2
2021	5
2020	6
2019	10
2018	10

Related Topics (5)

Performance Metrics