Advanced Synchronization Facility

Topic Tools

Papers

Proceedings Article•10.1145/1755913.1755918•

Evaluation of AMD's advanced synchronization facility within a complete transactional memory stack

[...]

Dave Christie¹, Jaewoong Chung¹, Stephan Diestelhorst¹, Michael P. Hohmuth¹, Martin T. Pohlack¹, Christof Fetzer², Martin Nowack², Torvald Riegel², Pascal Felber³, Patrick Marlier³, Etienne Rivière³ - Show less +7 more•Institutions (3)

Advanced Micro Devices¹, Dresden University of Technology², University of Neuchâtel³

13 Apr 2010

TL;DR: Measurements on a wide range of benchmarks indicate that the overheads traditionally associated with software transactional memories can be significantly reduced with the help of ASF.

...read moreread less

Abstract: AMD's Advanced Synchronization Facility (ASF) is an x86 instruction set extension proposal intended to simplify and speed up the synchronization of concurrent programs. In this paper, we report our experiences using ASF for implementing transactional memory. We have extended a C/C++ compiler to support language-level transactions and generate code that takes advantage of ASF. We use a software fallback mechanism for transactions that cannot be committed within ASF (e.g., because of hardware capacity limitations). Our evaluation uses a cycle-accurate x86 simulator that we have extended with ASF support. Building a complete ASF-based software stack allows us to evaluate the performance gains that a user-level program can obtain from ASF. Our measurements on a wide range of benchmarks indicate that the overheads traditionally associated with software transactional memories can be significantly reduced with the help of ASF.

...read moreread less

139 citations

Proceedings Article•10.1109/MICRO.2010.40•

ASF: AMD64 Extension for Lock-Free Data Structures and Transactional Memory

[...]

Jaewoong Chung¹, Luke Yen¹, Stephan Diestelhorst¹, Martin T. Pohlack¹, Michael P. Hohmuth¹, David S. Christie¹, Dan Grossman² - Show less +3 more•Institutions (2)

Advanced Micro Devices¹, University of Washington²

4 Dec 2010

TL;DR: An out-of-order hardware design to implement ASF on a future AMD processor is developed and the experimental results show that the combined use of the L1 cache and the LS unit is very helpful for the performance robustness of ASF-based lock free data structures, and that the selective use of speculative accesses enables transactional programs to scale with limited ASF hardware resources.

...read moreread less

Abstract: Advanced Synchronization Facility (ASF) is an AMD64 hardware extension for lock-free data structures and transactional memory. It provides a speculative region that atomically executes speculative accesses in the region. Five new instructions are added to demarcate the region, use speculative accesses selectively, and control the speculative hardware context. Programmers can use speculative regions to build flexible multi-word atomic primitives with no additional software support by relying on the minimum guarantee of available ASF hardware resources for lock-free programming. Transactional programs with high-level TM language constructs can either be compiled directly to the ASF code or be linked to software TM systems that use ASF to accelerate transactional execution. In this paper we develop an out-of-order hardware design to implement ASF on a future AMD processor and evaluate it with an in-house simulator. The experimental results show that the combined use of the L1 cache and the LS unit is very helpful for the performance robustness of ASF-based lock free data structures, and that the selective use of speculative accesses enables transactional programs to scale with limited ASF hardware resources.

...read moreread less

78 citations

Proceedings Article•10.1145/1989493.1989501•

Optimizing hybrid transactional memory: the importance of nonspeculative operations

[...]

Torvald Riegel¹, Patrick Marlier, Martin Nowack¹, Pascal Felber, Christof Fetzer¹ - Show less +1 more•Institutions (1)

Dresden University of Technology¹

4 Jun 2011

TL;DR: Several new hybrid TM algorithms are presented that can execute HTM and STM transactions concurrently and can thus provide good performance over a large spectrum of workloads and are evaluated based on AMD's Advanced Synchronization Facility.

...read moreread less

Abstract: Transactional memory (TM) is a speculative shared-memory synchronization mechanism used to speed up concurrent programs. Most current TM implementations are software-based (STM) and incur noticeable overheads for each transactional memory access. Hardware TM proposals (HTM) address this issue but typically suffer from other restrictions such as limits on the number of data locations that can be accessed in a transaction.In this paper, we present several new hybrid TM algorithms that can execute HTM and STM transactions concurrently and can thus provide good performance over a large spectrum of workloads. The algorithms exploit the ability of some HTMs to have both speculative and nonspeculative (nontransactional) memory accesses within a transaction to decrease the transactions' runtime overhead, abort rates, and hardware capacity requirements. We evaluate implementations of these algorithms based on AMD's Advanced Synchronization Facility, an x86 instruction set extension proposal that has been shown to provide a sound basis for HTM.

...read moreread less

77 citations

Hardware acceleration for lock-free data structures and software-transactional memory

[...]

Stephan Diestelhorst¹, Michael P. Hohmuth²•Institutions (2)

Dresden University of Technology¹, Advanced Micro Devices²

1 Jan 2008

TL;DR: An initial performance simulation and usability study of ASF’s application to a lock-free data structure (a singly linked list) and to accelerating a state-of-the-art STM system, TinySTM, indicate that ASF can significantly increase the throughput and scaling behavior of these workloads.

...read moreread less

Abstract: In this paper, we report on a new CPU-architecture extension proposal, named Advanced Synchronization Facility (ASF), which is geared toward accelerating and easing lock-free programming and software transactional memory (STM). We present an initial performance simulation and usability study of ASF’s application to a lock-free data structure (a singly linked list) and to accelerating a state-of-the-art STM system, TinySTM. Our results indicate that ASF can significantly increase the throughput and scaling behavior of these workloads: Single-thread performance increased by up to 15 %, and the factor of scaling to eight CPUs increased by up to 20 %.

...read moreread less

35 citations

Proceedings Article•10.1145/2312005.2312014•

Delegation and nesting in best-effort hardware transactional memory

[...]

Yujie Liu¹, Stephan Diestelhorst², Michael Spear¹•Institutions (2)

Lehigh University¹, Advanced Micro Devices²

25 Jun 2012

TL;DR: This paper exploits support for immediate non-transactional stores in the AMD Advanced Synchronization Facility to build a mechanism for communication among transactions, and explores which forms of nesting are possible, and identifies constraints on nesting that are a consequence of how BEHTM is designed.

...read moreread less

Abstract: The guiding design principle behind best-effort hardware transactional memory (BEHTM) is simplicity of implementation and verification. Only minimal modifications to the base processor architecture are allowed, thereby reducing the burden of verification and long-term support. In exchange, the hardware can support only relatively simple multiword atomic operations, and must fall back to a software run-time for any operation that exceeds the abilities of the hardware.This paper demonstrates that BEHTM simplicity does not prohibit advanced and complex transactional behaviors. We exploit support for immediate non-transactional stores in the AMD Advanced Synchronization Facility to build a mechanism for communication among transactions. While our system allows arbitrary communication patterns, we focus on a design point where each transaction communicates with a system-wide manager thread. The API for the manager thread allows BEHTM transactions to delegate unsafe operations (such as system calls) to helper threads, and also enables the creation of nested parallel transactions. This paper also explores which forms of nesting are possible, and identifies constraints on nesting that are a consequence of how BEHTM is designed.

...read moreread less

9 citations

Topic Tools

Papers

Evaluation of AMD's advanced synchronization facility within a complete transactional memory stack

ASF: AMD64 Extension for Lock-Free Data Structures and Transactional Memory

Optimizing hybrid transactional memory: the importance of nonspeculative operations

Hardware acceleration for lock-free data structures and software-transactional memory

Delegation and nesting in best-effort hardware transactional memory

Related Topics (5)

Performance Metrics