Efficient deterministic multithreading without global barriers

doi:10.1145/2555243.2555252

Proceedings Article•10.1145/2555243.2555252

Kai Lu, +3 more

- 06 Feb 2014

- Vol. 49, Iss: 8, pp 287-300

62

TL;DR: This paper implements a deterministic multithreading (DMT) system based on an execution model called deterministic lazy release consistency (DLRC), which guarantees that programs execute deterministically even when they contain data races, and evaluates it on 16 parallel applications.

Citations

Proceedings Article•10.5555/2685048.2685061

Code-pointer integrity

Volodymyr Kuznetsov, +5 more

- 06 Oct 2014

TL;DR: This chapter describes code-pointer integrity (CPI), a new design point that guarantees the integrity of all code pointers in a program and thereby prevents all control-flow hijack attacks that exploit memory corruption errors, including attacks that bypass control-flow integrity mechanisms, such as control-flow bending.

508

Proceedings Article•10.1109/SP.2019.00021

SoK: The Challenges, Pitfalls, and Perils of Using Hardware Performance Counters for Security

Sanjeev Das, +4 more

- 19 May 2019

TL;DR: This paper reports a year-long effort to study best practices for obtaining accurate measurements of events using hardware performance counters (HPCs), to understand the challenges and pitfalls of using HPCs in various settings, and to explore ways to obtain consistent and accurate measurements across different settings and architectures. It also empirically evaluates how failing to account for various subtleties in the use of HPCs can undermine the effectiveness of security applications.

187

Proceedings Article•10.1109/HPCA.2016.7446070

LASER: Light, Accurate Sharing dEtection and Repair

Liang Luo, +6 more

- 12 Mar 2016

TL;DR: The Light, Accurate Sharing dEtection and Repair (LASER) system is presented, which leverages new performance counter capabilities available on Intel's Haswell architecture that identify the source of expensive cache coherence events.

35

Proceedings Article•10.1145/2908080.2908090

Remix: online detection and repair of cache contention for the JVM

Ariel Eizenberg, +3 more

- 02 Jun 2016

TL;DR: Remix is a modified version of the Oracle HotSpot JVM that detects cache contention bugs and repairs false sharing at runtime. It incurs no statistically significant performance overhead on benchmarks that do not exhibit cache contention, making it practical for always-on use.

33

Proceedings Article•10.1145/3064176.3064178

Taming Parallelism in a Multi-Variant Execution Environment

Stijn Volckaert, +5 more

- 23 Apr 2017

TL;DR: An MVEE-specific synchronization scheme is developed that lets us execute a set of multithreaded variants in lockstep without causing benign divergence, which makes MVEEs a viable defense for a far greater range of realistic workloads.

31

References

Proceedings Article•10.1145/223982.223990

The SPLASH-2 programs: characterization and methodological considerations

Steven Cameron Woo, +4 more

- 01 May 1995

TL;DR: This paper quantitatively characterizes the SPLASH-2 programs in terms of fundamental properties and architectural interactions that are important for understanding them well, including the computational load balance, communication-to-computation ratio and traffic needs, important working set sizes, and issues related to spatial locality.

4.1K

Proceedings Article•10.1145/1454115.1454128

The PARSEC benchmark suite: characterization and architectural implications

Christian Bienia, +3 more

- 25 Oct 2008

TL;DR: This paper presents and characterizes the Princeton Application Repository for Shared-Memory Computers (PARSEC), a benchmark suite for studies of Chip-Multiprocessors (CMPs), and shows that the benchmark suite covers a wide spectrum of working sets, locality, data sharing, synchronization and off-chip traffic.

3.8K

Proceedings Article•10.1109/HPCA.2007.346181

Evaluating MapReduce for Multi-core and Multiprocessor Systems

C. Ranger, +4 more

- 10 Feb 2007

TL;DR: It is established that, given a careful implementation, MapReduce is a promising model for scalable performance on shared-memory systems with simple parallel code.

1.1K

Journal Article•10.1109/MC.2006.180

The problem with threads

Edward A. Lee

- 01 May 2006

- IEEE Computer

TL;DR: For concurrent programming to become mainstream, threads must be discarded as a programming model, and nondeterminism should be judiciously and carefully introduced where needed, and it should be explicit in programs.

1K

Journal Article•10.1145/356989.357000

Hoard: a scalable memory allocator for multithreaded applications

Emery D. Berger, +3 more

- 12 Nov 2000

TL;DR: Hoard combines one global heap and per-processor heaps with a novel discipline that provably bounds memory consumption and has very low synchronization costs in the common case. The paper presents it as the first allocator to solve poor scalability, false sharing, and excessive memory consumption simultaneously.

540

Related Papers (5)

Dthreads: efficient deterministic multithreading

Kendo: efficient deterministic multithreading in software

CoreDet: a compiler and runtime system for deterministic multithreaded execution

DMP: deterministic shared memory multiprocessing

Parrot: a practical runtime for deterministic, stable, and reliable threads