Challenging the "embarrassingly sequential": parallelizing finite state machine-based computations through principled speculation

doi:10.1145/2541940.2541989

Proceedings Article10.1145/2541940.2541989

Challenging the "embarrassingly sequential": parallelizing finite state machine-based computations through principled speculation

Zhijia Zhao, +2 more

- 24 Feb 2014

- Vol. 42, Iss: 1, pp 543-558

41

TL;DR: This paper offers the first disciplined way to exploit application-specific information to inform speculations for parallelization, and presents a probabilistic model that captures the relations between speculative executions and the properties of the target FSM and its inputs.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Proceedings Article•10.1145/3173162.3173180

Tigr: Transforming Irregular Graphs for GPU-Friendly Graph Processing

Amir Hossein Nodehi Sabet, +2 more

- 19 Mar 2018

TL;DR: Inspired by the question, Tigr is introduced -- a graph transformation framework that can effectively reduce the irregularity of real-world graphs with correctness guarantees for a wide range of graph analytics.

...read moreread less

94

•Journal Article•10.1145/2938369

A Survey on Thread-Level Speculation Techniques

Alvaro Estebanez, +2 more

- 30 Jun 2016

- ACM Computing Surveys

TL;DR: This work introduces the technique, presents a taxonomy of TLS solutions, and summarizes and put into perspective the most relevant advances in this field.

...read moreread less

35

Proceedings Article•10.1145/3018743.3018772

Grammar-aware Parallelization for Scalable XPath Querying

Lin Jiang, +1 more

- 26 Jan 2017

TL;DR: GAP leverages static analysis to infer feasible execution paths for specific con- texts based on the grammar of the semi-structured data and reduces the execution paths from all paths to a minimum, therefore maximizing the parallelization efficiency and scalability.

...read moreread less

12

Journal Article•10.2200/S01109ED1V01Y202106CAC057

In-/Near-Memory Computing

Daichi Fujiki, +3 more

- 12 Aug 2021

- Synthesis Lectures on Computer Architect...

TL;DR: This book provides a structured introduction of the key concepts and techniques that enable in-/near-memory computing.

...read moreread less

11

•Proceedings Article•10.1145/2660193.2660229

Space-efficient multi-versioning for input-adaptive feedback-driven program optimizations

Mingzhou Zhou, +3 more

- 15 Oct 2014

TL;DR: This study proves selecting the best set of versions under a space constraint is NP-complete and proposes a heuristic algorithm named CHoGS which yields near optimal results in quadratic time.

...read moreread less

11

...

Expand

References

•Book

Compilers: Principles, Techniques, and Tools

Alfred V. Aho, +2 more

- 01 Jan 1986

TL;DR: This book discusses the design of a Code Generator, the role of the Lexical Analyzer, and other topics related to code generation and optimization.

...read moreread less

9.7K

Proceedings Article•10.1145/165123.165164

Transactional memory: architectural support for lock-free data structures

Maurice Herlihy, +1 more

- 01 May 1993

TL;DR: Simulation results show that transactional memory matches or outperforms the best known locking techniques for simple benchmarks, even in the absence of priority inversion, convoying, and deadlock.

...read moreread less

2.5K

The Landscape of Parallel Computing Research: A View from Berkeley

Krste Asanovic, +10 more

- 18 Dec 2006

TL;DR: The parallel landscape is frame with seven questions, and the following are recommended to explore the design space rapidly: • The overarching goal should be to make it easy to write programs that execute efficiently on highly parallel computing systems • The target should be 1000s of cores per chip, as these chips are built from processing elements that are the most efficient in MIPS (Million Instructions per Second) per watt, MIPS per area of silicon, and MIPS each development dollar.

...read moreread less

2.4K

Proceedings Article•10.1145/1094811.1094852

X10: an object-oriented approach to non-uniform cluster computing

Philippe Charles, +7 more

- 12 Oct 2005

TL;DR: A modern object-oriented programming language, X10, is designed for high performance, high productivity programming of NUCC systems and an overview of the X10 programming model and language, experience with the reference implementation, and results from some initial productivity comparisons between the X 10 and Java™ languages are presented.

...read moreread less

1.5K

•Proceedings Article•10.1145/277650.277725

The implementation of the Cilk-5 multithreaded language

Matteo Frigo, +2 more

- 01 May 1998

TL;DR: Cilk-5's novel "two-clone" compilation strategy and its Dijkstra-like mutual-exclusion protocol for implementing the ready deque in the work-stealing scheduler are presented.

...read moreread less

1.4K

...

Expand

Challenging the "embarrassingly sequential": parallelizing finite state machine-based computations through principled speculation

Chat with Paper

AI Agents for this Paper

Citations

Tigr: Transforming Irregular Graphs for GPU-Friendly Graph Processing

A Survey on Thread-Level Speculation Techniques

Grammar-aware Parallelization for Scalable XPath Querying

In-/Near-Memory Computing

Space-efficient multi-versioning for input-adaptive feedback-driven program optimizations

References

Compilers: Principles, Techniques, and Tools

Transactional memory: architectural support for lock-free data structures

The Landscape of Parallel Computing Research: A View from Berkeley

X10: an object-oriented approach to non-uniform cluster computing

The implementation of the Cilk-5 multithreaded language

Related Papers (5)

On-the-Fly Principled Speculation for FSM Parallelization

An Efficient and Scalable Semiconductor Architecture for Parallel Automata Processing

Parallel Prefix Computation

Enabling scalability-sensitive speculative parallelization for FSM computations

Safe programmable speculative parallelism