Exploring processor parallelism: Estimation methods and optimization strategies
Roel Jordans,Rosilde Corvino,Lech Jozwiak,Henk Corporaal +3 more
- 08 Apr 2013
- Vol. 4, Iss: 2, pp 18-23
TL;DR: In this article, the issue-width of an application specific VLIW issue is automatically selected based on a force-based parallelism measure, which is capable of estimating the required issuewidth within 3% on average.
read more
Abstract: Former research on automatic exploration of ASIP architectures mostly focused on either the internal memory hierarchy, or the addition of complex custom operations to RISC based architectures. This paper focuses on VLIW architectures and, more specifically, on automating the selection of an application specific VLIW issue-width. An accurate and efficient issue-width estimation strongly influences all the important processor properties (e.g. processing speed, silicon area, and power consumption). We first compare different methods for estimating the required issue-width, and subsequently introduce a new force-based parallelism measure which is capable of estimating the required issue-width within 3% on average. Moreover, we show that we can quickly estimate the latency-parallelism Pareto-front of an example ECG application with less than 10% error using our issue-width estimations.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Instruction-set architecture exploration strategies for deeply clustered VLIW ASIPs
Roel Jordans,Rosilde Corvino,Lech Jozwiak,Henk Corporaal +3 more
- 15 Jun 2013
TL;DR: A two-phase method is proposed which can quickly explore many different architectures and is capable of automatically achieving a 50% improvement on the energy-delay product cost of an automatically generated architecture for an ECG detection application and a 1% energy- delay product cost improvement compared to a hand-crafted design.
12
•Proceedings Article
Heterogeneous MPSoC technology for modern cyber-physical systems
Lech Jozwiak
- 01 Jan 2014
TL;DR: This paper discusses the heterogeneous MPSoC technology needed to implement modern cyberphysical systems and exploits heterogeneous computation and communication resources involving application-specific instruction-set processors, hardware accelerators, distributed parallel memories and hierarchical communication structures.
7
BuildMaster: Efficient ASIP architecture exploration through compilation and simulation result caching
Roel Jordans,Erkan Diken,Lech Jozwiak,Henk Corporaal +3 more
- 23 Apr 2014
TL;DR: The Build-Master framework supports the design space exploration of application specific VLIW processors and offers automated caching of intermediate compilation and simulation results and can greatly help to shorten the exploration time.
A Probabilistic Approach for the System-Level Design of Multi-ASIP Platforms
Laura Micconi
- 01 Jan 2015
TL;DR: A Probabilistic Approach for the System-Level Design of Multi-ASIP Platforms Application Specific Instruction-set Processors (ASIPs) offer a good trade off between performance and flexibility when compared to general purpose processors or ASICs.
Instruction-set architecture synthesis for VLIW processors
Roel Jordans
- 01 Dec 2015
TL;DR: The focus for the system architecture exploration by PICO is on the trade-off between the area and timing, and the architecture template for the project was quite similar to those of the ASAM project, of which this dissertation is a part.
References
•Journal Article
[''R"--project for statistical computing].
TL;DR: An introduction to the R project for statistical computing (www.R-project.org) is presented to make the professional community aware of "R" as a potent and free software for graphical and statistical analysis of medical data.
3.4K
Software pipelining: an effective scheduling technique for VLIW machines
Monica S. Lam
- 01 Jun 1988
TL;DR: This paper shows that software pipelining is an effective and viable scheduling technique for VLIW processors, and proposes a hierarchical reduction scheme whereby entire control constructs are reduced to an object similar to an operation in a basic block.
Iterative module scheduling: an algorithm for software pipelining loops
B. Ramakrishna Rau
- 30 Nov 1994
TL;DR: This paper presents a practical algorithm, iterative modulo scheduling, that is capable of dealing with realistic machine models and characterizes the algorithm in terms of the quality of the generated schedules as well the computational expense incurred.
749
Limits of instruction-level parallelism
David W. Wall
- 01 Apr 1991
TL;DR: The results of simulations of 18 different test programs under 375 different models of available parallelism analysis are presented, showing how simulations based on instruction traces can model techniques at the limits of feasibility and even beyond.
•Book
Limits of instruction-level parallelism
David W. Wall
- 01 Mar 1995
TL;DR: In this paper, the authors present the results of simulations of 18 different test programs under 375 different models of available parallelism analysis, including branch prediction, register renaming and alias analysis.
592
Related Papers (5)
Hillery C. Hunter,Jaime H. Moreno +1 more
- 30 Oct 2003
Jesmin Jahan Tithi,Neal Crago,Joel Emer +2 more
- 23 Mar 2014