Exploring processor parallelism: Estimation methods and optimization strategies
Roel Jordans,Rosilde Corvino,Lech Jozwiak,Henk Corporaal +3 more
- 08 Apr 2013
- Vol. 4, Iss: 2, pp 18-23
TL;DR: In this article, the issue-width of an application specific VLIW issue is automatically selected based on a force-based parallelism measure, which is capable of estimating the required issuewidth within 3% on average.
read more
Abstract: Former research on automatic exploration of ASIP architectures mostly focused on either the internal memory hierarchy, or the addition of complex custom operations to RISC based architectures. This paper focuses on VLIW architectures and, more specifically, on automating the selection of an application specific VLIW issue-width. An accurate and efficient issue-width estimation strongly influences all the important processor properties (e.g. processing speed, silicon area, and power consumption). We first compare different methods for estimating the required issue-width, and subsequently introduce a new force-based parallelism measure which is capable of estimating the required issue-width within 3% on average. Moreover, we show that we can quickly estimate the latency-parallelism Pareto-front of an example ECG application with less than 10% error using our issue-width estimations.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
ASAM: Automatic architecture synthesis and application mapping
Lech Jozwiak,Menno Lindwer,Rosilde Corvino,Paolo Meloni,Laura Micconi,Jan Madsen,Erkan Diken,Deepak Gangadharan,Roel Jordans,Sebastiano Pomata,Paul Pop,Giuseppe Tuveri,Luigi Raffo,Giuseppe Notarangelo +13 more
TL;DR: An overview of the research being currently performed in the scope of the European project ASAM of the ARTEMIS program is presented, which system, design, and electronic design automation (EDA) concepts seem to be adequate to address the challenges and solve the problems.
30
Embedded Computing Technology for Highly-demanding Cyber-physical Systems
TL;DR: The embedded computing technology needed for the modern complex and highly-demanding mobile and autonomous CPS is discussed.
28
Advanced mobile and wearable systems
TL;DR: The huge heterogeneous area of these systems, and serious issues and challenges in their design are considered, and the embedded computing and design technologies needed to adequately address the issues and overcome the challenges in order to satisfy the stringent requirements of the modern mobile systems are discussed.
24
ASAM: Automatic Architecture Synthesis and Application Mapping
Lech Jozwiak,Menno Lindwer,Rosilde Corvino,Paolo Meloni,Laura Micconi,Jan Madsen,Erkan Diken,Deepak Gangadharan,Roel Jordans,Sebastiano Pomata,Paul Pop,Giuseppe Tuveri,Luigi Raffo +12 more
- 05 Sep 2012
TL;DR: An over-view of the research being currently performed in the scope of the European project ASAM of the ARTEMIS program is presented and which system, design, and electronic design automation concepts seem to be adequate to resolve the problems and address the challenges.
Construction and exploitation of VLIW ASIPs with heterogeneous vector-widths
TL;DR: The use of heterogeneous vector widths and a method to explore the heterogeneousvector widths for VLIW ASIPs are proposed and the associated design automation tools are explained.
13
References
•Journal Article
[''R"--project for statistical computing].
TL;DR: An introduction to the R project for statistical computing (www.R-project.org) is presented to make the professional community aware of "R" as a potent and free software for graphical and statistical analysis of medical data.
3.4K
Software pipelining: an effective scheduling technique for VLIW machines
Monica S. Lam
- 01 Jun 1988
TL;DR: This paper shows that software pipelining is an effective and viable scheduling technique for VLIW processors, and proposes a hierarchical reduction scheme whereby entire control constructs are reduced to an object similar to an operation in a basic block.
Iterative module scheduling: an algorithm for software pipelining loops
B. Ramakrishna Rau
- 30 Nov 1994
TL;DR: This paper presents a practical algorithm, iterative modulo scheduling, that is capable of dealing with realistic machine models and characterizes the algorithm in terms of the quality of the generated schedules as well the computational expense incurred.
749
Limits of instruction-level parallelism
David W. Wall
- 01 Apr 1991
TL;DR: The results of simulations of 18 different test programs under 375 different models of available parallelism analysis are presented, showing how simulations based on instruction traces can model techniques at the limits of feasibility and even beyond.
•Book
Limits of instruction-level parallelism
David W. Wall
- 01 Mar 1995
TL;DR: In this paper, the authors present the results of simulations of 18 different test programs under 375 different models of available parallelism analysis, including branch prediction, register renaming and alias analysis.
592
Related Papers (5)
Hillery C. Hunter,Jaime H. Moreno +1 more
- 30 Oct 2003
Jesmin Jahan Tithi,Neal Crago,Joel Emer +2 more
- 23 Mar 2014