Exploring processor parallelism: Estimation methods and optimization strategies
Roel Jordans,Rosilde Corvino,Lech Jozwiak,Henk Corporaal +3 more
- 08 Apr 2013
- Vol. 4, Iss: 2, pp 18-23
TL;DR: In this article, the issue-width of an application specific VLIW issue is automatically selected based on a force-based parallelism measure, which is capable of estimating the required issuewidth within 3% on average.
read more
Abstract: Former research on automatic exploration of ASIP architectures mostly focused on either the internal memory hierarchy, or the addition of complex custom operations to RISC based architectures. This paper focuses on VLIW architectures and, more specifically, on automating the selection of an application specific VLIW issue-width. An accurate and efficient issue-width estimation strongly influences all the important processor properties (e.g. processing speed, silicon area, and power consumption). We first compare different methods for estimating the required issue-width, and subsequently introduce a new force-based parallelism measure which is capable of estimating the required issue-width within 3% on average. Moreover, we show that we can quickly estimate the latency-parallelism Pareto-front of an example ECG application with less than 10% error using our issue-width estimations.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
ASAM: Automatic architecture synthesis and application mapping
Lech Jozwiak,Menno Lindwer,Rosilde Corvino,Paolo Meloni,Laura Micconi,Jan Madsen,Erkan Diken,Deepak Gangadharan,Roel Jordans,Sebastiano Pomata,Paul Pop,Giuseppe Tuveri,Luigi Raffo,Giuseppe Notarangelo +13 more
TL;DR: An overview of the research being currently performed in the scope of the European project ASAM of the ARTEMIS program is presented, which system, design, and electronic design automation (EDA) concepts seem to be adequate to address the challenges and solve the problems.
30
Embedded Computing Technology for Highly-demanding Cyber-physical Systems
TL;DR: The embedded computing technology needed for the modern complex and highly-demanding mobile and autonomous CPS is discussed.
28
Advanced mobile and wearable systems
TL;DR: The huge heterogeneous area of these systems, and serious issues and challenges in their design are considered, and the embedded computing and design technologies needed to adequately address the issues and overcome the challenges in order to satisfy the stringent requirements of the modern mobile systems are discussed.
24
ASAM: Automatic Architecture Synthesis and Application Mapping
Lech Jozwiak,Menno Lindwer,Rosilde Corvino,Paolo Meloni,Laura Micconi,Jan Madsen,Erkan Diken,Deepak Gangadharan,Roel Jordans,Sebastiano Pomata,Paul Pop,Giuseppe Tuveri,Luigi Raffo +12 more
- 05 Sep 2012
TL;DR: An over-view of the research being currently performed in the scope of the European project ASAM of the ARTEMIS program is presented and which system, design, and electronic design automation concepts seem to be adequate to resolve the problems and address the challenges.
Construction and exploitation of VLIW ASIPs with heterogeneous vector-widths
TL;DR: The use of heterogeneous vector widths and a method to explore the heterogeneousvector widths for VLIW ASIPs are proposed and the associated design automation tools are explained.
13
References
Force-directed scheduling for the behavioral synthesis of ASICs
P.G. Paulin,J.P. Knight +1 more
TL;DR: A general scheduling methodology is presented that can be integrated into specialized or general-purpose high-level synthesis systems and reduces the number of functional units, storage units, and buses required by balancing the concurrency of operations assigned to them.
Exact and approximate algorithms for the extension of embedded processor instruction sets
TL;DR: In this paper, a set of algorithms are proposed to find the best instruction set extensions (ISEs) for a given application, based on a detailed analysis of the application code.
Optimal basic block instruction scheduling for multiple-issue processors using constraint programming
TL;DR: The optimal scheduler is presented, which is a scheduler that finds provably optimal schedules for basic blocks using techniques from constraint programming and scaled to the largest basic blocks, including basic blocks with up to 2600 instructions.
The Inhibition of Potential Parallelism by Conditional Jumps
E.M. Riseman,C.C. Foster +1 more
TL;DR: An infinite machine is postulate, one with an infinite memory and instruction stack, infinite registers and memory, and an infinite number of functional units, to execute a program in parallel at maximum speed by executing each instruction at the earliest possible moment.
PICO: automatically designing custom computers
Vinod Kathail,Shail Aditya,Robert Schreiber,B. Ramakrishna Rau,Darren C. Cronquist,Mukund Sivaraman +5 more
TL;DR: The paper discusses the PICO (program in, chip out) project, a long-range HP Labs research effort that aims to automate the design of optimized, application-specific computing systems - thus enabling the rapid and cost-effective design of custom chips when no adequately specialized, off-the-shelf design is available.
Related Papers (5)
Hillery C. Hunter,Jaime H. Moreno +1 more
- 30 Oct 2003
Jesmin Jahan Tithi,Neal Crago,Joel Emer +2 more
- 23 Mar 2014