Time squeezing for tiny devices
Yuanbo Fan,Simone Campanoni,Russ Joseph +2 more
- 22 Jun 2019
- pp 657-670
TL;DR: This paper describes compiler and architecture co-design that opens new opportunities for timing slack that are otherwise impossible, and introduces novel mechanisms in the hardware and in the compiler that work together to improve the benefit of circuit-level timing speculation by effectively squeezing time during execution.
read more
Abstract: Dynamic timing slack has emerged as a compelling opportunity for eliminating inefficiency in ultra-low power embedded systems. This slack arises when all the signals have propagated through logic paths well in advance of the clock signal. When it is properly identified, the system can exploit this unused cycle time for energy savings. In this paper, we describe compiler and architecture co-design that opens new opportunities for timing slack that are otherwise impossible. Through cross-layer optimization, we introduce novel mechanisms in the hardware and in the compiler that work together to improve the benefit of circuit-level timing speculation by effectively squeezing time during execution. This approach is particularly well-suited to tiny embedded devices. Our evaluation on a gate-level model of a complete processor shows that our co-design saves (on average) 40.5% of the original energy consumption (additional 16.5% compared to the existing clock scheduling technique) across 13 workloads while retaining transparency to developers.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
A Dynamic Timing Enhanced DNN Accelerator With Compute-Adaptive Elastic Clock Chain Technique
Tianyu Jia,Yuhao Ju,Jie Gu +2 more
TL;DR: In this paper, an elastic clock chain scheme was proposed to provide a flexible multi-domain clock management scheme for in situ compute adaptability for deep neural network (DNN) accelerators.
16
An Adaptive Clock Scheme Exploiting Instruction-Based Dynamic Timing Slack for a GPGPU Architecture
TL;DR: An adaptive clock scheme to exploit instruction-based dynamic timing slack (DTS) for a general-purpose graphics processor unit (GPGPU) architecture and an elastic pipeline clocking scheme is developed to redistribute the timing margin across pipeline stages for machine learning computations.
9
NOELLE Offers Empowering LLVM Extensions
02 Apr 2022
TL;DR: NOELLE as mentioned in this paper extends abstractions and functionalities provided by LLVM enabling advanced, program-wide code analyses and transformations, and shows the power of NOELLE by presenting a diverse set of 11 custom tools built upon it.
6
Introducing the pseudorandom value generator selection in the compilation toolchain
Michael Leonard,Simone Campanoni +1 more
- 22 Feb 2020
TL;DR: This work builds PRV Jeeves, the first fully automatic PRVG selector and provides the first deep study into the tradeoffs among the PRVGs in the C++ standard, finding no silver bullet for all programs and architectures.
3
Low-power Near-data Instruction Execution Leveraging Opcode-based Timing Analysis
TL;DR: This work proposes a near-data processing and better-than-worst-case co-design methodology to efficiently move the instruction execution to the DRAM side and, at the same time, to allow the pipeline to operate at higher clock frequencies compared to the worst-case approach.
2
References
Moore's law: past, present and future
TL;DR: Moore's Law has become the central driving force of one of the most dynamic of the world's industries as discussed by the authors, and it is viewed as a reliable method of calculating future trends as well, setting the pace of innovation, and defining the rules and the very nature of competition.
1.9K
Secrets of optical flow estimation and their principles
Deqing Sun,Stefan Roth,Michael J. Black +2 more
- 13 Jun 2010
TL;DR: It is discovered that “classical” flow formulations perform surprisingly well when combined with modern optimization and implementation techniques, and while median filtering of intermediate flow fields during optimization is a key to recent performance gains, it leads to higher energy solutions.
Parameter variations and impact on circuits and microarchitecture
Shekhar Borkar,Tanay Karnik,Siva G. Narendra,James W. Tschanz,Ali Keshavarzi,Vivek De +5 more
- 02 Jun 2003
TL;DR: Process, voltage and temperature variations; and their impact on circuit and microarchitecture; and possible solutions to reduce the impact of parameter variations and to achieve higher frequency bins are presented.
Razor: a low-power pipeline based on circuit-level timing speculation
Daniel J. Ernst,Nam Sung Kim,Shidhartha Das,Sanjay Pant,Rajeev R. Rao,Toan Pham,Conrad H. Ziesler,David Blaauw,Todd Austin,Krisztian Flautner,Trevor Mudge +10 more
- 03 Dec 2003
TL;DR: A solution by which the circuit can be operated even below the ‘critical’ voltage, so that no margins are required and thus more energy can be saved.
1.3K
Science, technology and the future of small autonomous drones
Dario Floreano,Robert J. Wood +1 more
TL;DR: This work identifies scientific and technological advances that are expected to translate, within appropriate regulatory frameworks, into pervasive use of autonomous drones for civilian applications.