Time squeezing for tiny devices

doi:10.1145/3307650.3322268

Open AccessProceedings Article10.1145/3307650.3322268

Time squeezing for tiny devices

Yuanbo Fan, +2 more

- 22 Jun 2019

- pp 657-670

10

TL;DR: This paper describes compiler and architecture co-design that opens new opportunities for timing slack that are otherwise impossible, and introduces novel mechanisms in the hardware and in the compiler that work together to improve the benefit of circuit-level timing speculation by effectively squeezing time during execution.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.1109/JSSC.2020.3027953

A Dynamic Timing Enhanced DNN Accelerator With Compute-Adaptive Elastic Clock Chain Technique

Tianyu Jia, +2 more

- 01 Jan 2021

- IEEE Journal of Solid-state Circuits

TL;DR: In this paper, an elastic clock chain scheme was proposed to provide a flexible multi-domain clock management scheme for in situ compute adaptability for deep neural network (DNN) accelerators.

...read moreread less

16

Journal Article•10.1109/JSSC.2020.2979451

An Adaptive Clock Scheme Exploiting Instruction-Based Dynamic Timing Slack for a GPGPU Architecture

Tianyu Jia, +3 more

- 23 Mar 2020

- IEEE Journal of Solid-state Circuits

TL;DR: An adaptive clock scheme to exploit instruction-based dynamic timing slack (DTS) for a general-purpose graphics processor unit (GPGPU) architecture and an elastic pipeline clocking scheme is developed to redistribute the timing margin across pipeline stages for machine learning computations.

...read moreread less

9

•Proceedings Article•10.1109/cgo53902.2022.9741276

NOELLE Offers Empowering LLVM Extensions

02 Apr 2022

TL;DR: NOELLE as mentioned in this paper extends abstractions and functionalities provided by LLVM enabling advanced, program-wide code analyses and transformations, and shows the power of NOELLE by presenting a diverse set of 11 custom tools built upon it.

...read moreread less

6

•Proceedings Article•10.1145/3368826.3377906

Introducing the pseudorandom value generator selection in the compilation toolchain

Michael Leonard, +1 more

- 22 Feb 2020

TL;DR: This work builds PRV Jeeves, the first fully automatic PRVG selector and provides the first deep study into the tradeoffs among the PRVGs in the C++ standard, finding no silver bullet for all programs and architectures.

...read moreread less

3

•Journal Article•10.1145/3504005

Low-power Near-data Instruction Execution Leveraging Opcode-based Timing Analysis

Tziouvaras Athanasios, +2 more

- 31 Jan 2022

- ACM Transactions on Architecture and Cod...

TL;DR: This work proposes a near-data processing and better-than-worst-case co-design methodology to efficiently move the instruction execution to the DRAM side and, at the same time, to allow the pipeline to operate at higher clock frequencies compared to the worst-case approach.

...read moreread less

2

References

•Journal Article•10.1109/JIOT.2014.2306328

Internet of Things for Smart Cities

Andrea Zanella, +4 more

- 14 Feb 2014

- IEEE Internet of Things Journal

TL;DR: This paper will present and discuss the technical solutions and best-practice guidelines adopted in the Padova Smart City project, a proof-of-concept deployment of an IoT island in the city of Padova, Italy, performed in collaboration with the city municipality.

...read moreread less

5.5K

•Proceedings Article•10.5555/977395.977673

LLVM: a compilation framework for lifelong program analysis & transformation

Chris Lattner, +1 more

- 20 Mar 2004

TL;DR: The design of the LLVM representation and compiler framework is evaluated in three ways: the size and effectiveness of the representation, including the type information it provides; compiler performance for several interprocedural problems; and illustrative examples of the benefits LLVM provides for several challenging compiler problems.

...read moreread less

5.4K

•Proceedings Article•10.1109/WWC.2001.15

MiBench: A free, commercially representative embedded benchmark suite

Matthew R. Guthaus, +5 more

- 02 Dec 2001

TL;DR: A new version of SimpleScalar that has been adapted to the ARM instruction set is used to characterize the performance of the benchmarks using configurations similar to current and next generation embedded processors.

...read moreread less

3.7K

•Journal Article

Internet of Things for Smart Cities

Sneha A. Dalvi, +1 more

- 01 Jul 2017

- Imperial journal of interdisciplinary re...

TL;DR: This paper focuses specifically to an urban IoT systems that, while still being quite a broad category, are characterized by their specific application domain and are designed to support the Smart City vision.

...read moreread less

3.6K

...

Expand

Time squeezing for tiny devices

Chat with Paper

AI Agents for this Paper

Citations

A Dynamic Timing Enhanced DNN Accelerator With Compute-Adaptive Elastic Clock Chain Technique

An Adaptive Clock Scheme Exploiting Instruction-Based Dynamic Timing Slack for a GPGPU Architecture

NOELLE Offers Empowering LLVM Extensions

Introducing the pseudorandom value generator selection in the compilation toolchain

Low-power Near-data Instruction Execution Leveraging Opcode-based Timing Analysis

References

Internet of Things for Smart Cities

LLVM: a compilation framework for lifelong program analysis & transformation

The gem5 simulator

MiBench: A free, commercially representative embedded benchmark suite

Internet of Things for Smart Cities

Related Papers (5)

Compiler-guided instruction-level clock scheduling for timing speculative processors

Exploiting Timing Error Resilience in Processor Architecture

Compiler-directed proactive power management for networks

Energy Efficient and Predictable Design of Real-Time Embedded Systems

Compiler-Directed Frequency and Voltage Scaling for a Multiple Clock Domain