ARETE: Accurate Error Assessment via Machine Learning-Guided Dynamic-Timing Analysis

doi:10.1109/tc.2022.3191966

Journal Article10.1109/tc.2022.3191966

ARETE: Accurate Error Assessment via Machine Learning-Guided Dynamic-Timing Analysis

01 Apr 2023

- IEEE Transactions on Computers

- Vol. 72, Iss: 4, pp 1026-1040

3

TL;DR: ARETE as discussed by the authors is a cross-layer fault-injection framework that combines dynamic-binary instrumentation with machine learning-guided dynamic-timing analysis to estimate the location of the injecting errors via dynamic-time analysis.

Abstract: Nanometer circuits are increasingly prone to timing errors, escalating the need for <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">fault injection</i> frameworks to accurately evaluate their impact on applications. In this paper, we propose ARETE, a novel cross-layer, fault-injection framework that combines dynamic-binary instrumentation with machine learning-guided dynamic-timing analysis. ARETE enables accurate fault-injection into any application by estimating the location of the injecting errors via dynamic-timing analysis. To accelerate fault-injection, we develop a novel, data-aware, machine learning-based mechanism that dynamically pre-selects the error-prone instructions and limits the application of the costly dynamic-timing analysis only to them. To evaluate ARETE's accuracy, our fully automated toolflow is configured to support fault-injection based on detailed post-layout gate-level simulations as well as via existing workload-agnostic error models. Our results for various workloads, including an autonomous-driving library, show that the location and time of injected errors performed by ARETE, is 89.9% consistent with fault-injection based on full gate-level simulation. On average, ARETE executes 84.6× faster than gate-level simulation and at a cost of 3.4% loss in the program output quality estimation. When compared to the existing statistical fault-injection tools that are based on workload-agnostic error models, ARETE improves the accuracy of fault-injection rate and output quality estimation by 143.9% and 40.4% on average, respectively.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.1109/iolts59296.2023.10224868

Microarchitecture-Aware Timing Error Prediction via Deep Neural Networks

Styliani Tompazi, +1 more

- 03 Jul 2023

TL;DR: Microarchitecture-aware timing error prediction via deep neural networks accurately predicts timing errors in nanometer circuits considering microarchitecture and workload parameters. The novel framework combines post-layout dynamic timing analysis and genetic algorithms to generate error-prone microarchitecture-aware samples. NN models are trained and evaluated on these samples, achieving high accuracy and improved scalability.

...read moreread less

1

Journal Article•10.1145/3665314.3670836

ePredictNet: Low Cost Error Prediction Neural Network

Georgios Chatzitsompanis, +1 more

- 05 Aug 2024

Proceedings Article•10.1109/iccd58817.2023.00012

A Compressed and Accurate Sparse Deep Learning-based Workload-Aware Timing Error Model

Styliani Tompazi, +1 more

- 06 Nov 2023

- International Conference on Community De...

TL;DR: This study shows that DL can help increase the accuracy and true positive rate (TPR) of workload-aware models for a pipelined floating-point core compared to existing models and demonstrates that removing up to 40% of the total neurons has minimal impact on the accuracy and overall predictive performance of the DL-based timing error models.

...read moreread less

References

Proceedings Article•10.1109/CVPR.2012.6248074

Are we ready for autonomous driving? The KITTI vision benchmark suite

Andreas Geiger, +2 more

- 16 Jun 2012

TL;DR: The autonomous driving platform is used to develop novel challenging benchmarks for the tasks of stereo, optical flow, visual odometry/SLAM and 3D object detection, revealing that methods ranking high on established datasets such as Middlebury perform below average when being moved outside the laboratory to the real world.

...read moreread less

16.3K

•Proceedings Article•10.1109/IISWC.2009.5306797

Rodinia: A benchmark suite for heterogeneous computing

Shuai Che, +6 more

- 04 Oct 2009

TL;DR: This characterization shows that the Rodinia benchmarks cover a wide range of parallel communication patterns, synchronization techniques and power consumption, and has led to some important architectural insight, such as the growing importance of memory-bandwidth limitations and the consequent importance of data layout.

...read moreread less

3.2K

•Journal Article•10.1186/S40537-019-0192-5

Survey on deep learning with class imbalance

Justin M. Johnson, +1 more

- 01 Mar 2019

- Journal of Big Data

TL;DR: Examination of existing deep learning techniques for addressing class imbalanced data finds that research in this area is very limited, that most existing work focuses on computer vision tasks with convolutional neural networks, and that the effects of big data are rarely considered.

...read moreread less

2.4K

Proceedings Article•10.1145/125826.125925

The NAS parallel benchmarks—summary and preliminary results

David H. Bailey, +12 more

- 01 Aug 1991

Abstract: No abstract available

...read moreread less

614

•Dissertation

Efficient, transparent, and comprehensive runtime code manipulation

Derek L. Bruening, +1 more

- 01 Jan 2004

TL;DR: D DynamoRIO is presented, a fully-implemented runtime code manipulation system that supports code transformations on any part of a program, while it executes, with zero to thirty percent time and memory overhead on both Windows and Linux.

...read moreread less

433

...

Expand

ARETE: Accurate Error Assessment via Machine Learning-Guided Dynamic-Timing Analysis

Chat with Paper

AI Agents for this Paper

Citations

Microarchitecture-Aware Timing Error Prediction via Deep Neural Networks

ePredictNet: Low Cost Error Prediction Neural Network

A Compressed and Accurate Sparse Deep Learning-based Workload-Aware Timing Error Model

References

Are we ready for autonomous driving? The KITTI vision benchmark suite

Rodinia: A benchmark suite for heterogeneous computing

Survey on deep learning with class imbalance

The NAS parallel benchmarks—summary and preliminary results

Efficient, transparent, and comprehensive runtime code manipulation

Related Papers (5)

FGSA for optimal quality of service based transaction in real-time database systems under different workload condition

Design of Fault Injection Tool Based on JTAG

Self-Prediction of Performance Metrics for the Database Management System Workload

Optimizations for Eliminating Control Dependence in Multimedia Programs

Precise Fault Injection and Fault Location System for SRAM-based FPGAs