Exploiting tightly-coupled cores

doi:10.1007/S11265-014-0944-6

Open AccessJournal Article10.1007/S11265-014-0944-6

Exploiting tightly-coupled cores

Daniel Bates, +3 more

- 15 Jul 2013

- Vol. 80, Iss: 1, pp 103-120

12

TL;DR: This paper focuses on the design of a single 8-core tile, conceived as the building block for a larger many-core system, and explores the tile’s ability to support a range of parallelisation opportunities and detail the control and communication mechanisms needed to exploit each cores’ resources in a flexible manner.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.1109/MM.2015.10

Always-on Vision Processing Unit for Mobile Applications

Brendan Barry, +7 more

- 27 Jan 2015

- IEEE Micro

TL;DR: The vision processing unit incorporates parallelism, instruction set architecture, and microarchitectural features to provide highly sustainable performance efficiency across a range of computational-Imaging and computer vision applications, including those with low latency requirements on the order of milliseconds.

...read moreread less

138

•Proceedings Article•10.1109/FPL.2018.00030

Accelerating Database Systems Using FPGAs: A Survey

Philippos Papaphilippou, +1 more

- 01 Aug 2018

TL;DR: This survey presents a systematic review of research relating to accelerating analytical database systems using FPGAs, including studies of database acceleration frameworks and accelerator implementations for various database operators.

...read moreread less

48

Proceedings Article•10.1109/COOLCHIPS.2019.8721304

Statistical Access Interval Prediction for Tightly Coupled Memory Systems

Robert Wittig, +3 more

- 01 Apr 2019

TL;DR: It is argued that most memory transaction of embedded processors can be reliably predicted in the time domain, therefore, preallocation of shared resources can be used to avoid collisions in the memory system.

...read moreread less

9

Book Chapter•10.1007/978-3-030-27562-4_16

Access Interval Prediction for Tightly Coupled Memory Systems.

Robert Wittig, +3 more

- 07 Jul 2019

TL;DR: A method for memory Access Interval Prediction is introduced that minimizes conflicts by predicting the interval between two consecutive memory accesses, thereby significantly reducing the number of access conflicts.

...read moreread less

7

Accelerating control-flow intensive code in spatial hardware

Ali Mustafa Zaidi

- 01 Jan 2015

TL;DR: This work demonstrates that it is possible to use custom and/or reconfigurable hardware in heterogeneous systems to improve the efficiency of frequently executed sequential code, without compromising performance relative to an energy inefficient out-of-order superscalar processor.

...read moreread less

6

References

Journal Article•10.21276/IJRE.2018.5.5.4

MapReduce: simplified data processing on large clusters

Jeffrey Dean, +1 more

- 06 Dec 2004

TL;DR: This paper presents the implementation of MapReduce, a programming model and an associated implementation for processing and generating large data sets that runs on a large cluster of commodity machines and is highly scalable.

...read moreread less

22.7K

Journal Article•10.1093/BIOMET/52.3-4.591

An Analysis of Variance Test for Normality (Complete Samples)

S. S. Shapiro, +1 more

- 01 Dec 1965

- Biometrika

TL;DR: In this article, a new statistical procedure for testing a complete sample for normality is introduced, which is obtained by dividing the square of an appropriate linear combination of the sample order statistics by the usual symmetric estimate of variance.

...read moreread less

20K

Journal Article•10.1145/1327452.1327492

MapReduce: simplified data processing on large clusters

Jeffrey Dean, +1 more

- 01 Jan 2008

- Communications of The ACM

TL;DR: This presentation explains how the underlying runtime system automatically parallelizes the computation across large-scale clusters of machines, handles machine failures, and schedules inter-machine communication to make efficient use of the network and disks.

...read moreread less

18.6K

•Proceedings Article•10.1109/WWC.2001.15

MiBench: A free, commercially representative embedded benchmark suite

Matthew R. Guthaus, +5 more

- 02 Dec 2001

TL;DR: A new version of SimpleScalar that has been adapted to the ARM instruction set is used to characterize the performance of the benchmarks using configurations similar to current and next generation embedded processors.

...read moreread less

3.7K