The SPLASH-2 programs

doi:10.1145/225830.223990

Journal Article•10.1145/225830.223990

Steven Cameron Woo, +4 more

- 01 May 1995

- ACM Sigarch Computer Architecture News

485

TL;DR: The SPLASH-2 suite of parallel applications has recently been released to facilitate the study of centralized and distributed shared-address-space multiprocessors.

Citations

Proceedings Article•10.1145/3352460.3358314

CleanupSpec: An "Undo" Approach to Safe Speculation

Gururaj Saileshwar, +1 more

- 12 Oct 2019

TL;DR: CleanupSpec is a hardware-based solution that mitigates speculation-based attacks by undoing the changes to the cache sub-system caused by speculative instructions, in the event they are squashed on a mis-speculation.

141

Proceedings Article•10.5555/1874620.1874803

Power and performance of read-write aware hybrid caches with non-volatile memories

Xiaoxia Wu, +4 more

- 20 Apr 2009

TL;DR: It is demonstrated that a RWHCA design with a conservative setup can provide a geometric mean 55% power reduction and yet 5% IPC improvement over a baseline SRAM cache design across a collection of 30 workloads.

128

Journal Article•10.1109/TCAD.2018.2878168

Machine Learning for Power, Energy, and Thermal Management on Multicore Processors: A Survey

Santiago Pagani, +3 more

- 01 Jan 2020

- IEEE Transactions on Computer-Aided Desi...

TL;DR: This paper presents an overview of several research efforts that propose to use machine learning techniques for power and thermal management on single-core and multicore processors; such techniques can potentially adapt to varying system conditions and workloads.

118

Journal Article•10.1109/TPDS.2014.2383384

A-WiNoC: Adaptive Wireless Network-on-Chip Architecture for Chip Multiprocessors

Dominic DiTomaso, +5 more

- 01 Dec 2015

- IEEE Transactions on Parallel and Distri...

TL;DR: This paper proposes A-WiNoC, a scalable, adaptable wireless Network-on-Chip architecture that uses energy efficient wireless transceivers and improves network throughput by dynamically re-assigning channels in response to bandwidth demands from different cores.

103

Journal Article•10.1109/MWC.2012.6339473

Wireless networks-on-chips: architecture, wireless channel, and devices

David Matolak, +5 more

- 26 Oct 2012

- IEEE Wireless Communications

TL;DR: It is shown that the integration of wireless interconnects with wired interconnects in NoCs can reduce overall network power by 34 percent while achieving a speedup of 2.54 on real applications.

98

References

Proceedings Article•10.1145/800133.804339

Parallelism in random access machines

Steven Fortune, +1 more

- 01 May 1978

TL;DR: A model of computation based on random access machines operating in parallel and sharing a common memory is presented; it accepts in polynomial time exactly the sets accepted by nondeterministic exponential-time-bounded Turing machines.

1K

Proceedings Article•10.1145/122718.122740

A rapid hierarchical radiosity algorithm

Pat Hanrahan, +2 more

- 01 Jul 1991

TL;DR: Standard techniques for shooting and gathering can be used with the hierarchical representation to solve for equilibrium radiosities, but the paper also discusses using a brightness-weighted error criterion, in conjunction with multigridding, to progressively refine the image even more rapidly.

642

Proceedings Article•10.1145/113379.113380

A comparison of sorting algorithms for the connection machine CM-2

Guy E. Blelloch, +5 more

- 01 Jun 1991

TL;DR: A fast sorting algorithm for the Connection Machine Supercomputer model CM-2 is developed and it is shown that any O(lg n)-depth family of sorting networks can be used to sort n numbers in O(lg n) time in the bounded-degree fixed interconnection network domain.

375

Journal Article•10.1109/12.286299

False sharing and spatial locality in multiprocessor caches

Josep Torrellas, +2 more

- 01 Jun 1994

- IEEE Transactions on Computers

TL;DR: To mitigate false sharing and to enhance spatial locality, the layout of shared data in cache blocks is optimized in a programmer-transparent manner and it is shown that this approach can reduce the number of misses on shared data by about 10% on average.

269

Journal Article•10.1007/BF00162341

FFTs in external or hierarchical memory

David H. Bailey

- 01 Mar 1990

- The Journal of Supercomputing

TL;DR: Advanced techniques are described for computing an ordered FFT on a computer with external or hierarchical memory; they require as few as two passes through the external data set, employ strictly unit-stride, long vector transfers between main memory and external storage, and are well suited for vector and parallel computation.

266