Parallel visualization algorithms: performance and architectural implications

doi:10.1109/2.299410

Journal Article10.1109/2.299410

Parallel visualization algorithms: performance and architectural implications

J. Pal Singh, +2 more

- 01 Jul 1994

- IEEE Computer

- Vol. 27, Iss: 7, pp 45-55

180

TL;DR: This article demonstrates that simple and natural parallelizations work very well, the sequential implementations do not have to be fundamentally restructured, and the high degree of temporal locality obviates the need for explicit data distribution and communication management on the best known visualization algorithms.

Abstract: Recently, a new class of scalable, shared-address-space multiprocessors has emerged. Like message-passing machines, these multiprocessors have a distributed interconnection network and physically distributed main memory. However, they provide hardware support for efficient implicit communication through a shared address space, and they automatically exploit temporal locality by caching both local and remote data in a processor's hardware cache. In this article, we show that these architectural characteristics make it much easier to obtain very good speedups on the best known visualization algorithms. Simple and natural parallelizations work very well, the sequential implementations do not have to be fundamentally restructured, and the high degree of temporal locality obviates the need for explicit data distribution and communication management. We demonstrate our claims through parallel versions of three state-of-the-art algorithms: a recent hierarchical radiosity algorithm by Hanrahan et al. (1991), a parallelized ray-casting volume renderer by Levoy (1992), and an optimized ray-tracer by Spach and Pulleyblank (1992). We also discuss a new shear-warp volume rendering algorithm that provides the first demonstration of interactive frame rates for a 256/spl times/256/spl times/256 voxel data set on a general-purpose multiprocessor. >

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Proceedings Article•10.1145/223982.223990

The SPLASH-2 programs: characterization and methodological considerations

Steven Cameron Woo, +4 more

- 01 May 1995

TL;DR: This paper quantitatively characterize the SPLASH-2 programs in terms of fundamental properties and architectural interactions that are important to understand them well, including the computational load balance, communication to computation ratio and traffic needs, important working set sizes, and issues related to spatial locality.

...read moreread less

4.1K

•Proceedings Article•10.1145/192161.192283

Fast volume rendering using a shear-warp factorization of the viewing transformation

Philippe Lacroute, +1 more

- 24 Jul 1994

TL;DR: A new object-order rendering algorithm based on the factorization of a shear-warp factorization for perspective viewing transformations is described that is significantly faster than published algorithms with minimal loss of image quality.

...read moreread less

1.3K

Journal Article•10.1007/S00224-001-0004-Z

Thread Scheduling for Multiprogrammed Multiprocessors

Nimar S. Arora, +2 more

- 01 Jan 2001

- Theory of Computing Systems \/ Mathemati...

TL;DR: This work presents a user-level thread scheduler for shared-memory multiprocessors, and it achieves linear speedup whenever P is small relative to the parallelism T1/T∈fty .

...read moreread less

513

Journal Article•10.1145/225830.223990

The SPLASH-2 programs

WooSteven Cameron, +4 more

- 01 May 1995

- ACM Sigarch Computer Architecture News

TL;DR: The SPLASH-2 suite of parallel applications has recently been released to facilitate the study of centralized and distributed shared-address-space multiprocessors.

...read moreread less

489

Proceedings Article•10.1145/1508244.1508256

Kendo: efficient deterministic multithreading in software

Marek Olszewski, +2 more

- 07 Mar 2009

TL;DR: Kendo is a new software-only system that provides deterministic multithreading of parallel applications that is easier to develop, debug, and test and can run on today's commodity hardware while incurring only a modest performance cost.

...read moreread less

405

...

Expand

References

•Proceedings Article•10.1145/192161.192283

Fast volume rendering using a shear-warp factorization of the viewing transformation

Philippe Lacroute, +1 more

- 24 Jul 1994

TL;DR: A new object-order rendering algorithm based on the factorization of a shear-warp factorization for perspective viewing transformations is described that is significantly faster than published algorithms with minimal loss of image quality.

...read moreread less

1.3K

Journal Article•10.1145/378456.378487

A progressive refinement approach to fast radiosity image generation

Michael F. Cohen, +3 more

- 01 Jun 1988

TL;DR: A reformulated radiosity algorithm is presented that produces initial images in time linear to the number of patches, which brings the use of radiosity for interactive rendering within reach and has implications for the use and development of current and future graphics workstations.

...read moreread less

689

Proceedings Article•10.1145/122718.122740

A rapid hierarchical radiosity algorithm

Pat Hanrahan, +2 more

- 01 Jul 1991

TL;DR: Standard techniques for shooting and gathering can be used with the hierarchical representation to solve for equilibrium radiosities, but the paper also discusses using a brightness-weighted error criteria, in conjunction with multigridding, to even more rapidly progressively refine the image.

...read moreread less

642

•Book

Multiprocessor simulation and tracing using Tango

Helen Davis, +2 more

- 01 Jan 1995

173

Proceedings Article•10.1145/147130.147141

Volume rendering on scalable shared-memory MIMD architectures

Jason Nieh, +1 more

- 01 Dec 1992

TL;DR: A parallel volume rendering algorithm for MIMD architectures based on ray tracing and a novel task queue image partitioning technique that achieves nearly linear speedups and near real-time frame update rates on a 48 processor machine.

...read moreread less

151

Parallel visualization algorithms: performance and architectural implications

Chat with Paper

AI Agents for this Paper

Citations

The SPLASH-2 programs: characterization and methodological considerations

Fast volume rendering using a shear-warp factorization of the viewing transformation

Thread Scheduling for Multiprogrammed Multiprocessors

The SPLASH-2 programs

Kendo: efficient deterministic multithreading in software

References

Fast volume rendering using a shear-warp factorization of the viewing transformation

A progressive refinement approach to fast radiosity image generation

A rapid hierarchical radiosity algorithm

Multiprocessor simulation and tracing using Tango

Volume rendering on scalable shared-memory MIMD architectures

Related Papers (5)

The SPLASH-2 programs: characterization and methodological considerations

A rapid hierarchical radiosity algorithm

Fast volume rendering using a shear-warp factorization of the viewing transformation

The SGI Origin: a ccNUMA highly scalable server

A hierarchical O(N log N) force-calculation algorithm