Topic

Cannon's algorithm

About: Cannon's algorithm is a research topic. Over its lifetime, 8 publications have appeared within this topic, receiving 79 citations.

Papers

Proceedings Article•

HERA: A reconfigurable and mixed-mode parallel computing engine on platform FPGAs

Xiaofang Wang¹, Sotirios G. Ziavras¹•Institutions (1)

New Jersey Institute of Technology¹

1 Dec 2004

TL;DR: This paper presents the design and implementation of a HERA (HEterogeneous Reconfigurable Architecture) machine that employs FPGAs to allow the simultaneous execution of a variety of parallel processing modes, including SIMD, MIMD, and MSIMD.

Abstract: The high price, long design and development cycles, programming difficulty and high maintenance cost of supercomputers limit their range of potential applications. Recent advances in Field-Programmable Gate Arrays (FPGAs) have made feasible the development of high-performance and programmable parallel systems on a programmable chip (PSOPC). PSOPCs yield high performance at low cost for many parallel applications. We present in this paper the design and implementation of our HERA (HEterogeneous Reconfigurable Architecture) machine that employs FPGAs to allow the simultaneous execution of a variety of parallel processing modes, including SIMD (Single-Instruction, Multiple-Data), MIMD (Multiple-Instruction, Multiple-Data) and MSIMD (Multiple-SIMD). The processing element is centered on a single-precision IEEE 754 floating-point unit (FPU) and employs a 7-stage pipeline. To demonstrate the robustness and viability of our approach, we propose a data partitioning scheme and employ mixed-mode scheduling for Cannon’s matrix-matrix multiplication algorithm with matrices of arbitrary size and shape. Performance results on our 64-PE machine that employs a dual-FPGA system are better than the optimized performance on a dual-Xeon PC.

10 citations

Proceedings Article•10.1109/DCABES.2012.61•

Optimization of Parallel I/O for Cannon's Algorithm Based on Lustre

Yunchun Li¹, Hongda Li¹•Institutions (1)

Beihang University¹

19 Oct 2012

TL;DR: A new aggregation pattern (Stripe-continuous aggregation pattern), which fully considers the striping mechanism and lock protocol of the Lustre file system, is proposed to improve the performance of Collective I/O of Cannon's program.

Abstract: Matrix multiplication is one of the most important operations in linear algebra, widely used in many fields of science and engineering. Cannon's algorithm is a classical distributed algorithm for matrix multiplication on two-dimensional meshes. Generally, MPI-IO is used for its I/O requirements. However, it has been well documented that MPI-IO performs poorly in a Lustre file system environment. As the scale of the matrix multiplication increases, this problem tends to become serious and turns into a key factor limiting the performance of the program. In order to improve the performance of Collective I/O of Cannon's program, we propose a new aggregation pattern (Stripe-continuous aggregation pattern), which fully considers the striping mechanism and lock protocol of the Lustre file system. The theoretical analysis and experimental results show that, compared with the other patterns, this pattern makes full use of the capacity of the Lustre file system and effectively improves the I/O performance of Cannon's program.
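For readers unfamiliar with the algorithm named in this abstract, the following is a minimal single-process sketch of the textbook form of Cannon's algorithm, not the code from any paper listed here. It simulates a q x q mesh in plain Python: each simulated process holds one block of A, B and C, the blocks are skewed once, and then q rounds of local multiply-accumulate alternate with a cyclic shift of A to the left and B upward. The function name cannon_matmul, the nested-list matrix layout, and the requirement that the matrix size be divisible by q are assumptions made for illustration only.

```python
def cannon_matmul(A, B, q):
    """Simulate Cannon's algorithm on a hypothetical q x q process mesh.

    A and B are n x n lists of lists; this sketch assumes n % q == 0.
    """
    n = len(A)
    if n % q != 0:
        raise ValueError("this sketch assumes n is divisible by q")
    bs = n // q  # edge length of each square block

    def block(M, bi, bj):
        # Copy block (bi, bj) out of the full matrix M.
        return [row[bj * bs:(bj + 1) * bs] for row in M[bi * bs:(bi + 1) * bs]]

    def mul_add(C, X, Y):
        # Local update C += X * Y on one simulated process.
        for r in range(bs):
            for k in range(bs):
                x = X[r][k]
                for c in range(bs):
                    C[r][c] += x * Y[k][c]

    # Initial alignment: block row i of A is shifted left by i positions,
    # block column j of B is shifted up by j positions.
    a_blk = [[block(A, i, (i + j) % q) for j in range(q)] for i in range(q)]
    b_blk = [[block(B, (i + j) % q, j) for j in range(q)] for i in range(q)]
    c_blk = [[[[0] * bs for _ in range(bs)] for _ in range(q)] for _ in range(q)]

    for _ in range(q):
        # Every simulated process multiplies the blocks it currently holds.
        for i in range(q):
            for j in range(q):
                mul_add(c_blk[i][j], a_blk[i][j], b_blk[i][j])
        # Cyclic shift: A blocks move one step left, B blocks one step up.
        a_blk = [[a_blk[i][(j + 1) % q] for j in range(q)] for i in range(q)]
        b_blk = [[b_blk[(i + 1) % q][j] for j in range(q)] for i in range(q)]

    # Gather the C blocks back into one n x n matrix.
    C = [[0] * n for _ in range(n)]
    for i in range(q):
        for j in range(q):
            for r in range(bs):
                C[i * bs + r][j * bs:(j + 1) * bs] = c_blk[i][j][r]
    return C
```

Calling cannon_matmul([[1, 2], [3, 4]], [[5, 6], [7, 8]], 2) returns [[19, 22], [43, 50]], the ordinary matrix product. In a real distributed implementation the two list rebuilds at the end of each round would be neighbour-to-neighbour messages on the mesh, and reading or writing the blocks is where the file I/O discussed in the abstract would occur.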

7 citations

Proceedings Article•10.1109/ICPP.2005.46•

Incremental parallelization using navigational programming: a case study

Lei Pan¹, Wendy Y. Zhang¹, Arthur U. Asuncion¹, Ming Kin Lai¹, Michael B. Dillencourt¹, Lubomir Bic¹•Institutions (1)

University of California, Irvine¹

14 Jun 2005

TL;DR: The NavP methodology is based on the principle of self-migrating computations and is truly incremental, in that each step represents a functioning program and every intermediate program is an improvement over its predecessor.

Abstract: We show how a series of transformations can be applied to incrementally parallelize sequential programs. Our navigational programming (NavP) methodology is based on the principle of self-migrating computations and is truly incremental, in that each step represents a functioning program and every intermediate program is an improvement over its predecessor. The transformations are mechanical and straightforward to apply. We illustrate our methodology in the context of matrix multiplication. Our final stage is similar to the classical Gentleman's algorithm. The NavP methodology is conducive to new ways of thinking that lead to ease of programming and high performance.

3 citations

Journal Article•10.1080/1063719031000087996•

The problem of small and large matrices in parallel Matrix Multiplication

Cristiana Piano

01 May 2003-Parallel Algorithms and Applications

TL;DR: Discusses the case in which the generalized Cannon's algorithm makes it possible to reduce communications in matrix multiplication, and proposes two strategies for multiplying two large square matrices.

Abstract: In this paper we discuss the case in which, using the generalized Cannon's algorithm, it is possible to reduce communications in matrix multiplication. We then apply reduction of communications to the case in which we have to multiply large matrices, in particular rectangular matrices. Two strategies are proposed to solve the problem of multiplying two large square matrices. For the case in which we have to deal with small matrices, some methods are proposed to use the entire number of processors.

1 citation

Journal Article•10.1145/1940475.1940502•

Heuristics for Cannon's algorithm with an application to Lyons sporadic group

Yannick Saouter¹•Institutions (1)

École nationale supérieure des télécommunications de Bretagne¹

28 Jan 2011-ACM Communications in Computer Algebra

TL;DR: Heuristics to improve the efficiency of Cannon's algorithm are described, and the method is used to derive a new presentation for the Lyons sporadic group.

Abstract: This article is devoted to describing heuristics to improve the efficiency of Cannon's algorithm. As an application, the method is used to derive a new presentation for the Lyons sporadic group.

1 citation

Performance Metrics

No. of papers in the topic in previous years
Year	Papers
2019	1
2012	1
2011	2
2005	1
2004	1
2003	1
