Top 144 papers published in the topic of Multiplication in 1992

Showing papers on "Multiplication published in 1992"

Book•

Multiplication of distributions

[...]

1 Jan 1992

Abstract: If Ω denotes an open subset of Rn (n = 1, 2,…), we define an algebra g (Ω) which contains the space D′(Ω) of all distributions on Ω and such that C∞(Ω) is a subalgebra of G (Ω). The elements of G (Ω) may be considered as “generalized functions” on Ω and they admit partial derivatives at any order that generalize exactly the derivation of distributions. The multiplication in G(Ω) gives therefore a natural meaning to any product of distributions, and we explain how these results agree with remarks of Schwartz on difficulties concerning a multiplication of distributions. More generally if q = 1, 2,…, and ƒ∈OM(R2q)—a classical Schwartz notation—for any G1,…,Gq∈G(σ), we define naturally an element ƒG1,…,Gq∈G(σ). These results are applied to some differential equations and extended to the vector valued case, which allows the multiplication of vector valued distributions of physics.

...read moreread less

341 citations

Multiplication and division as models of situations.

[...]

Brian Greer¹•Institutions (1)

Queen's University¹

1 Jan 1992

317 citations

Book•

Systolic parallel processing

[...]

Nicolai Petkov

18 Dec 1992

TL;DR: The Systolic Mode of Parallel Processing is introduced, with examples: Mapping Different Filter Banks onto the Same Fixed-Size Processor Array, and Unidirectional Full-Systolic Arrays with Bidirectional Data Flow.

...read moreread less

Abstract: The Systolic Mode of Parallel Processing. Introduction to the Underlying Concept. The Original Motivation: VSLI Implementation. The Present Trend: Efficient Algorithms for Massively Parallel Computers. A List of Known Applications. Defining and Expressing Systolic Arrays and Algorithms. Using Automata Notions. Defining Systolic Automata, Arrays, and Algorithms. Expressing Systolic Algorithms. Analysis and Comparison of Systolic Algorithms. Matrix-Vector and Matrix Multiplication. Introduction to Vectors and Matrices. Matrix-Vector Multiplication. Systolic Simulation of Feedforward Artificial Neural Networks. Matrix Multiplication. Solving Systems of Linear Algebraic Equations. Introduction to Linear Systems. Gaussian Elimination. Systolic Arrays for Triangularization and LU/QR Decomposition. Systolic Algorithms for Back Substitution. Systolic Implementation of Iterative Methods. Further Problems of Linear Algebra. Computing the Inverse of a Matrix. Generalized Elimination. Computing the Characteristic Polynomial. Matrix Transposition and Related Operations. Convolution and Linear Filters. Convolution, Correlation, FIR and IIR Filters. Semi-Systolic Realizations. Unidirectional Full-Systolic Arrays. Systolic Arrays with Bidirectional Data Flow. Bit-Level Systolic Convolver. Operations with Polynomials. Introduction. Multiplication of Polynomials and Integers. Division of Polynomials. Computing the Greatest Common Divisor. Polynomial Interpolation. Evaluation of Polynomials. Comparison Problems. Sorting. Selection and Running Order Statistics. Sorting and Order Statistics for Rank Filtering. A Data Structure: Priority Queue. Dynamic Programming and its Applications. Introduction. Implementing the Dynamic Programming Recurrence in a Two-Dimensional Systolic Array. Implementation in One-Dimensional Arrays. Further Dynamic Programming Recurrences. Computational Geometry. Convex Hull. Nearest-Neighbours Problems. Systematic Design of Systolic Algorithms. Dependence Graphs. Systolic Array Dependence Graphs. Extracting Systolic Algorithms from Dependence Graphs. Modifying the Properties of Systolic Algorithms. Partitioning of Systolic Algorithms. Partitioning, Algorithm Mapping, Design of Flexible Systolic Structures, Time Sharing. Application of c-Slow Automata to the Realization of Parallel Structures. Examples: Mapping Different Filter Banks onto the Same Fixed-Size Processor Array. A Summary of the Technique and Alternative Approaches. References and Additional Literature. Subject Index.

...read moreread less

110 citations

Patent•

Dct/idct processor and data processing method

[...]

Shinichi Uramoto¹, Yoshitsugu Inoue¹•Institutions (1)

Mitsubishi¹

27 Mar 1992

TL;DR: In this paper, the product sum operation is performed by a ROM table and an adder, and the number of times of multiplication is reduced by utilizing inherent characteristics of coefficients of DCT/IDCT processing.

...read moreread less

Abstract: A one-dimensional discrete cosine transform processor of N (N: positive integer)-term input data X includes a preprocessing section for carrying out addition and subtraction of (i)th-term data x (i) and (N-i)th-term data x (N-1) of input data X, and a unit for performing a product sum operation for sets of intermediate data subjected to preprocessing by addition and sets of intermediate data subjected to preprocessing by subtraction, respectively. The product sum operation unit includes a data rearranging unit for outputting, in parallel and in order, bit data of the same figure of a set of data, a partial sum generator for generating a partial sum by using the parallel bit data as an address, and an accumulator for accumulating outputs of the partial sum generator. A one-dimensional inverse discrete cosine transform processor of N-term input data X includes a unit for performing a product sum operation of input data, and a postprocessing section for carrying out addition and subtraction of 2-term data in a predetermined combination of an output of the product sum operation unit. The number of times of multiplication is reduced by utilizing inherent characteristics of coefficients of DCT/IDCT processing. Since the product sum operation is performed by a ROM table and an adder, a faster multiplication is realized.

...read moreread less

86 citations

Journal Article•10.1016/0024-3795(92)90393-O•

On practical algorithms for accelerated matrix multiplication

[...]

Julian Laderman¹, Victor Y. Pan², Victor Y. Pan¹, Xuan-He Sha³•Institutions (3)

Lehman College¹, University at Albany, SUNY², The Graduate Center, CUNY³

01 Feb 1992-Linear Algebra and its Applications

TL;DR: Surprisingly, this enables us to decrease the bilinear complexity of n X n matrix multiplication below the current record upper bound for the same computation over the infinite fields of complex, real, or rational numbers.

...read moreread less

86 citations

Journal Article•10.1049/IP-E.1992.0036•

Division and bit-serial multiplication over GF(qm)

[...]

M.A. Hasan¹, Vijay K. Bhargava¹•Institutions (1)

Victoria University, Australia¹

1 May 1992

TL;DR: In this paper, it was shown that, when field elements are represented by polynomials, division over finite fields can be performed by solving a system of m linear equations over GF(q).

...read moreread less

Abstract: Division and bit-serial multiplication in finite fields are considered. Using co-ordinates of the supporting elements it is shown that, when field elements are represented by polynomials, division over GF(qm) can be performed by solving a system of m linear equations over GF(q). For a canonical basis representation, a relationship between the division and the discrete-time Wiener-Hopf equation of degree m over GF(q) is derived. This relationship leads to a bit-serial multiplication scheme that can be easily realised for all irreducible polynomials.

...read moreread less

81 citations

Mapping unstructured grid computations to massively parallel computers

[...]

Steven Warren Hammond¹•Institutions (1)

Research Institute for Advanced Computer Science¹

1 Jan 1992

TL;DR: In this article, a taxonomy of objective functions and heuristics used to solve the mapping problem is presented, and a highly parallel heuristic mapping algorithm, called Cyclic Pairwise Exchange (CPE), is developed.

...read moreread less

Abstract: This thesis investigates the mapping problem: assign the tasks of a parallel program to the processors of a parallel computer such that the execution time is minimized. First, a taxonomy of objective functions and heuristics used to solve the mapping problem is presented. Next, we develop a highly parallel heuristic mapping algorithm, called Cyclic Pairwise Exchange (CPE), and discuss its place in the taxonomy. CPE uses local pairwise exchanges of processor assignments to iteratively improve an initial mapping. A variety of initial mapping schemes are tested and recursive spectral bipartitioning (RSB) followed by CPE is shown to result in the best mappings. For the test cases studied here, problems arising in computational fluid dynamics and structural mechanics on unstructured triangular and tetrahedral meshes, RSB and CPE outperform methods based on simulated annealing. Much less time is required to do the mapping and the results obtained are better. Compared with random and naive mappings, RSB and CPE reduce the communication time twofold for the test problems used. Finally, we use CPE in two applications on a CM-2. The first application is a data parallel mesh-vertex upwind finite volume scheme for solving the Euler equations on 2-D triangular unstructured meshes. CPE is used to map grid points to processors. The performance of this code is compared with a similar code on a Cray-YMP and an Intel iPSC/860. The second application is parallel sparse matrix-vector multiplication used in the iterative solution of large sparse linear systems of equations. We map rows of the matrix to processors and use an inner-product based matrix-vector multiplication. We demonstrate that this method is an order of magnitude faster than methods based on scan operations for our test cases.

...read moreread less

71 citations

Journal Article•10.1109/12.214659•

On-the-fly rounding (computing arithmetic)

[...]

Milos D. Ercegovac¹, Tomás Lang²•Institutions (2)

University of California, Los Angeles¹, University of California, Irvine²

01 Dec 1992-IEEE Transactions on Computers

TL;DR: Three ways to modify this conversion process so that the result is rounded are described, which can be done on-the-fly as the digits are produced, without the use of a carry-propagate adder.

...read moreread less

Abstract: In implementations of operations based on digit-recurrence algorithms such as division, left-to-right multiplication and square root, the result is obtained in digit-serial form, from most significant digit to least significant. To reduce the complexity of the result-digit selection and allow the use of redundant addition, the result-digit has values from a signed-digit set. As a consequence, the result has to be converted to conventional representation, which can be done on-the-fly as the digits are produced, without the use of a carry-propagate adder. The authors describe three ways to modify this conversion process so that the result is rounded. The resulting operation is fast because no carry-propagate addition is needed. The schemes described apply also to online arithmetic operations. >

...read moreread less

52 citations

Journal Article•10.1109/12.166599•

Bit-parallel arithmetic in a massively-parallel associative processor

[...]

Isaac D. Scherson¹, D.A. Kramer², Brian D. Alleyne²•Institutions (2)

University of California, Irvine¹, Princeton University²

01 Oct 1992-IEEE Transactions on Computers

TL;DR: A simple but powerful architecture based on the classical associative processor model, by distributing logic among slices of storage cells such that a number of bit-planes share a simple logic unit, bit-parallel arithmetic for massively parallel processing becomes feasible.

...read moreread less

Abstract: A simple but powerful architecture based on the classical associative processor model is proposed. By distributing logic among slices of storage cells such that a number of bit-planes share a simple logic unit, bit-parallel arithmetic for massively parallel processing becomes feasible. For m-bit operands, this architecture enables complex operations such as multiplication and division to execute in O(m) cycles as opposed to O(m/sup 2/) for bit-serial machines. Algorithms which utilize this bit-parallel property to efficiently perform operations on floating point data have been developed. The simplicity of the architecture enables its implementation using VLSI technology, and hence allows the construction of a word-parallel, bit-parallel, massively parallel (P/sup 3/) computing system. Implementations of the fast Fourier transform and matrix multiplication are presented to illustrate the operation of this system. >

...read moreread less

49 citations

Patent•

Method and apparatus for implementing a digital filter employing coefficients expressed as sums of 2 to an integer power

[...]

Kun Lin

26 May 1992

TL;DR: In the context of shift-and-add algorithms, the lower order terms require fewer shifting operations and less total hardware to effect multiplication than the corresponding higher-order terms as mentioned in this paper.

...read moreread less

Abstract: Method and apparatus for implementing a digital filter employing coefficients expressed as sums of 2 to an integer power. Coefficients expressed as sums of powers of 2 may be algebraically manipulated such that higher order terms are replaced by an equivalent group of lower order terms. In the context of a shift-and-add algorithm, the lower order terms require fewer shifting operations and less total hardware to effect multiplication than the corresponding higher order terms.

...read moreread less

47 citations

...

Expand

Showing papers on "Multiplication published in 1992"

Multiplication of distributions

Multiplication and division as models of situations.

Systolic parallel processing

Dct/idct processor and data processing method

On practical algorithms for accelerated matrix multiplication

Division and bit-serial multiplication over GF(qm)

Mapping unstructured grid computations to massively parallel computers

On-the-fly rounding (computing arithmetic)

Bit-parallel arithmetic in a massively-parallel associative processor

Method and apparatus for implementing a digital filter employing coefficients expressed as sums of 2 to an integer power

An optimal multiplication algorithm for reconfigurable mesh

Method and apparatus for multiplying two numbers using signed arithmetic

Children's Solutions to Multiplication and Division Word Problems: A Longitudinal Study.

Efficient matrix multiplication on SIMD computers

Optimal carry save networks

The Psychological Analysis of Multiplication Procedures.

High-speed VLSI arithmetic processor architectures using hybrid number representation

Analytic reproducing kernels and multiplication operators

Electrically-driven power steering device

A modular exponentiation unit based on systolic arrays

Matrix-vector multiplication using digital partitioning for more accurate optical computing.

Modified Booth algorithm for high radix multiplication

Game cards for playing a game and for learning arithmetic

Arithmetic unit for multiplying long integers modulo M and R.S.A. converter provided with such multiplication device

A high performance algorithm using pre-processing for the sparse matrix-vector multiplication

Higher radix square root with prescaling

On the multiplication of reduced biquaternions and applications

Adaptive m-ary segmentation and canonical recoding algorithms for multiplication of large binary numbers

The scheduling of sparse matrix-vector multiplication on a massively parallel dap computer

The graph of multiplication is equivalent to counting