Sign function

Topic Tools

Papers published on a yearly basis

Papers

Journal Article•10.1016/J.CMA.2006.06.020•

A three dimensional large deformation meshfree method for arbitrary evolving cracks

[...]

Timon Rabczuk¹, Ted Belytschko¹•Institutions (1)

Northwestern University¹

15 May 2007-Computer Methods in Applied Mechanics and Engineering

TL;DR: In this paper, a new approach for modeling discrete cracks in mesh-free particle methods in 3D is described, where cracks can be arbitrarily oriented, but their growth is represented by activation of crack surfaces at individual particles, so no representation of the crack's topology is needed.

...read moreread less

1,110 citations

Book Chapter•10.1007/978-3-030-01267-0_44•

Bi-Real Net: Enhancing the Performance of 1-Bit CNNs with Improved Representational Capability and Advanced Training Algorithm

[...]

Zechun Liu¹, Baoyuan Wu², Wenhan Luo², Xin Yang³, Wei Liu², Kwang-Ting Cheng¹ - Show less +2 more•Institutions (3)

Hong Kong University of Science and Technology¹, Tencent², Huazhong University of Science and Technology³

8 Sep 2018

TL;DR: In this paper, the authors proposed a Bi-Real Network (Bi-Net) which connects the real activations (after the 1-bit convolution and/or batchNorm layer, before the sign function) to activations of the consecutive block, through an identity shortcut.

...read moreread less

Abstract: In this work, we study the 1-bit convolutional neural networks (CNNs), of which both the weights and activations are binary. While being efficient, the classification accuracy of the current 1-bit CNNs is much worse compared to their counterpart real-valued CNN models on the large-scale dataset, like ImageNet. To minimize the performance gap between the 1-bit and real-valued CNN models, we propose a novel model, dubbed Bi-Real net, which connects the real activations (after the 1-bit convolution and/or BatchNorm layer, before the sign function) to activations of the consecutive block, through an identity shortcut. Consequently, compared to the standard 1-bit CNN, the representational capability of the Bi-Real net is significantly enhanced and the additional cost on computation is negligible. Moreover, we develop a specific training algorithm including three technical novelties for 1-bit CNNs. Firstly, we derive a tight approximation to the derivative of the non-differentiable sign function with respect to activation. Secondly, we propose a magnitude-aware gradient with respect to the weight for updating the weight parameters. Thirdly, we pre-train the real-valued CNN model with a clip function, rather than the ReLU function, to better initialize the Bi-Real net. Experiments on ImageNet show that the Bi-Real net with the proposed training algorithm achieves 56.4% and 62.2% top-1 accuracy with 18 layers and 34 layers, respectively. Compared to the state-of-the-arts (e.g., XNOR Net), Bi-Real net achieves up to 10% higher top-1 accuracy with more memory saving and lower computational cost.

...read moreread less

746 citations

Proceedings Article•10.1109/CVPR.2017.574•

Deep Learning with Low Precision by Half-Wave Gaussian Quantization

[...]

Zhaowei Cai¹, Xiaodong He², Jian Sun, Nuno Vasconcelos¹•Institutions (2)

University of California, San Diego¹, Microsoft²

1 Jul 2017

TL;DR: An half-wave Gaussian quantizer (HWGQ) is proposed for forward approximation and shown to have efficient implementation, by exploiting the statistics of of network activations and batch normalization operations, and to achieve much closer performance to full precision networks than previously available low-precision networks.

...read moreread less

Abstract: The problem of quantizing the activations of a deep neural network is considered. An examination of the popular binary quantization approach shows that this consists of approximating a classical non-linearity, the hyperbolic tangent, by two functions: a piecewise constant sign function, which is used in feedforward network computations, and a piecewise linear hard tanh function, used in the backpropagation step during network learning. The problem of approximating the widely used ReLU non-linearity is then considered. An half-wave Gaussian quantizer (HWGQ) is proposed for forward approximation and shown to have efficient implementation, by exploiting the statistics of of network activations and batch normalization operations. To overcome the problem of gradient mismatch, due to the use of different forward and backward approximations, several piece-wise backward approximators are then investigated. The implementation of the resulting quantized network, denoted as HWGQ-Net, is shown to achieve much closer performance to full precision networks, such as AlexNet, ResNet, GoogLeNet and VGG-Net, than previously available low-precision networks, with 1-bit binary weights and 2-bit quantized activations.

...read moreread less

677 citations

Journal Article•10.1080/00207178008922881•

Linear model reduction and solution of the algebraic Riccati equation by use of the sign function

[...]

J. D. Roberts¹•Institutions (1)

University of Reading¹

01 Oct 1980-International Journal of Control

TL;DR: The sign function of a square matrix can be defined in terms of a contour integral or as the result of an iterated map as discussed by the authors, which enables a matrix to be decomposed into two components whose spectra lie on opposite sides of the imaginary axis.

...read moreread less

Abstract: The sign function of a square matrix can be defined in terms of a contour integral or as the result of an iterated map $. Application of this function enables a matrix to be decomposed into two components whose spectra lie on opposite sides of the imaginary axis. This has application in reduction of linear systems to lower order models and in the solution of the matrix Lyapunov and algebraic Riccati equations.

...read moreread less

470 citations

Book•

Generalized Functions: Theory and Technique

[...]

Ram P. Kanwal

1 Jan 1998

TL;DR: The Dirac delta function and delta sequences the Heaviside function the Dirac Delta function the delta sequences a unit dipole the heaviside sequences exercises as discussed by the authors, which is the most common sequence exercises.

...read moreread less

Abstract: The Dirac delta function and delta sequences the Heaviside function the Dirac delta function the delta sequences a unit dipole the Heaviside sequences exercises. (Part contents)

...read moreread less

413 citations

...

Expand

Year	Papers
2025	5
2024	8
2023	29
2022	56
2021	32
2020	24

Topic Tools

Papers published on a yearly basis

Papers

A three dimensional large deformation meshfree method for arbitrary evolving cracks

Bi-Real Net: Enhancing the Performance of 1-Bit CNNs with Improved Representational Capability and Advanced Training Algorithm

Deep Learning with Low Precision by Half-Wave Gaussian Quantization

Linear model reduction and solution of the algebraic Riccati equation by use of the sign function

Generalized Functions: Theory and Technique

Related Topics (5)

Performance Metrics