About: Sign function is a research topic. Over the lifetime, 535 publications have been published within this topic receiving 9229 citations. The topic is also known as: signum function & sgn.
TL;DR: In this paper, a new approach for modeling discrete cracks in mesh-free particle methods in 3D is described, where cracks can be arbitrarily oriented, but their growth is represented by activation of crack surfaces at individual particles, so no representation of the crack's topology is needed.
TL;DR: In this paper, the authors proposed a Bi-Real Network (Bi-Net) which connects the real activations (after the 1-bit convolution and/or batchNorm layer, before the sign function) to activations of the consecutive block, through an identity shortcut.
Abstract: In this work, we study the 1-bit convolutional neural networks (CNNs), of which both the weights and activations are binary. While being efficient, the classification accuracy of the current 1-bit CNNs is much worse compared to their counterpart real-valued CNN models on the large-scale dataset, like ImageNet. To minimize the performance gap between the 1-bit and real-valued CNN models, we propose a novel model, dubbed Bi-Real net, which connects the real activations (after the 1-bit convolution and/or BatchNorm layer, before the sign function) to activations of the consecutive block, through an identity shortcut. Consequently, compared to the standard 1-bit CNN, the representational capability of the Bi-Real net is significantly enhanced and the additional cost on computation is negligible. Moreover, we develop a specific training algorithm including three technical novelties for 1-bit CNNs. Firstly, we derive a tight approximation to the derivative of the non-differentiable sign function with respect to activation. Secondly, we propose a magnitude-aware gradient with respect to the weight for updating the weight parameters. Thirdly, we pre-train the real-valued CNN model with a clip function, rather than the ReLU function, to better initialize the Bi-Real net. Experiments on ImageNet show that the Bi-Real net with the proposed training algorithm achieves 56.4% and 62.2% top-1 accuracy with 18 layers and 34 layers, respectively. Compared to the state-of-the-arts (e.g., XNOR Net), Bi-Real net achieves up to 10% higher top-1 accuracy with more memory saving and lower computational cost.
TL;DR: An half-wave Gaussian quantizer (HWGQ) is proposed for forward approximation and shown to have efficient implementation, by exploiting the statistics of of network activations and batch normalization operations, and to achieve much closer performance to full precision networks than previously available low-precision networks.
Abstract: The problem of quantizing the activations of a deep neural network is considered. An examination of the popular binary quantization approach shows that this consists of approximating a classical non-linearity, the hyperbolic tangent, by two functions: a piecewise constant sign function, which is used in feedforward network computations, and a piecewise linear hard tanh function, used in the backpropagation step during network learning. The problem of approximating the widely used ReLU non-linearity is then considered. An half-wave Gaussian quantizer (HWGQ) is proposed for forward approximation and shown to have efficient implementation, by exploiting the statistics of of network activations and batch normalization operations. To overcome the problem of gradient mismatch, due to the use of different forward and backward approximations, several piece-wise backward approximators are then investigated. The implementation of the resulting quantized network, denoted as HWGQ-Net, is shown to achieve much closer performance to full precision networks, such as AlexNet, ResNet, GoogLeNet and VGG-Net, than previously available low-precision networks, with 1-bit binary weights and 2-bit quantized activations.
TL;DR: The sign function of a square matrix can be defined in terms of a contour integral or as the result of an iterated map as discussed by the authors, which enables a matrix to be decomposed into two components whose spectra lie on opposite sides of the imaginary axis.
Abstract: The sign function of a square matrix can be defined in terms of a contour integral or as the result of an iterated map $. Application of this function enables a matrix to be decomposed into two components whose spectra lie on opposite sides of the imaginary axis. This has application in reduction of linear systems to lower order models and in the solution of the matrix Lyapunov and algebraic Riccati equations.
TL;DR: The Dirac delta function and delta sequences the Heaviside function the Dirac Delta function the delta sequences a unit dipole the heaviside sequences exercises as discussed by the authors, which is the most common sequence exercises.
Abstract: The Dirac delta function and delta sequences the Heaviside function the Dirac delta function the delta sequences a unit dipole the Heaviside sequences exercises. (Part contents)