Alex Sergeev
5 Papers
Alex Sergeev is an academic researcher. The author has contributed to research in topics: Convolutional neural network & Deep learning. The author has an hindex of 4, co-authored 4 publications.
Chat about Author
Papers
•Proceedings Article
An intriguing failing of convolutional neural networks and the CoordConv solution
Rosanne Liu,Joel Lehman,Piero Molino,Felipe Petroski Such,Eric Frank,Alex Sergeev,Jason Yosinski +6 more
- 01 Jan 2018
TL;DR: CoordConv as discussed by the authors proposes to give convolution access to its own input coordinates through the use of extra coordinate channels, allowing networks to learn either complete translation invariance or varying degrees of translation dependence, as required by the end task.
•Posted Content
An Intriguing Failing of Convolutional Neural Networks and the CoordConv Solution
Rosanne Liu,Joel Lehman,Piero Molino,Felipe Petroski Such,Eric Frank,Alex Sergeev,Jason Yosinski +6 more
TL;DR: Preliminary evidence that swapping convolution for CoordConv can improve models on a diverse set of tasks is shown, which works by giving convolution access to its own input coordinates through the use of extra coordinate channels without sacrificing the computational and parametric efficiency of ordinary convolution.
528
•Proceedings Article
Faster Neural Networks Straight from JPEG
Lionel Gueguen,Alex Sergeev,Ben Kadlec,Rosanne Liu,Jason Yosinski +4 more
- 12 Feb 2018
TL;DR: A simple idea is proposed and explored: train CNNs directly on the blockwise discrete cosine transform (DCT) coefficients computed and available in the middle of the JPEG codec, modified to produce DCT coefficients directly, and evaluated on ImageNet.
•Proceedings Article
Faster Neural Networks Straight from JPEG.
Lionel Gueguen,Alex Sergeev,Rosanne Liu,Jason Yosinski +3 more
- 01 Jan 2018
92
Proceedings Article
Accelerating Collective Communication in Data Parallel Training across Deep Learning Frameworks
Joshua Romero,Junqi Yin,Nouamane Laanait,Bing Xie,M. Young,Sean J. Treichler,Vitalii Starchenko,Albina Y. Borisevich,Alex Sergeev,Michael A. Matheson +9 more
TL;DR: The Horovod control plane is improved by implementing a new coordination scheme that takes advantage of the characteristics of the typical data parallel training paradigm, namely the repeated execution of collectives on the gradients of a set of tensors.