Rep-Tracker: a lightweight tiny ball tracking method using structure re-parameterization
15 May 2023
TL;DR: In this paper , a lightweight tiny ball tracking method is developed based on deep learning call Rep-Tracker, which incorporates the structure re-parameterization strategy on the basis of the pruned VGG-style model to construct a tennis tracking network.
read more
Abstract: Abstract Vision-based object tracking techniques are used to analyze sport competition videos. The latest tennis tracking models achieve very high accuracy. However, high precision networks often imply higher computational complexity, which in turn brings higher hardware requirements. Also, part of the networks suffer from the problem of different degrees of adaptation to different courses. In this paper, a lightweight tiny ball tracking method is developed based on deep learning call Rep-Tracker, which incorporates the structure re-parameterization strategy on the basis of the pruned VGG-style model to construct a tennis tracking network. Firstly, the video frames will be sent into a pruned VGG-16 encoder follow by a decoder composed of deconvolution, and heatmap will be output after a softmax layer. Based on the encoder-decoder structure , pruning methods are used to reduce the number of non-essential convolutional layers and channels in the network. Secondly, residual connections are added at each layers to form a multi-branch block at training time. By using a structural re-parameterization strategy, each branch in a multi-branch block will be equivalently transformed into a shape consistent convolution. The trained blocks are convert into a single convolution layer for inference. Finally, based on the heatmap, the position of the target object is obtained by Gaussian distribution function 1 Springer Nature 2021 L A T E X template Article Title and Hough gradient function. In the tennis tracking task our method achieves a high accuracy with GFLOPs only 1.29 on RTX 3080 GPU.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Evaluation of Tennis Teaching Effect UsingOptimized DL Model with Cloud ComputingSystem
Sai Srinivas Vellela,M. V. Rao,Srihari Varma Mantena,M. V. J. Reddy,Ramesh Vatambeti,Syed Ziaur Rahman +5 more
TL;DR: This study evaluates the effectiveness of a bidirectional long-short-term memory (BLSTM) model in assessing table tennis forehand strokes using sensory data from 16 players, outperforming LSTM approaches with optimal settings selected via an enhanced dragonfly algorithm.
5
References
Deep Residual Learning for Image Recognition
Kaiming He,Xiangyu Zhang,Shaoqing Ren,Jian Sun +3 more
- 27 Jun 2016
TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.
•Proceedings Article
Very Deep Convolutional Networks for Large-Scale Image Recognition
Karen Simonyan,Andrew Zisserman +1 more
- 04 Sep 2014
TL;DR: This work investigates the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting using an architecture with very small convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers.
102.6K
Going deeper with convolutions
Christian Szegedy,Wei Liu,Yangqing Jia,Pierre Sermanet,Scott Reed,Dragomir Anguelov,Dumitru Erhan,Vincent Vanhoucke,Andrew Rabinovich +8 more
- 07 Jun 2015
TL;DR: Inception as mentioned in this paper is a deep convolutional neural network architecture that achieves the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).
Densely Connected Convolutional Networks
Gao Huang,Zhuang Liu,Laurens van der Maaten,Kilian Q. Weinberger +3 more
- 21 Jul 2017
TL;DR: DenseNet as mentioned in this paper proposes to connect each layer to every other layer in a feed-forward fashion, which can alleviate the vanishing gradient problem, strengthen feature propagation, encourage feature reuse, and substantially reduce the number of parameters.
•Proceedings Article
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
Sergey Ioffe,Christian Szegedy +1 more
- 06 Jul 2015
TL;DR: Applied to a state-of-the-art image classification model, Batch Normalization achieves the same accuracy with 14 times fewer training steps, and beats the original model by a significant margin.