Rep-Tracker: a lightweight tiny ball tracking method using structure re-parameterization

doi:10.21203/rs.3.rs-2903142/v1

Open AccessPosted Content10.21203/rs.3.rs-2903142/v1

Rep-Tracker: a lightweight tiny ball tracking method using structure re-parameterization

15 May 2023

1

TL;DR: In this paper , a lightweight tiny ball tracking method is developed based on deep learning call Rep-Tracker, which incorporates the structure re-parameterization strategy on the basis of the pruned VGG-style model to construct a tennis tracking network.

Abstract: Abstract Vision-based object tracking techniques are used to analyze sport competition videos. The latest tennis tracking models achieve very high accuracy. However, high precision networks often imply higher computational complexity, which in turn brings higher hardware requirements. Also, part of the networks suffer from the problem of different degrees of adaptation to different courses. In this paper, a lightweight tiny ball tracking method is developed based on deep learning call Rep-Tracker, which incorporates the structure re-parameterization strategy on the basis of the pruned VGG-style model to construct a tennis tracking network. Firstly, the video frames will be sent into a pruned VGG-16 encoder follow by a decoder composed of deconvolution, and heatmap will be output after a softmax layer. Based on the encoder-decoder structure , pruning methods are used to reduce the number of non-essential convolutional layers and channels in the network. Secondly, residual connections are added at each layers to form a multi-branch block at training time. By using a structural re-parameterization strategy, each branch in a multi-branch block will be equivalently transformed into a shape consistent convolution. The trained blocks are convert into a single convolution layer for inference. Finally, based on the heatmap, the position of the target object is obtained by Gaussian distribution function 1 Springer Nature 2021 L A T E X template Article Title and Hough gradient function. In the tennis tracking task our method achieves a high accuracy with GFLOPs only 1.29 on RTX 3080 GPU.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.5815/ijmecs.2024.02.02

Evaluation of Tennis Teaching Effect UsingOptimized DL Model with Cloud ComputingSystem

Sai Srinivas Vellela, +5 more

- 08 Apr 2024

- International Journal of Modern Educatio...

TL;DR: This study evaluates the effectiveness of a bidirectional long-short-term memory (BLSTM) model in assessing table tennis forehand strokes using sensory data from 16 players, outperforming LSTM approaches with optimal settings selected via an enhanced dragonfly algorithm.

...read moreread less

5

References

•Proceedings Article•10.1109/CVPR.2016.90

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

- 27 Jun 2016

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.

...read moreread less

198.7K

•Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

Karen Simonyan, +1 more

- 04 Sep 2014

TL;DR: This work investigates the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting using an architecture with very small convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers.

...read moreread less

102.6K

•Proceedings Article•10.1109/CVPR.2015.7298594

Going deeper with convolutions

Christian Szegedy, +8 more

- 07 Jun 2015

TL;DR: Inception as mentioned in this paper is a deep convolutional neural network architecture that achieves the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).

...read moreread less

56.6K

•Proceedings Article•10.1109/CVPR.2017.243

Densely Connected Convolutional Networks

Gao Huang, +3 more

- 21 Jul 2017

TL;DR: DenseNet as mentioned in this paper proposes to connect each layer to every other layer in a feed-forward fashion, which can alleviate the vanishing gradient problem, strengthen feature propagation, encourage feature reuse, and substantially reduce the number of parameters.

...read moreread less

46.1K

•Proceedings Article

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

Sergey Ioffe, +1 more

- 06 Jul 2015

TL;DR: Applied to a state-of-the-art image classification model, Batch Normalization achieves the same accuracy with 14 times fewer training steps, and beats the original model by a significant margin.

...read moreread less

43.7K

...

Expand

Rep-Tracker: a lightweight tiny ball tracking method using structure re-parameterization

Chat with Paper

AI Agents for this Paper

Citations

Evaluation of Tennis Teaching Effect UsingOptimized DL Model with Cloud ComputingSystem

References

Deep Residual Learning for Image Recognition

Very Deep Convolutional Networks for Large-Scale Image Recognition

Going deeper with convolutions

Densely Connected Convolutional Networks

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

Related Papers (5)

Deep 2nd-order residual block for image denoising

Improved Tracking of Multiple Humans with Trajectory Predcition and Occlusion Modeling

WLA-Net: A Whole New Light-weight Architecture For Visual Task

Deep Learning-Based Colon Cancer Tumor Prediction Using Histopathological Images

LPRNet: Lightweight Deep Network by Low-rank Pointwise Residual Convolution