Journal Article10.1007/S11042-019-7356-3
Action recognition from depth sequence using depth motion maps-based local ternary patterns and CNN
22
TL;DR: This paper presents a method for human action recognition from depth sequences captured by the depth camera via convolutional neural network (CNN) based approach and introduces Local Ternary Pattern as an image filter for DMMs to improve the distinguishability of similar actions.
read more
Abstract: This paper presents a method for human action recognition from depth sequences captured by the depth camera. The main idea of the method is the action mapping image classification via convolutional neural network (CNN) based approach. Firstly, we project the raw frames onto three orthogonal Cartesian planes and stack the results into three still images (corresponding to the front, side, and top views) to form the Depth Motion Maps (DMMs). Secondly, Local Ternary Pattern (LTP) is introduced as an image filter for DMMs, thus to improve the distinguishability of similar actions. Finally, we apply CNN to action recognition by classifying corresponding LTP-encoded images. Experimental results on the popular and challenging benchmark MSR-Action 3D and MSR-Gesture dataset show the effectiveness of the presented method and meet real-time action recognition task requirements.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Unsupervised pre-trained filter learning approach for efficient convolution neural network
Sadaqat Ur Rehman,Shanshan Tu,Muhammad Waqas,Yongfeng Huang,Obaid Ur Rehman,Basharat Ahmad,Salman Ahmad +6 more
TL;DR: A comprehensive survey of the relationship between ConvNet with different pre-trained learning methodologies and its optimization effects and the experimental results on the benchmark dataset highlight the merit of efficient pre- trained learning algorithms for optimized ConvNet.
253
Human motion recognition based on SVM in VR art media interaction environment
TL;DR: The test results show that compared with other classification recognition algorithms, the proposed method has a good classification effect on multiple performance indicators of human motion recognition and has higher recognition accuracy and better robustness.
CNN-based single object detection and tracking in videos and its application to drone detection
TL;DR: Three simple yet effective convolutional neural network (CNN) based single object detection and tracking algorithms for videos are proposed to efficiently combine a state-of-the-art object detector and visual tracker for circumstances in which only a few static images of the target are available for training.
52
Literature review of vision‐based dynamic gesture recognition using deep learning techniques
TL;DR: This study provides the precise categorization of DL algorithms and considers around 15 gesture datasets on which these techniques have been applied and provides a brief overview of the numerous challenging dataset available among the research community and insight into various challenges and limitations of a DL algorithm in vision‐based dynamic gesture recognition.
16
References
Deep Residual Learning for Image Recognition
Kaiming He,Xiangyu Zhang,Shaoqing Ren,Jian Sun +3 more
- 27 Jun 2016
TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.
ImageNet classification with deep convolutional neural networks
TL;DR: A large, deep convolutional neural network was trained to classify the 1.2 million high-resolution images in the ImageNet LSVRC-2010 contest into the 1000 different classes and employed a recently developed regularization method called "dropout" that proved to be very effective.
•Proceedings Article
ImageNet Classification with Deep Convolutional Neural Networks
Alex Krizhevsky,Ilya Sutskever,Geoffrey E. Hinton +2 more
- 03 Dec 2012
TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.
•Proceedings Article
Very Deep Convolutional Networks for Large-Scale Image Recognition
Karen Simonyan,Andrew Zisserman +1 more
- 01 Jan 2015
TL;DR: In this paper, the authors investigated the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting and showed that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 layers.
51.9K
Enhanced Local Texture Feature Sets for Face Recognition Under Difficult Lighting Conditions
Xiaoyang Tan,Bill Triggs +1 more
TL;DR: This work presents a simple and efficient preprocessing chain that eliminates most of the effects of changing illumination while still preserving the essential appearance details that are needed for recognition, and improves robustness by adding Kernel principal component analysis (PCA) feature extraction and incorporating rich local appearance cues from two complementary sources.