Open AccessProceedings Article
Human action recognition using a temporal hierarchy of covariance descriptors on 3D joint locations
Mohamed E. Hussein,Marwan Torki,Mohammad Gowayyed,Motaz El-Saban +3 more
- 03 Aug 2013
- pp 2466-2472
652
TL;DR: A novel approach to human action recognition from 3D skeleton sequences extracted from depth data that uses the covariance matrix for skeleton joint locations over time as a discriminative descriptor for a sequence to encode the relationship between joint movement and time.
read more
Abstract: Human action recognition from videos is a challenging machine vision task with multiple important application domains, such as human-robot/machine interaction, interactive entertainment, multimedia information retrieval, and surveillance. In this paper, we present a novel approach to human action recognition from 3D skeleton sequences extracted from depth data. We use the covariance matrix for skeleton joint locations over time as a discriminative descriptor for a sequence. To encode the relationship between joint movement and time, we deploy multiple covariance matrices over sub-sequences in a hierarchical fashion. The descriptor has a fixed length that is independent from the length of the described sequence. Our experiments show that using the covariance descriptor with an off-the-shelf classification algorithm outperforms the state of the art in action recognition on multiple datasets, captured either via a Kinect-type sensor or a sophisticated motion capture system. We also include an evaluation on a novel large dataset using our own annotation.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
DG-STGCN: Dynamic Spatial-Temporal Modeling for Skeleton-based Action Recognition
TL;DR: This work proposes a new framework for skeleton-based action recognition, namely Dynamic Group Spatio-Temporal GCN (DG-STGCN), which consists of two modules, DG-GCN and DG-TCN, respectively, for spatial and temporal modeling.
3D PostureNet: A unified framework for skeleton-based posture recognition
TL;DR: An end-to-end framework based on 3D CNN, called 3D PostureNet, is developed for robust posture recognition, and achieves significantly superior performance on both skeleton-based human posture and hand posture recognition tasks.
23
Human action recognition using multi-layer codebooks of key poses and atomic motions
TL;DR: A pattern-matching method is proposed and integrated with traditional classifiers for human action recognition that can obtain a comparable or better performance compared with the state-of-the-art methods.
23
Unsupervised activity recognition using latent semantic analysis on a mobile robot
Paul Duckworth,Muhannad Al-Omari,Yiannis Gatsoulis,David C. Hogg,Anthony G. Cohn +4 more
- 29 Aug 2016
TL;DR: It is shown that the abstraction into a qualitative space helps the robot to generalise and compare multiple noisy and partial observations in a real world dataset and that a vocabulary of latent activity classes (expressed using qualitative features) can be recovered.
23
•Posted Content
Focusing and Diffusion: Bidirectional Attentive Graph Convolutional Networks for Skeleton-based Action Recognition
TL;DR: A focusing and diffusion mechanism to enhance graph convolutional networks by paying attention to the kinematic dependence of articulated human pose in a frame and their implicit dependencies over frames that can facilitate skeleton-based action recognition is proposed.
23
References
LIBSVM: A library for support vector machines
Chih-Chung Chang,Chih-Jen Lin +1 more
TL;DR: Issues such as solving SVM optimization problems theoretical convergence multiclass classification probability estimates and parameter selection are discussed in detail.
Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories
Svetlana Lazebnik,Cordelia Schmid,Jean Ponce +2 more
- 17 Jun 2006
TL;DR: This paper presents a method for recognizing scene categories based on approximate global geometric correspondence that exceeds the state of the art on the Caltech-101 database and achieves high accuracy on a large database of fifteen natural scene categories.
Real-time human pose recognition in parts from single depth images
Jamie Shotton,Andrew Fitzgibbon,Mat Cook,Toby Sharp,Mark J. Finocchio,Richard E. Moore,Alex Aben-Athar Kipman,Andrew Blake +7 more
- 20 Jun 2011
TL;DR: This work takes an object recognition approach, designing an intermediate body parts representation that maps the difficult pose estimation problem into a simpler per-pixel classification problem, and generates confidence-scored 3D proposals of several body joints by reprojecting the classification result and finding local modes.
Real-time human pose recognition in parts from single depth images
Jamie Shotton,Toby Sharp,Alex Aben-Athar Kipman,Andrew Fitzgibbon,Mark J. Finocchio,Andrew Blake,Mat Cook,Richard Moore +7 more
TL;DR: This work takes an object recognition approach, designing an intermediate body parts representation that maps the difficult pose estimation problem into a simpler per-pixel classification problem, and generates confidence-scored 3D proposals of several body joints by reprojecting the classification result and finding local modes.
The recognition of human movement using temporal templates
Aaron F. Bobick,James W. Davis +1 more
TL;DR: A view-based approach to the representation and recognition of human movement is presented, and a recognition method matching temporal templates against stored instances of views of known actions is developed.
Related Papers (5)
Yong Du,Wei Wang,Liang Wang +2 more
- 07 Jun 2015
Wanqing Li,Zhengyou Zhang,Zicheng Liu +2 more
- 13 Jun 2010