Open AccessProceedings Article
Human action recognition using a temporal hierarchy of covariance descriptors on 3D joint locations
Mohamed E. Hussein,Marwan Torki,Mohammad Gowayyed,Motaz El-Saban +3 more
- 03 Aug 2013
- pp 2466-2472
652
TL;DR: A novel approach to human action recognition from 3D skeleton sequences extracted from depth data that uses the covariance matrix for skeleton joint locations over time as a discriminative descriptor for a sequence to encode the relationship between joint movement and time.
read more
Abstract: Human action recognition from videos is a challenging machine vision task with multiple important application domains, such as human-robot/machine interaction, interactive entertainment, multimedia information retrieval, and surveillance. In this paper, we present a novel approach to human action recognition from 3D skeleton sequences extracted from depth data. We use the covariance matrix for skeleton joint locations over time as a discriminative descriptor for a sequence. To encode the relationship between joint movement and time, we deploy multiple covariance matrices over sub-sequences in a hierarchical fashion. The descriptor has a fixed length that is independent from the length of the described sequence. Our experiments show that using the covariance descriptor with an off-the-shelf classification algorithm outperforms the state of the art in action recognition on multiple datasets, captured either via a Kinect-type sensor or a sophisticated motion capture system. We also include an evaluation on a novel large dataset using our own annotation.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Visual phrase recognition by modeling 3D spatial context of multiple objects
Lin Bai,Qingfeng Chen +1 more
TL;DR: Experiments conducted on several datasets show that the proposed method outperforms the state-of-the-art models in modeling 3D spatial geometric structure as well as recognizing visual phrase and demonstrates that modeling3D spatial configuration between objects can significantly improve the deeper image understanding.
9
Human Activity Recognition from automatically labeled data in RGB-D videos
David Jardim,Luís Nunes,Miguel Sales Dias +2 more
- 01 Sep 2016
TL;DR: This paper aims to compare the performance of several supervised classifier trained with manually labeled data versus the same classifiers trained with data automatically labeled, and proposes a framework capable of recognizing human actions using supervised classifierstrained with automatically labeled data.
Trucker Behavior Security Surveillance Based on Human Parsing
TL;DR: This work proposes a driver behavior recognition method based on human parsing which can quickly and accurately recognize the driver’s unsafe behavior and builds a trucker behavior dataset with 700 videos and improves a human body segmentation model with higher precision.
Adaptive most joint selection and covariance descriptions for a robust skeleton-based human action recognition
Van-Toi Nguyen,Tien-Nam Nguyen,Tien-Nam Nguyen,Thi-Lan Le,Dinh-Tan Pham,Dinh-Tan Pham,Hai Vu +6 more
TL;DR: The proposed method takes advantage of the skeleton data thanks to their robustness to human appearance change as well as the real-time performance, and proposes two schemes to select the most informative joints in terms of 3-D skeleton-based activity representation.
9
•Posted Content
Locality preserving projection on SPD matrix Lie group: algorithm and analysis
Yangyang Li,Ruqian Lu +1 more
TL;DR: Li et al. as discussed by the authors proposed a Lie-LPP approach to transform high-dimensional symmetric positive definite (SPD) matrices into low-dimensional SPD matrices based on the Lie group structure.
9
References
LIBSVM: A library for support vector machines
Chih-Chung Chang,Chih-Jen Lin +1 more
TL;DR: Issues such as solving SVM optimization problems theoretical convergence multiclass classification probability estimates and parameter selection are discussed in detail.
Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories
Svetlana Lazebnik,Cordelia Schmid,Jean Ponce +2 more
- 17 Jun 2006
TL;DR: This paper presents a method for recognizing scene categories based on approximate global geometric correspondence that exceeds the state of the art on the Caltech-101 database and achieves high accuracy on a large database of fifteen natural scene categories.
Real-time human pose recognition in parts from single depth images
Jamie Shotton,Andrew Fitzgibbon,Mat Cook,Toby Sharp,Mark J. Finocchio,Richard E. Moore,Alex Aben-Athar Kipman,Andrew Blake +7 more
- 20 Jun 2011
TL;DR: This work takes an object recognition approach, designing an intermediate body parts representation that maps the difficult pose estimation problem into a simpler per-pixel classification problem, and generates confidence-scored 3D proposals of several body joints by reprojecting the classification result and finding local modes.
Real-time human pose recognition in parts from single depth images
Jamie Shotton,Toby Sharp,Alex Aben-Athar Kipman,Andrew Fitzgibbon,Mark J. Finocchio,Andrew Blake,Mat Cook,Richard Moore +7 more
TL;DR: This work takes an object recognition approach, designing an intermediate body parts representation that maps the difficult pose estimation problem into a simpler per-pixel classification problem, and generates confidence-scored 3D proposals of several body joints by reprojecting the classification result and finding local modes.
The recognition of human movement using temporal templates
Aaron F. Bobick,James W. Davis +1 more
TL;DR: A view-based approach to the representation and recognition of human movement is presented, and a recognition method matching temporal templates against stored instances of views of known actions is developed.
Related Papers (5)
Yong Du,Wei Wang,Liang Wang +2 more
- 07 Jun 2015
Wanqing Li,Zhengyou Zhang,Zicheng Liu +2 more
- 13 Jun 2010