1. What are the contributions in "Hon4d: histogram of oriented 4d normals for activity recognition from depth sequences" ?
The authors present a new descriptor for activity recognition from videos acquired by a depth sensor.. In contrast, the authors describe the depth sequence using a histogram capturing the distribution of the surface normal orientation in the 4D space of time, depth, and spatial coordinates.. To build the histogram, the authors create 4D projectors, which quantize the 4D space and represent the possible directions for the 4D normal.. The authors initialize the projectors using the vertices of a regular polychoron.. Consequently, the authors refine the projectors using a discriminative density measure, such that additional projectors are induced in the directions where the 4D normals are more dense and discriminative.. Through extensive experiments, the authors demonstrate that their descriptor better captures the joint shape-motion cues in the depth sequence, and thus outperforms the state-of-the-art on all relevant benchmarks.
read more
![Figure 1. Surface normals overlayed on three examples from MSR Actions 3D dataset [12]. The surface normals capture the shape cues at a specific time instance, while the change in the surface normal over time captures the motion cues. In this paper, we use 4D normals computed in the space of depth, time, and spatial coordinates in order to obtain rich descriptors of activities. Note that in the figure we illustrate 3D surface normals since it is difficult to visualize the 4D normals used in the paper.](/figures/figure-1-surface-normals-overlayed-on-three-examples-from-2y1g19r3.png)

![Figure 4. Example frames from different actions obtained from MSR Action 3D dataset [12], MSR Hand Gesture dataset [23], and MSR Daily Activity 3D [24].](/figures/figure-4-example-frames-from-different-actions-obtained-from-26e89qc8.png)


![Figure 5. The confusion tables for 3D Action Pairs dataset. Top: Pair-wise skeleton features and LOP features from [24] without temporal pyramid (left), and with pyramid (right). Bottom: HON4D features as is (left), and after refining the projectors using the discriminative density (right).](/figures/figure-5-the-confusion-tables-for-3d-action-pairs-dataset-2td4ah5w.png)