1. What contributions have the authors mentioned in the paper "Dense trajectories and motion boundary descriptors for action recognition" ?
This paper introduces a video representation based on dense trajectories and motion boundary descriptors.. As descriptors the authors extract features aligned with the trajectories to characterize shape ( point coordinates ), appearance ( histograms of oriented gradients ) and motion ( histograms of optical flow ).. Additionally, the authors introduce a descriptor based on motion boundary histograms ( MBH ) which rely on differential optical flow.. The authors evaluate their video representation in the context of action classification on nine datasets, namely KTH, YouTube, Hollywood2, UCF sports, IXMAS, UIUC, Olympic Sports, UCF50 and HMDB51.. On all datasets their approach outperforms current state-of-the-art results.
read more
2. What are the 11 action categories in the YouTube dataset?
The YouTube dataset6 (Liu et al, 2009) contains 11 action categories: basketball shooting, biking/cycling, diving, golf swinging, horse back riding, soccer juggling, swinging, tennis swinging, trampoline jumping, volleyball spiking, and walking with a dog.
read more
3. How does the algorithm perform on large video datasets?
Developing better optical flow algorithms suitable for large realistic video datasets is important to improve the performance of current action recognition systems.
read more
4. What are the default parameters for their experiments?
The default parameters for their experiments are N = 32, nσ = 2, nτ = 3, which showed to give best performance when cross validating on the training set of Hollywood2.
read more





