Motion Stimulation for Compositional Action Recognition

doi:10.1109/tcsvt.2022.3222305

Journal Article10.1109/tcsvt.2022.3222305

Motion Stimulation for Compositional Action Recognition

01 May 2023

- IEEE Transactions on Circuits and System...

- Vol. 33, Iss: 5, pp 2061-2074

28

TL;DR: Wang et al. as mentioned in this paper proposed a Motion Stimulation (MS) block, which is specifically designed to mine dynamic clues of the local regions autonomously from adjacent frames, which can be directly and conveniently integrated into existing video backbones to enhance the ability of compositional generalization for action recognition algorithms.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.1109/tcsvt.2023.3243205

Discriminative and Robust Attribute Alignment for Zero-Shot Learning

De Cheng, +5 more

- 01 Jan 2023

- IEEE Transactions on Circuits and System...

TL;DR: Zhang et al. as discussed by the authors proposed to improve the discriminative power of the learned visual features by contrastive embedding, which exploits both the class-wise and instance-wise supervision for GZSL, under the attribute guided weakly supervised representation learning framework.

...read moreread less

24

Journal Article•10.1016/j.eswa.2023.120145

Multi-objective reinforcement learning approach for trip recommendation

Lei Chen, +3 more

- 01 Apr 2023

- Expert Systems With Applications

24

Journal Article•10.1016/j.compag.2023.107923

Picking point recognition for ripe tomatoes using semantic segmentation and morphological processing

Qianjie Rong, +3 more

- 01 Jul 2023

- Computers and Electronics in Agriculture

TL;DR: In this paper , a semantic segmentation model with the improved Swin Transformer V2 and a picking point recognition algorithm based on the connection of tomato fruit, calyx and stem are proposed for the problem of picking point detection of ripe tomatoes in complex environments.

...read moreread less

19

•Journal Article•10.3390/rs15051249

Adaptive Slicing-Aided Hyper Inference for Small Object Detection in High-Resolution Remote Sensing Images

Hao Zhang, +4 more

- 24 Feb 2023

- Remote sensing

TL;DR: Adaptive Slicing Aided Hyper Inference (ASHI) as discussed by the authors adaptively adjusts the slicing size to control the number of slices according to the image resolution, which can dramatically reduce redundant computation using an adaptive slicing size.

...read moreread less

14

Journal Article•10.1109/tcsvt.2023.3246475

Hierarchical Coupled Discriminative Dictionary Learning for Zero-shot Learning

Shuang Li, +4 more

- 01 Jan 2023

- IEEE Transactions on Circuits and System...

TL;DR: Zhang et al. as mentioned in this paper proposed hierarchical coupled discriminative dictionary learning (HCDDL) method to hierarchically establish visual-semantic embedding at class-level and image-level with a coarse-to-fine way.

...read moreread less

10

...

Expand

References

•Proceedings Article•10.1109/CVPR.2016.90

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

- 27 Jun 2016

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.

...read moreread less

198.7K

Proceedings Article•10.1109/CVPR.2009.5206848

ImageNet: A large-scale hierarchical image database

Jia Deng, +5 more

- 20 Jun 2009

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.

...read moreread less

75.9K

Proceedings Article•10.1109/ICCV.2017.322

Mask R-CNN

Kaiming He, +3 more

- 20 Mar 2017

TL;DR: This work presents a conceptually simple, flexible, and general framework for object instance segmentation, which extends Faster R-CNN by adding a branch for predicting an object mask in parallel with the existing branch for bounding box recognition.

...read moreread less

23.6K

•Posted Content

Squeeze-and-Excitation Networks

Jie Hu, +4 more

- 05 Sep 2017

- arXiv: Computer Vision and Pattern Recog...

TL;DR: Squeeze-and-excitation (SE) as mentioned in this paper adaptively recalibrates channel-wise feature responses by explicitly modeling interdependencies between channels, which can be stacked together to form SENet architectures.

...read moreread less

18.9K

•Proceedings Article•10.1109/CVPR.2018.00813

Non-local Neural Networks

Xiaolong Wang, +3 more

- 18 Jun 2018

TL;DR: In this article, the non-local operation computes the response at a position as a weighted sum of the features at all positions, which can be used to capture long-range dependencies.

...read moreread less

12.6K