Action Recognition

Published: 09 Oct 2015 Category: deep_learning

3D convolutional neural networks for human action recognition

Two-stream convolutional networks for action recognition in videos

Long-term Recurrent Convolutional Networks for Visual Recognition and Description

Finding action tubes

  • intro: “built action models from shape and motion cues. They start from the image proposals and select the motion salient subset of them and extract spatio-temporal features to represent the video using the CNNs.”
  • arxiv: http://arxiv.org/abs/1411.6031
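The proposal-filtering step described in the intro above can be illustrated with a minimal sketch. It assumes that motion saliency is scored as the mean optical-flow magnitude inside each box, which is an assumption made here for illustration. The function name `select_motion_salient`, the box format, and the threshold value are all hypothetical and are not taken from the paper's code.

```python
import numpy as np

def select_motion_salient(boxes, flow_mag, threshold):
    """Keep boxes whose mean flow magnitude is at least `threshold`.

    boxes: iterable of (x1, y1, x2, y2) integer pixel coordinates,
           with x2 and y2 exclusive (hypothetical format).
    flow_mag: 2D array of shape (H, W) holding per-pixel optical-flow magnitudes.
    """
    kept = []
    for (x1, y1, x2, y2) in boxes:
        region = flow_mag[y1:y2, x1:x2]
        # Skip empty boxes, then compare the average motion to the threshold.
        if region.size and region.mean() >= threshold:
            kept.append((x1, y1, x2, y2))
    return kept

# Toy example: a 6x6 flow-magnitude map with motion only in the top-left 3x3 corner.
flow_mag = np.zeros((6, 6))
flow_mag[:3, :3] = 2.0

boxes = [(0, 0, 3, 3), (3, 3, 6, 6), (0, 0, 6, 6)]
salient = select_motion_salient(boxes, flow_mag, threshold=1.0)
print(salient)
```

In this toy case only the first box survives: the second contains no motion, and the third averages the moving corner over the whole frame and falls below the threshold. The surviving boxes would then be passed to the CNN feature extractors.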

Action recognition with trajectory-pooled deep-convolutional descriptors

Action Recognition by Hierarchical Mid-level Action Elements

Contextual Action Recognition with R*CNN

Action Recognition using Visual Attention (LSTM/RNN)

Multi-velocity neural networks for gesture recognition in videos

ActivityNet: A Large-Scale Video Benchmark for Human Activity Understanding