Action Recognition
3d convolutional neural networks for human action recognition
Two-stream convolutional networks for action recognition in videos
Long-term Recurrent Convolutional Networks for Visual Recognition and Description
Finding action tubes
- intro: “built action models from shape and motion cues. They start from the image proposals and select the motion salient subset of them and extract saptio-temporal features to represent the video using the CNNs.”
- arxiv: http://arxiv.org/abs/1411.6031
Action recognition with trajectory-pooled deepconvolutional descriptors
Action Recognition by Hierarchical Mid-level Action Elements
Contextual Action Recognition with R*CNN
Action Recognition using Visual Attention(LSTM/RNN)
- arXiv: http://arxiv.org/abs/1511.04119
- project page: http://shikharsharma.com/projects/action-recognition-attention/
- github(Python/Theano): https://github.com/kracwarlock/action-recognition-visual-attention
Multi-velocity neural networks for gesture recognition in videos
ActivityNet: A Large-Scale Video Benchmark for Human Activity Understanding
- homepage: http://activity-net.org/