Hits:
Affiliation of Author(s):自动化学院
Title of Paper:Human action recognition with 3D convolution skip-connections and RNNs
Journal:Lect. Notes Comput. Sci.
Abstract:This paper proposes a novel network architecture for human action recognition. First, we employ a pre-trained spatio-temporal feature extractor to perform spatio-temporal features extraction on videos. Then, several-level spatio-temporal features are concatenated by 3D convolution skip-connections. Moreover, a batch normalization layer is applied to normalize the concatenated features. Subsequently, we feed these normalized features into a RNN architecture to model temporal dependencies, which enables our network to deal with long-term information. In addition, we divide each video into three parts in which each part is split into non-overlapping 16-frame clips to achieve data augmentation. Finally, the proposed method is evaluated on UCF101 Dataset and is compared with existing excellent methods. Experimental results demonstrate that our method achieves the highest recognition accuracy. © 2018, Springer Nature Switzerland AG.
ISSN No.:0302-9743
Translation or Not:no
Date of Publication:2018-01-01
Co-author:Song, Jiarong,Zhang, Qiuyan,Fang, Ting,Hu, Guoxiong,Han, Jiaming,Chen, Cong
Correspondence Author:Song, Jiarong,Yang Zhong