Two-Stream Gated Fusion ConvNets for Action Recognition
Zhu, Jiagang1,2; Zou, Wei2; Zhu, Zheng1,2
2018-11
会议日期20-24 Aug. 2018
会议地点Beijing, China
英文摘要

The two-stream ConvNets in action recognition always fuse the two streams' predictions by the weighted averaging scheme. This fusion way with fixed weights lacks of pertinence to different action videos and always needs trial and error on the validation set. In order to enhance the adaptability of two-stream ConvNets, an end-to-end trainable gated fusion method, namely gating ConvNet, is proposed in this paper based on the MoE (Mixture of Experts) theory. The gating ConvNet takes the combination of convolutional layers of the spatial and temporal nets as input and outputs two fusion weights. To reduce the over-fitting of gating ConvNet caused by the redundancy of parameters, a new multi-task learning method is designed, which jointly learns the gating fusion weights for the two streams and learns the gating ConvNet for action classification. With the proposed gated fusion method and multi-task learning approach, competitive performance is achieved on the video action dataset UCF101.

内容类型会议论文
源URL[http://ir.ia.ac.cn/handle/173211/39108]  
专题精密感知与控制研究中心_精密感知与控制
通讯作者Zou, Wei
作者单位1.Univ Chinese Acad Sci, Beijing 100190, Peoples R China
2.Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China
推荐引用方式
GB/T 7714
Zhu, Jiagang,Zou, Wei,Zhu, Zheng. Two-Stream Gated Fusion ConvNets for Action Recognition[C]. 见:. Beijing, China. 20-24 Aug. 2018.
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace