CORC  > 自动化研究所  > 中国科学院自动化研究所
CLOSE: Coupled content-semantic embedding
Ren, Junhong; Zhang, Wensheng
刊名SIGNAL IMAGE AND VIDEO PROCESSING
2019-09-01
卷号13期号:6页码:1087-1095
关键词Video captioning Coupled content-semantic embedding Multi-content embedding
ISSN号1863-1703
DOI10.1007/s11760-019-01449-w
通讯作者Ren, Junhong(junhong.ren@ia.ac.cn)
英文摘要This paper proposes a novel coupled content semantic embedding (CLOSE) method with its application to video captioning. The motivation behind this design is to seek a consistent latent space between the content-semantic pair, in which the pair with same attribute is close to each other. Under the framework constructed on content-semantic embedding, CLOSE first learns two independent and reversible content-content and semantic-semantic embeddings, respectively, and then aggregates the two items via a coupled content-semantic embedding. Benefitting from the reversible property, our CLOSE can be pretrained with quantities of unlabeled data. In addition, casting on the work setting of feature embedding, a paradigm named multi-content embedding (MCE) is developed to describe the multi-focus information. Typically, MCE is capable of learning a feature embedding that can capture multiple discriminative contents. Extensive experiments compared with state-of-the-art methods on benchmark datasets, i.e., MSVD and MSR-VTT, demonstrate the effectiveness and superiority of the proposed CLOSE.
资助项目National Natural Science Foundation of China[61403376]
WOS研究方向Engineering ; Imaging Science & Photographic Technology
语种英语
出版者SPRINGER LONDON LTD
WOS记录号WOS:000481886600006
资助机构National Natural Science Foundation of China
内容类型期刊论文
源URL[http://ir.ia.ac.cn/handle/173211/27526]  
专题中国科学院自动化研究所
通讯作者Ren, Junhong
作者单位Chinese Acad Sci, Inst Automat, Beijing, Peoples R China
推荐引用方式
GB/T 7714
Ren, Junhong,Zhang, Wensheng. CLOSE: Coupled content-semantic embedding[J]. SIGNAL IMAGE AND VIDEO PROCESSING,2019,13(6):1087-1095.
APA Ren, Junhong,&Zhang, Wensheng.(2019).CLOSE: Coupled content-semantic embedding.SIGNAL IMAGE AND VIDEO PROCESSING,13(6),1087-1095.
MLA Ren, Junhong,et al."CLOSE: Coupled content-semantic embedding".SIGNAL IMAGE AND VIDEO PROCESSING 13.6(2019):1087-1095.
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace