Associative Multichannel Autoencoder for Multimodal Word Representation
Shaonan Wang 1,3; Jiajun Zhang 1,3; Chengqing Zong 1,2,3
2018-10
Conference Date: 2018.10
Conference Location: Brussels
Abstract

In this paper we address the problem of learning multimodal word representations by integrating textual, visual and auditory inputs. Inspired by the re-constructive and associative nature of human memory, we propose a novel associative multichannel autoencoder (AMA). Our model first learns the associations between textual and perceptual modalities, so as to predict the missing perceptual information of concepts. Then the textual and predicted perceptual representations are fused through reconstructing their original and associated embeddings. Using a gating mechanism, our model assigns different weights to each modality according to the different concepts. Results on six benchmark concept similarity tests show that the proposed method significantly outperforms strong unimodal baselines and state-of-the-art multimodal models.
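The abstract describes three components: association networks that predict missing perceptual vectors from text, per-modality encoders combined through a concept-dependent gate, and decoders that reconstruct the original and associated embeddings. The following is a minimal sketch, assuming a standard PyTorch setup, of how such pieces could fit together; all layer sizes, names, and the specific gating and fusion choices are illustrative assumptions, not the authors' released implementation.

```python
# Minimal sketch of the associative multichannel autoencoder idea (hypothetical
# dimensions and layer choices; not the paper's official code).
import torch
import torch.nn as nn


class AssociativeMultichannelAE(nn.Module):
    def __init__(self, d_text=300, d_vis=128, d_aud=128, d_hid=256, d_fused=300):
        super().__init__()
        # Association networks: predict missing perceptual vectors from text.
        self.text2vis = nn.Sequential(nn.Linear(d_text, d_hid), nn.Tanh(), nn.Linear(d_hid, d_vis))
        self.text2aud = nn.Sequential(nn.Linear(d_text, d_hid), nn.Tanh(), nn.Linear(d_hid, d_aud))
        # Per-modality encoders.
        self.enc_text = nn.Linear(d_text, d_hid)
        self.enc_vis = nn.Linear(d_vis, d_hid)
        self.enc_aud = nn.Linear(d_aud, d_hid)
        # Gate: concept-dependent weights over the three modalities.
        self.gate = nn.Linear(3 * d_hid, 3)
        # Fusion and decoders (reconstruct original / associated embeddings).
        self.fuse = nn.Linear(3 * d_hid, d_fused)
        self.dec_text = nn.Linear(d_fused, d_text)
        self.dec_vis = nn.Linear(d_fused, d_vis)
        self.dec_aud = nn.Linear(d_fused, d_aud)

    def forward(self, text, vis=None, aud=None):
        # Replace missing perceptual inputs with associated (predicted) ones.
        vis = vis if vis is not None else self.text2vis(text)
        aud = aud if aud is not None else self.text2aud(text)
        h = [torch.tanh(self.enc_text(text)),
             torch.tanh(self.enc_vis(vis)),
             torch.tanh(self.enc_aud(aud))]
        g = torch.softmax(self.gate(torch.cat(h, dim=-1)), dim=-1)  # modality weights
        h_gated = torch.cat([g[..., i:i + 1] * h[i] for i in range(3)], dim=-1)
        fused = torch.tanh(self.fuse(h_gated))  # multimodal word representation
        recon = (self.dec_text(fused), self.dec_vis(fused), self.dec_aud(fused))
        return fused, recon


# Usage: reconstruct the available channels; concepts lacking audio fall back
# on the associated (text-predicted) audio vector.
model = AssociativeMultichannelAE()
text, vis = torch.randn(4, 300), torch.randn(4, 128)
fused, (r_text, r_vis, r_aud) = model(text, vis=vis, aud=None)
loss = nn.functional.mse_loss(r_text, text) + nn.functional.mse_loss(r_vis, vis)
```

In this sketch the softmax gate yields per-concept weights for the textual, visual and auditory channels before fusion, mirroring the abstract's statement that the model assigns different weights to each modality depending on the concept.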

Proceedings Publisher: Conference on Empirical Methods in Natural Language Processing (EMNLP)
Language: English
Content Type: Conference Paper
Source URL: [http://ir.ia.ac.cn/handle/173211/40575]
Collection: National Laboratory of Pattern Recognition_Natural Language Processing
Corresponding Author: Shaonan Wang
Author Affiliations:
1. National Laboratory of Pattern Recognition, CASIA, Beijing, China
2. CAS Center for Excellence in Brain Science and Intelligence Technology, Beijing, China
3. University of Chinese Academy of Sciences, Beijing, China
Recommended Citation (GB/T 7714):
Shaonan Wang, Jiajun Zhang, Chengqing Zong. Associative Multichannel Autoencoder for Multimodal Word Representation[C]. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP). Brussels, 2018.10.