Born a BabyNet with Hierarchical Parental Supervision for End-to-End Text Image Machine Translation
Ma, Cong [1,2]; Zhang, Yaping [1,2]; Zhang, Zhiyang [1,2]; Liang, Yupu [1,2]; Zhao, Yang [1,2]; Zhou, Yu [2,3]; Zong, Chengqing [1,2]
2024-05
Conference date: 20-25 May, 2024
Conference location: Torino, Italia
Abstract

Text image machine translation (TIMT) aims to translate source-language text in images into a target language, and has proven successful when a text image recognition encoder is bridged with a text translation decoder. However, it remains an open question how to incorporate fine-grained knowledge supervision so that the recognition and translation modules stay consistent. In this paper, we propose a novel TIMT method named BabyNet, which is optimized with hierarchical parental supervision to improve translation performance. Inspired by genetic recombination and variation in the field of genetics, the proposed BabyNet inherits from the recognition and translation parent models and adds a variation module whose parameters can be updated when training on the TIMT task. Meanwhile, hierarchical and multi-granularity supervision from the parent models is introduced to bridge the gap between the inherited modules in BabyNet. Extensive experiments on both synthetic and real-world TIMT test sets show that our proposed method significantly outperforms existing methods. Further analyses of various parent model combinations show that our method generalizes well.

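Conference proceedings: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)
Content type: Conference paper
Source URL: [http://ir.ia.ac.cn/handle/173211/57632]
Collection: National Laboratory of Pattern Recognition_Natural Language Processing
Corresponding author: Zhang, Yaping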
Author affiliations:
1. School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing 100049, P.R. China
2. State Key Laboratory of Multimodal Artificial Intelligence Systems (MAIS), Institute of Automation, Chinese Academy of Sciences, Beijing, China
3. Fanyu AI Laboratory, Zhongke Fanyu Technology Co., Ltd, Beijing 100190, P.R. China
Recommended citation
GB/T 7714
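Ma, Cong,Zhang, Yaping,Zhang, Zhiyang,et al. Born a BabyNet with Hierarchical Parental Supervision for End-to-End Text Image Machine Translation[C]. In: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024). Torino, Italia. 20-25 May, 2024.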