A unified approach for effectively integrating source-side syntactic reordering rules into phrase-based translation | |
Zhang, Jiajun; Zong, Chengqing | |
刊名 | LANGUAGE RESOURCES AND EVALUATION |
2013-06-01 | |
卷号 | 47期号:2页码:449-474 |
关键词 | Handcrafted syntactic rules Probabilistic syntactic rules Effective integration Phrase-based translation |
英文摘要 | Phrase-based translation models, with sequences of words (phrases) as translation units, achieve state-of-the-art translation performance. However, phrase reordering is a major challenge for this model. Recently, researchers have focused on utilizing syntax to improve phrase reordering. In adding syntactic knowledge into phrase reordering model, using handcrafted or probabilistic syntactic rules to reorder the source-language approximating the target-language word order has been successful in improving translation quality. However, it suffers from propagating the pre-ordering errors to the later translation step (e.g. decoding). In this paper, we propose a novel framework to uniformly represent the handcrafted and probabilistic syntactic rules and integrate them more effectively into phrase-based translation. In the translation phase, for a source sentence to be translated, handcrafted or probabilistic syntactic rules are first acquired from the source parse tree prior to translation, and then instead of reordering the source sentence directly, we input these rules into the decoder and design a new algorithm to apply these rules during decoding. In order to attach more importance to the syntactic rules and distinguish reordering between syntactic and non-syntactic unit reordering, we propose to design respectively a syntactic reordering model and a non-syntactic reordering model. The syntactic rules will guide phrase reordering in decoding within the syntactic reordering model. Extensive experiments on Chinese-to-English translation show that our approach, whether incorporating handcrafted or probabilistic syntactic rules, significantly outperforms the previous methods. |
WOS标题词 | Science & Technology ; Technology |
类目[WOS] | Computer Science, Interdisciplinary Applications |
研究领域[WOS] | Computer Science |
收录类别 | SCI ; AHCI |
语种 | 英语 |
WOS记录号 | WOS:000319776500007 |
内容类型 | 期刊论文 |
源URL | [http://ir.ia.ac.cn/handle/173211/4126] |
专题 | 自动化研究所_模式识别国家重点实验室_自然语言处理团队 |
作者单位 | Chinese Acad Sci, Natl Lab Pattern Recognit, Inst Automat, Beijing 100190, Peoples R China |
推荐引用方式 GB/T 7714 | Zhang, Jiajun,Zong, Chengqing. A unified approach for effectively integrating source-side syntactic reordering rules into phrase-based translation[J]. LANGUAGE RESOURCES AND EVALUATION,2013,47(2):449-474. |
APA | Zhang, Jiajun,&Zong, Chengqing.(2013).A unified approach for effectively integrating source-side syntactic reordering rules into phrase-based translation.LANGUAGE RESOURCES AND EVALUATION,47(2),449-474. |
MLA | Zhang, Jiajun,et al."A unified approach for effectively integrating source-side syntactic reordering rules into phrase-based translation".LANGUAGE RESOURCES AND EVALUATION 47.2(2013):449-474. |
个性服务 |
查看访问统计 |
相关权益政策 |
暂无数据 |
收藏/分享 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论