Error Feedback Based Lexical Entity Extraction for Chinese Language Modeling
Liu, Yi ; Hua, Jing ; Li, Xiangang ; Wu, Xihong
2013
Keywords: Chinese language modeling; lexical entity extraction; lexical entity selection; error feedback; phoneme-to-grapheme conversion; WORD EXTRACTION VARIETY
Abstract: Chinese, unlike Western languages, has no standard definition of a word. Choosing a suitable lexicon therefore plays an important role in Chinese language modeling. This paper proposes a novel method for constructing the lexicon automatically. Rather than relying on statistical measures of text features, the method is driven directly by error feedback from the target task, which in this paper is phoneme-to-grapheme conversion. The process consists of two iterative phases: selection of individual words from a large manual lexicon (Phase One), and further extraction of compound words based on the Phase One result (Phase Two). Experiments on phoneme-to-grapheme conversion show that the method achieves absolute reductions in character error rate of 1.09% for Phase One and 0.38% for Phase Two, compared with baseline lexicons of the same size generated by the conventional word-frequency method.
Computer Science, Artificial Intelligence; Engineering, Electrical & Electronic; CPCI-S(ISTP); 0
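Language: English
Content type: Other
Source URL: http://ir.pku.edu.cn/handle/20.500.11897/292485
Collection: School of Electronics Engineering and Computer Science, Peking University (信息科学技术学院)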
Recommended citation (GB/T 7714):
Liu, Yi, Hua, Jing, Li, Xiangang, et al. Error Feedback Based Lexical Entity Extraction for Chinese Language Modeling. 2013-01-01.