Error Feedback Based Lexical Entity Extraction for Chinese Language Modeling | |
Liu, Yi ; Hua, Jing ; Li, Xiangang ; Wu, Xihong | |
2013 | |
关键词 | Chinese language modeling lexical entity extraction lexical entity selection error feedback phoneme-to-grapheme conversion WORD EXTRACTION VARIETY |
英文摘要 | Chinese, which is quite different from western languages, has no standard definition of word. Therefore, choosing suitable lexicon plays an important role in Chinese language modeling. This paper proposes a novel method of constructing the lexicon automatically. Other than depending on statistical measures of text features, this method is directly based on the feedback of errors from the corresponding task, such as phoneme-to-grapheme conversion in this paper. The whole process consists of two iterative phases: selection of individual words from a large manual lexicon and further extraction of compound words based on Phase One. Experiments implemented on phoneme-to-grapheme conversion show that this method can achieve 1.09% and 0.38% absolute reduction in character error rate respectively for Phase One and Phase Two compared with baseline lexicons in the same size generated by the conventional method based on word frequency.; Computer Science, Artificial Intelligence; Engineering, Electrical & Electronic; CPCI-S(ISTP); 0 |
语种 | 英语 |
内容类型 | 其他 |
源URL | [http://ir.pku.edu.cn/handle/20.500.11897/292485] ![]() |
专题 | 信息科学技术学院 |
推荐引用方式 GB/T 7714 | Liu, Yi,Hua, Jing,Li, Xiangang,et al. Error Feedback Based Lexical Entity Extraction for Chinese Language Modeling. 2013-01-01. |
个性服务 |
查看访问统计 |
相关权益政策 |
暂无数据 |
收藏/分享 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论