Research on Chinese N-gram Statistical Rule and its Application

CORC > 北京大学 > 软件与微电子学院

	Research on Chinese N-gram Statistical Rule and its Application
	Yin, Zhaoming ; Zhang, Huarui
	2009
关键词	Artificial Intelligence Chinese word segmentation with no dictionary dynamic programming new word extraction
英文摘要	In this article, we assign Chinese n-gram sequences to different types by their statistical properties such as frequency, mutual information and left/right border entropy. We call these sequence type "Radixes" and define some combination rules between them. Based on the radixes we classified and their combination rule we designed a new Chinese segmentation algorithm without dictionary based on dynamic programming, and do some research on the automatic word extraction of Chinese words consist of 2 to 4 letters, we achieved good performance on some aspects.; http://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcApp=PARTNER_APP&SrcAuth=LinksAMR&KeyUT=WOS:000270587500121&DestLinkType=FullRecord&DestApp=ALL_WOS&UsrCustomerID=8e1609b174ce4e31116a60747a720701 ; Engineering, Electrical & Electronic; Telecommunications; CPCI-S(ISTP); 0
语种	英语
内容类型	其他
源URL	[http://ir.pku.edu.cn/handle/20.500.11897/325750]
专题	软件与微电子学院
推荐引用方式 GB/T 7714	Yin, Zhaoming,Zhang, Huarui. Research on Chinese N-gram Statistical Rule and its Application. 2009-01-01.

个性服务

查看访问统计

相关权益政策

暂无数据

收藏/分享

所有评论 (0)

暂无评论

评注功能仅针对注册用户开放，请您登录

您在知识库使用过程中有什么好的想法或者建议可以反馈给我们。
标题：	*
内容：
Email：	*
验证码：	刷新

相关链接