CORC  > 厦门大学  > 人文学院-已发表论文
词语库的收词与规则库的建立; The Word-Collection of Word Base and The Establishment of Rule Base
苏新春 ; 杜晶晶
2014-02-15
关键词多义词 词义搭配知识库 词语库 规则库 polysemous words word sense collocation knowledge base word base rule base
英文摘要词语库与规则库是在“多义词词义搭配知识库“中起基础与核心作用的两个子库。词语库有两个来源,一是词典词,二是真实语料词,两类词语有着书面语与口语词、正体词与异体词、语言词与言语词、通用词与领域词、稳定词与具体词等方面的差异。词语库特点会在很大程度上影响到词义标注的效果与正确率。纳入首批考察的词语为双音节多义词3771条,共有义项7861个。规则库统摄语义库、义项库、语料库,这些知识库通过规则库的组织而发挥作用。规则库是实现词义标注工程目标的直接依据,对于任何一个多义词,规则定义的多寡有无、质量好坏都会直接影响标注结果。规则库集中体现SCT整个系统的意义与价值,是语言知识与工程实施的结晶体。; Word base and rule base are the two central and fundamental subsets of The Polysemy Sense Collocation Knowledge Base.Word base is comprised of words from dictionaries and words from corpora.They differentiate in written words and spoken words, standard words and variants,language words and speech words,general words and domain words,stable words and concrete words,etc.The characteristics of word base will to a large extent affect the result and accuracy of word sense tagging.The first study includes 3771 polysemou disyllables with 7861 word senses.The semantic base,sense base and corpus are subject to the organization of rule base.Rule base is a direct basis for word sense tagging.For any of the polysemous words,tagging depends on the quality and quantity of rule definition.It also epitomizes the significance and value of the whole SCT system,and is the combination of linguistic knowledge and word sense tagging.
语种zh_CN
内容类型期刊论文
源URL[http://dspace.xmu.edu.cn/handle/2288/126593]  
专题人文学院-已发表论文
推荐引用方式
GB/T 7714
苏新春,杜晶晶. 词语库的收词与规则库的建立, The Word-Collection of Word Base and The Establishment of Rule Base[J],2014.
APA 苏新春,&杜晶晶.(2014).词语库的收词与规则库的建立..
MLA 苏新春,et al."词语库的收词与规则库的建立".(2014).
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace