CORC  > 北京大学  > 信息科学技术学院
A novel topic model for automatic term extraction
Li, Sujian ; Li, Jiwei ; Song, Tao ; Li, Wenjie ; Chang, Baobao
2013
英文摘要Automatic term extraction (ATE) aims at extracting domain-specific terms from a corpus of a certain domain. Termhood is one essential measure for judging whether a phrase is a term. Previous researches on termhood mainly depend on the word frequency information. In this paper, we propose to compute termhood based on semantic representation of words. A novel topic model, namely i-SWB, is developed to map the domain corpus into a latent semantic space, which is composed of some general topics, a background topic and a documents-specific topic. Experiments on four domains demonstrate that our approach outperforms the state-of-the-art ATE approaches. Copyright ? 2013 ACM.; EI; 0
语种英语
DOI标识10.1145/2484028.2484106
内容类型其他
源URL[http://ir.pku.edu.cn/handle/20.500.11897/411746]  
专题信息科学技术学院
推荐引用方式
GB/T 7714
Li, Sujian,Li, Jiwei,Song, Tao,et al. A novel topic model for automatic term extraction. 2013-01-01.
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace