CORC  > 软件研究所  > 软件所图书馆  > 会议论文
a novel kernel for text categorization
Zhang Lujiang ; Hu Xiaohui
2012
会议名称2012 IEEE International Conference on Computer Science and Automation Engineering, CSAE 2012
会议日期May 25, 2012 - May 27, 2012
会议地点Zhangjiajie, China
关键词Algorithms Computer science Support vector machines
页码186-190
中文摘要In this paper we proposed a novel kernel for text categorization. This kernel is an inner product in the feature space generated by all word combinations of specified length. A word combination is a collection of different words co-occurring in the same sentence. The word combination of length k is weighted by the k-th root of the product of the inverse document frequencies (IDF) of its words. A computationally simple and efficient algorithm was proposed to calculate this kernel. We conducted experiments on the 20 Newsgroups dataset. This kernel achieves better performance than the classical word kernel and word-sequence kernel. We also assessed the impact of word combination length on performance. © 2012 IEEE.
英文摘要In this paper we proposed a novel kernel for text categorization. This kernel is an inner product in the feature space generated by all word combinations of specified length. A word combination is a collection of different words co-occurring in the same sentence. The word combination of length k is weighted by the k-th root of the product of the inverse document frequencies (IDF) of its words. A computationally simple and efficient algorithm was proposed to calculate this kernel. We conducted experiments on the 20 Newsgroups dataset. This kernel achieves better performance than the classical word kernel and word-sequence kernel. We also assessed the impact of word combination length on performance. © 2012 IEEE.
收录类别EI
会议主办者IEEE Beijing Section; Hunan University of Humanities, Science and Technology; Tongji University; Xiamen University; Central South University
会议录CSAE 2012 - Proceedings, 2012 IEEE International Conference on Computer Science and Automation Engineering
语种英语
ISBN号9781467300865
内容类型会议论文
源URL[http://ir.iscas.ac.cn/handle/311060/15762]  
专题软件研究所_软件所图书馆_会议论文
推荐引用方式
GB/T 7714
Zhang Lujiang,Hu Xiaohui. a novel kernel for text categorization[C]. 见:2012 IEEE International Conference on Computer Science and Automation Engineering, CSAE 2012. Zhangjiajie, China. May 25, 2012 - May 27, 2012.
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace