Ensemble subspace clustering of text data using two-level features
He Zhao; Salman Salloum; Yeshou Cai; Joshua Zhexue Huang
刊名INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS
2017
文献子类期刊论文
英文摘要This paper proposes a new integrated method for ensemble subspace clustering of high dimensional sparse text data. Our method employs two-level feature representation of text data (words and topics) to generate clusters from subspaces. We also use ensemble clustering to increase the robustness of the clusters. This method depends on topic modeling to get the two-level feature representation of text data and to generate different ensemble components. By using both topics and words to cluster text data, we can get more interpretable clusters as we can measure the weight of words and topics in each cluster. In order to evaluate the proposed method, we have conducted several experiments on seven real-life data sets. While some of these data sets are easy to cluster, others are hard, and some others contain unbalanced data. Experimental results on this diversity of data sets show that our method outperforms other methods for ensemble clustering.
URL标识查看原文
语种英语
内容类型期刊论文
源URL[http://ir.siat.ac.cn:8080/handle/172644/12568]  
专题深圳先进技术研究院_数字所
作者单位INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS
推荐引用方式
GB/T 7714
He Zhao,Salman Salloum,Yeshou Cai,et al. Ensemble subspace clustering of text data using two-level features[J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS,2017.
APA He Zhao,Salman Salloum,Yeshou Cai,&Joshua Zhexue Huang.(2017).Ensemble subspace clustering of text data using two-level features.INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS.
MLA He Zhao,et al."Ensemble subspace clustering of text data using two-level features".INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS (2017).
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace