CORC  > 北京大学  > 信息科学技术学院
Muli-label text categorization with hidden components
Li, Li ; Zhang, Longkai ; Wang, Houfeng
2014
英文摘要Multi-label text categorization (MTC) is supervised learning, where a document may be assigned with multiple categories (labels) simultaneously. The labels in the MTC are correlated and the correlation results in some hidden components, which represent the 'share' variance of correlated labels. In this paper, we propose a method with hidden components for MTC. The proposed method employs PCA to capture the hidden components, and incorporates them into a joint learning framework to improve the performance. Experiments with real-world data sets and evaluation metrics validate the effectiveness of the proposed method. ? 2014 Association for Computational Linguistics.; EI; 0
语种英语
内容类型其他
源URL[http://ir.pku.edu.cn/handle/20.500.11897/328527]  
专题信息科学技术学院
推荐引用方式
GB/T 7714
Li, Li,Zhang, Longkai,Wang, Houfeng. Muli-label text categorization with hidden components. 2014-01-01.
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace