CORC  > 软件研究所  > 基础软件国家工程研究中心  > 会议论文
tc-dca: a system for text classification based on document's content allocation
Li Wenbo ; Sun Le ; Zhang Zhenzhong ; Jiang Xue ; Zhang Weiru
2010
会议名称19th International Conference on Information and Knowledge Management and Co-located Workshops, CIKM'10
会议日期40842
会议地点Toronto, ON, Canada
关键词Knowledge management Learning algorithms Text processing Visualization
页码1937-1938
英文摘要The text classification methods heavily depend on machine learning algorithms with abstract mathematic metrics, which obstruct the direct observation and intuitive understanding of the text-specific classification. In this paper, we model a document as a Document-Classes-Topics top-down hierarchical structure. Furthermore, by running the document generation procedure, we can obtain each class's content share, which not only can be used to make the classification decision but also can provide a natural visualization approach for text classification. We implement this idea by a new tool named TC-DCA, which provides the visualization of text classification result, where the target document is expressed graphically as its content's allocation on every class. TC-DCA can also perform the drilling down operation to reveal the classification effect of each word of the document.
收录类别EI
会议主办者ACM SIGIR; ACM SIGWEB; ACM SIGKDD
会议录International Conference on Information and Knowledge Management, Proceedings
会议录出版地United States
ISBN号9781450000000
内容类型会议论文
源URL[http://124.16.136.157/handle/311060/8928]  
专题软件研究所_基础软件国家工程研究中心_会议论文
推荐引用方式
GB/T 7714
Li Wenbo,Sun Le,Zhang Zhenzhong,et al. tc-dca: a system for text classification based on document's content allocation[C]. 见:19th International Conference on Information and Knowledge Management and Co-located Workshops, CIKM'10. Toronto, ON, Canada. 40842.
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace