Document similarity search based on manifold-ranking of TextTiles
Wan, Xiaojun ; Yang, Jianwu ; Xiao, Jianguo
2006
Abstract: Document similarity search aims to find documents similar to a query document in a text corpus and to return them as a ranked list. Most existing approaches compute similarity scores between the query and the documents with a retrieval function (e.g. Cosine) and then rank the documents by those scores. In this paper, we propose a novel retrieval approach that re-ranks the initially retrieved documents by manifold-ranking of TextTiles. The proposed approach makes full use of the intrinsic global manifold structure of the documents' TextTiles in the re-ranking process. Experimental results demonstrate that the proposed approach significantly improves retrieval performance across different retrieval functions. The TextTile is validated to be a better unit than the whole document in the manifold-ranking process. © Springer-Verlag Berlin Heidelberg 2006. Indexed in: EI.
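The re-ranking idea in the abstract can be illustrated with a minimal sketch of generic manifold-ranking over text units. This is not the authors' implementation. It assumes the TextTiles have already been segmented and are given as sparse term-weight dictionaries, uses cosine similarity as the affinity, and iterates the standard update f = alpha * S * f + (1 - alpha) * y, where S is the symmetrically normalised affinity matrix. The names `cosine` and `manifold_rank`, the value of `alpha`, and the toy vectors are all hypothetical.

```python
import math


def cosine(a, b):
    """Cosine similarity between two sparse term-weight dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    na = math.sqrt(sum(w * w for w in a.values()))
    nb = math.sqrt(sum(w * w for w in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def manifold_rank(vectors, query_index, alpha=0.5, iterations=100):
    """Return one ranking score per unit (assumed parameters, not from the paper)."""
    n = len(vectors)
    # Affinity matrix W with a zero diagonal (no self-loops).
    W = [[cosine(vectors[i], vectors[j]) if i != j else 0.0 for j in range(n)]
         for i in range(n)]
    # Symmetric normalisation S = D^{-1/2} W D^{-1/2}; isolated units get zero rows.
    d = [sum(row) for row in W]
    S = [[W[i][j] / math.sqrt(d[i] * d[j]) if d[i] > 0 and d[j] > 0 else 0.0
          for j in range(n)]
         for i in range(n)]
    # Prior vector y: only the query unit carries an initial score.
    y = [1.0 if i == query_index else 0.0 for i in range(n)]
    f = y[:]
    for _ in range(iterations):
        # Each unit receives score from its neighbours plus its own prior.
        f = [alpha * sum(S[i][j] * f[j] for j in range(n)) + (1 - alpha) * y[i]
             for i in range(n)]
    return f


# Toy data: unit 0 is the query; unit 3 shares no terms with any other unit.
units = [
    {"a": 1.0, "b": 1.0},
    {"a": 1.0, "b": 1.0, "c": 1.0},
    {"c": 1.0, "d": 1.0},
    {"x": 1.0, "y": 1.0},
]
scores = manifold_rank(units, query_index=0)
ranking = sorted(range(len(units)), key=lambda i: scores[i], reverse=True)
```

In the toy data, unit 2 shares no terms with the query yet still receives a non-zero score through unit 1, which is the effect that distinguishes manifold-ranking from direct similarity ranking. The abstract does not state how TextTile scores are combined into a document score, so that step is left out here.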
Language: English
Content type: Other
Source URL: [http://ir.pku.edu.cn/handle/20.500.11897/321480]
Collection: Institute of Computer Science and Technology
Recommended citation
GB/T 7714
Wan, Xiaojun,Yang, Jianwu,Xiao, Jianguo. Document similarity search based on manifold-ranking of TextTiles. 2006-01-01.