WordRank-based lexical signatures for finding lost or related web pages
Wan, Xiaojun; Yang, Jianwu
2006
Abstract: A lexical signature of a web page consists of several keywords carefully chosen from the page and is used to generate a robust hyperlink that can find the page when its URL fails. In this paper, we propose a novel method based on WordRank to compute lexical signatures, which takes into account the semantic relatedness between words and chooses the most representative and salient words as the lexical signature. Experiments show that DF-based lexical signatures are best at uniquely identifying web pages and that hybrid lexical signatures are good candidates for retrieving the desired web pages, while WordRank-based lexical signatures are best for retrieving highly relevant web pages when the desired web page cannot be extracted. © Springer-Verlag Berlin Heidelberg 2006. Indexed in EI.
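To make the idea of a lexical signature concrete, the following is a minimal sketch and not the authors' implementation. It assumes a naive tokenizer and a small in-memory corpus, and every function name is hypothetical. `df_signature` picks the words of a page with the lowest document frequency (DF). `cooccurrence_rank_signature` is only a stand-in for WordRank: this record does not describe the WordRank computation, so the sketch substitutes a PageRank-style iteration over a word co-occurrence graph, using co-occurrence as a crude proxy for semantic relatedness.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase, split on non-letters, and drop tokens of two letters or fewer."""
    return [w for w in re.findall(r"[a-z]+", text.lower()) if len(w) > 2]


def document_frequency(corpus):
    """Count how many documents in the corpus contain each word."""
    df = Counter()
    for doc in corpus:
        df.update(set(tokenize(doc)))
    return df


def df_signature(page, df, k=5):
    """DF-based signature: the k rarest words on the page.

    Words absent from the DF table count as DF 0 (rarest); ties are
    broken alphabetically so the result is deterministic.
    """
    words = set(tokenize(page))
    return sorted(words, key=lambda w: (df.get(w, 0), w))[:k]


def cooccurrence_rank_signature(page, k=5, window=3, damping=0.85, iters=30):
    """Assumed stand-in for WordRank: PageRank-style scores on a
    co-occurrence graph built from a sliding window over the page."""
    tokens = tokenize(page)
    weights = {}  # weights[u][v] = number of times u and v co-occur
    for i, u in enumerate(tokens):
        for v in tokens[i + 1:i + window]:
            if u != v:
                weights.setdefault(u, Counter())[v] += 1
                weights.setdefault(v, Counter())[u] += 1
    if not weights:
        return []
    scores = {w: 1.0 for w in weights}
    for _ in range(iters):
        updated = {}
        for w in weights:
            # Each neighbor u passes on a share of its score proportional
            # to the weight of the edge (u, w).
            incoming = sum(
                scores[u] * weights[u][w] / sum(weights[u].values())
                for u in weights[w]
            )
            updated[w] = (1 - damping) + damping * incoming
        scores = updated
    return sorted(scores, key=lambda w: (-scores[w], w))[:k]
```

In use, the chosen words would be submitted together as a search-engine query in the hope that the lost page, or a closely related one, appears near the top of the results. A hybrid signature in the sense of the abstract could combine words from more than one selection criterion, but the exact combination is not given in this record.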
Language: English
DOI: 10.1007/11610113_83
Content type: Other
Source URL: http://ir.pku.edu.cn/handle/20.500.11897/321495
Collection: Institute of Computer Science and Technology
Recommended citation
GB/T 7714
Wan, Xiaojun, Yang, Jianwu. WordRank-based lexical signatures for finding lost or related web pages. 2006-01-01.