Person resolution in person search results: WebHawk
Wan, Xiaojun ; Gao, Jianfeng ; Li, Mu ; Ding, Binggong
2005
Abstract: Finding information about people on the Web using a search engine is difficult because there is a many-to-many mapping between person names and specific persons (i.e. referents). This paper describes a person resolution system, called WebHawk. Given a list of pages obtained by submitting a person query to a search engine, WebHawk facilitates person search in three steps: First of all, a filter removes those pages that contain no information about any person. Secondly, a cluster groups the remaining pages into different clusters, each for one specific person. To make the resulting clusters more meaningful, an extractor is used to induce query-oriented personal information from each page. Finally, a namer generates an informative description for each cluster so that users can find any specific person easily. The architecture of WebHawk is presented, and the four components are discussed in detail, with a separate evaluation of each component presented where appropriate. A user study shows that WebHawk complements most existing search engines and successfully improves users' experience of person search on the Web. Copyright 2005 ACM.
Indexed in: EI
Language: English
DOI: 10.1145/1099554.1099585
Content type: Other
Source URL: http://ir.pku.edu.cn/handle/20.500.11897/321536
Collection: Institute of Computer Science and Technology
Recommended citation (GB/T 7714):
Wan, Xiaojun, Gao, Jianfeng, Li, Mu, et al. Person resolution in person search results: WebHawk. 2005-01-01.
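
The four components named in the abstract (filter, extractor, cluster, namer) can be pictured as a small pipeline. The sketch below is only an illustration under stated assumptions and is not WebHawk's actual method: the `Page` type, the function names, the keyword list, and the toy heuristics (substring matching, grouping by identical keyword sets) are all hypothetical, and running the extractor before clustering is also an assumption made for simplicity.

```python
from collections import defaultdict
from dataclasses import dataclass, field


@dataclass
class Page:
    """Hypothetical container for one search result."""
    url: str
    text: str
    info: dict = field(default_factory=dict)


def filter_pages(pages, query):
    # Toy stand-in for the filter: keep pages that mention the queried name.
    return [p for p in pages if query.lower() in p.text.lower()]


def extract_info(page, keywords):
    # Toy stand-in for the extractor: record which assumed keywords occur.
    text = page.text.lower()
    page.info = {"keywords": sorted(k for k in keywords if k in text)}
    return page


def cluster_pages(pages):
    # Toy stand-in for the cluster step: group pages with identical keyword sets.
    clusters = defaultdict(list)
    for p in pages:
        clusters[tuple(p.info["keywords"])].append(p)
    return list(clusters.values())


def name_cluster(cluster, query):
    # Toy stand-in for the namer: describe a cluster by its shared keywords.
    kws = cluster[0].info["keywords"]
    return f"{query} ({', '.join(kws)})" if kws else f"{query} (unknown)"


def resolve(pages, query, keywords):
    # Filter, extract, cluster, then name; returns {description: [urls]}.
    kept = [extract_info(p, keywords) for p in filter_pages(pages, query)]
    return {name_cluster(c, query): [p.url for p in c] for c in cluster_pages(kept)}


pages = [
    Page("a", "John Smith is a professor of physics"),
    Page("b", "John Smith, professor at a university"),
    Page("c", "John Smith plays guitar as a musician"),
    Page("d", "Weather report for today"),
]
result = resolve(pages, "John Smith", ["professor", "musician"])
```

In this toy run, page `d` is dropped by the filter and the remaining pages fall into one group per keyword set. A real system would need much stronger evidence than shared keywords to decide that two pages refer to the same person.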