Person resolution in person search results: WebHawk
Wan, Xiaojun ; Gao, Jianfeng ; Li, Mu ; Ding, Binggong
2005
Abstract | Finding information about people on the Web using a search engine is difficult because there is a many-to-many mapping between person names and specific persons (i.e., referents). This paper describes a person resolution system called WebHawk. Given a list of pages obtained by submitting a person query to a search engine, WebHawk facilitates person search in three steps. First, a filter removes pages that contain no information about any person. Second, a clustering component groups the remaining pages into clusters, one per specific person; to make the resulting clusters more meaningful, an extractor induces query-oriented personal information from each page. Finally, a namer generates an informative description for each cluster so that users can find any specific person easily. The architecture of WebHawk is presented, and the four components are discussed in detail, with a separate evaluation of each component presented where appropriate. A user study shows that WebHawk complements most existing search engines and successfully improves users' experience of person search on the Web. Copyright 2005 ACM. Indexed in EI. |
Language | English |
DOI | 10.1145/1099554.1099585 |
Content type | Other |
Source URL | http://ir.pku.edu.cn/handle/20.500.11897/321536 |
Collection | Institute of Computer Science and Technology |
Recommended citation (GB/T 7714) | Wan, Xiaojun, Gao, Jianfeng, Li, Mu, et al. Person resolution in person search results: WebHawk. 2005-01-01. |
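
The abstract above describes a filter, a clustering step, an extractor, and a namer applied in sequence to search results. The following is a minimal Python sketch of how such a four-stage pipeline could be wired together. It is not the method from the paper: the `Page` class, every function name, the substring-based filter, the Jaccard similarity, the `threshold` value, and the stopword list are all assumptions chosen only to make the flow concrete.

```python
import re
from collections import Counter
from dataclasses import dataclass, field

# Hypothetical stopword list; the paper does not specify one.
STOPWORDS = {"the", "and", "for", "with", "from", "that", "this", "his", "her"}


@dataclass
class Page:
    url: str
    text: str
    terms: set = field(default_factory=set)


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def filter_pages(pages, query):
    """Filter (assumed heuristic): keep pages that mention the full query name."""
    q = query.lower()
    return [p for p in pages if q in p.text.lower()]


def extract_terms(page, query):
    """Extractor (assumed heuristic): content words other than the name itself."""
    query_tokens = set(tokenize(query))
    return {
        t for t in tokenize(page.text)
        if len(t) > 2 and t not in STOPWORDS and t not in query_tokens
    }


def jaccard(a, b):
    """Jaccard similarity of two sets; 0.0 when both are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def cluster_pages(pages, threshold=0.2):
    """Clustering (assumed): greedy single pass, comparing to each cluster's first page."""
    clusters = []
    for page in pages:
        for cluster in clusters:
            if jaccard(page.terms, cluster[0].terms) >= threshold:
                cluster.append(page)
                break
        else:
            clusters.append([page])
    return clusters


def name_cluster(cluster, top_k=3):
    """Namer (assumed): most frequent extracted terms, ties broken alphabetically."""
    counts = Counter(t for p in cluster for t in p.terms)
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, _ in ranked[:top_k]]


def resolve(pages, query):
    """Run filter, extractor, clustering, and namer; return (label, urls) pairs."""
    kept = filter_pages(pages, query)
    for p in kept:
        p.terms = extract_terms(p, query)
    return [(name_cluster(c), [p.url for p in c]) for c in cluster_pages(kept)]
```

Calling `resolve(pages, "John Smith")` on a handful of `Page` objects returns one `(label_terms, urls)` pair per inferred person. A real system would likely replace each heuristic with a trained or more carefully engineered component.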