An integrated document retrieval method combining entity annotation and keyword index: A KIM platform implementation
XU Xin ; GUO Jinlong ; HONG Yunjia ; JIN Biyi
刊名chinese journal of library and information science
2013-03-25
卷号6期号:1页码:64-77
关键词Ontology Semantic annotation Semantic retrieval Entity retrieval|KIM
ISSN号1674-3393
通讯作者xu xin (e-mail:xxu@infor.ecnu.edu.cn)
中文摘要

purpose: the objective of this paper is to testify the effect of ontology-based semantic annotation on the performance of document retrieval.

design/methodology/approach: an integrated document retrieval method is put forward in this paper, in which the entities of documents are annotated by the upper ontology and domain ontology, then the documents are further indexed by the entity annotation as well as traditional keywords.

findings: the research result shows that the structured entity retrieval and relation retrieval can be realized by the ontology-based entity index, which is beyond the ability of the tradition keyword-based retrieval. meanwhile, the experiment shows that the recall and precision of document retrieval are improved effectively.

research limitations: due to the small amount of our current tourism domain ontology, the document retrieval with the ontology-based semantic index is limited by the size of ontology and the precision of semantic annotation. meanwhile, the semantic annotation algorithm mainly relies on the current information extraction strategy of kim platform. therefore, the performance of disambiguation and relation extraction algorithm need to be further improved.

practical implications: our method can improve the efficiency of document retrieval system, which facilitates the knowledge and document management in corporations, governments and other organizations.

originality/value: the integrated document retrieval method proposed in the paper can combine the entity index based on the general ontology with domain ontology and the keyword index. our result verified the effectiveness of the combined index strategy.

英文摘要

purpose: the objective of this paper is to testify the effect of ontology-based semantic annotation on the performance of document retrieval.

design/methodology/approach: an integrated document retrieval method is put forward in this paper, in which the entities of documents are annotated by the upper ontology and domain ontology, then the documents are further indexed by the entity annotation as well as traditional keywords.

findings: the research result shows that the structured entity retrieval and relation retrieval can be realized by the ontology-based entity index, which is beyond the ability of the tradition keyword-based retrieval. meanwhile, the experiment shows that the recall and precision of document retrieval are improved effectively.

research limitations: due to the small amount of our current tourism domain ontology, the document retrieval with the ontology-based semantic index is limited by the size of ontology and the precision of semantic annotation. meanwhile, the semantic annotation algorithm mainly relies on the current information extraction strategy of kim platform. therefore, the performance of disambiguation and relation extraction algorithm need to be further improved.

practical implications: our method can improve the efficiency of document retrieval system, which facilitates the knowledge and document management in corporations, governments and other organizations.

originality/value: the integrated document retrieval method proposed in the paper can combine the entity index based on the general ontology with domain ontology and the keyword index. our result verified the effectiveness of the combined index strategy.

学科主题编辑出版
资助信息this work is supported by the national social science foundation of china (grant no. 11ctq003).
原文出处http://www.chinalibraries.net
公开日期2013-04-27
内容类型期刊论文
源URL[http://ir.las.ac.cn/handle/12502/6151]  
专题文献情报中心_Journal of Data and Information Science_Chinese Journal of Library and Information Science-2013
推荐引用方式
GB/T 7714
XU Xin,GUO Jinlong,HONG Yunjia,et al. An integrated document retrieval method combining entity annotation and keyword index: A KIM platform implementation[J]. chinese journal of library and information science,2013,6(1):64-77.
APA XU Xin,GUO Jinlong,HONG Yunjia,&JIN Biyi.(2013).An integrated document retrieval method combining entity annotation and keyword index: A KIM platform implementation.chinese journal of library and information science,6(1),64-77.
MLA XU Xin,et al."An integrated document retrieval method combining entity annotation and keyword index: A KIM platform implementation".chinese journal of library and information science 6.1(2013):64-77.
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace