Within-Topic Event Detection Based on Key Terms (基于关键词元的话题内事件检测)
Zhang Kuo (张阔) ; Li JuanZi (李涓子) ; Wu Gang (吴刚)
2010-07-15
Conference name: Proceedings of the 3rd National Conference on Information Retrieval and Content Security (第三届全国信息检索与内容安全学术会议论文集) ; The 3rd National Conference on Information Retrieval and Content Security ; Suzhou, Jiangsu, China ; CNKI ; Technical Committee on Information Retrieval and Content Security, Chinese Information Processing Society of China
Keywords: event detection ; key terms ; event identification ; term committee ; TP391.3
Alternative title: Word Committee based Event Identification
Abstract: A large volume of news stories is produced and stored electronically by various media every day, so there is a growing need for automated techniques that analyze news and present it to users in a more clearly organized form. Most previous work organizes news into flat collections (topics) of stories. However, a topic is more than a mere collection of stories: it is characterized by a definite structure of inter-related events. Identifying events within a topic is difficult, and existing methods achieve poor precision, because stories about the same topic are usually very similar to each other regardless of the events they belong to. To address this problem, we propose a term committee method based on event key terms. We first capture tight term clusters as term committees (core terms) of potential events, then use them to find the core story sets of potential events, and finally assign every story to an event. Experimental results on two Linguistic Data Consortium (LDC) datasets show that the proposed method significantly outperforms previous methods for event identification. ; Supported by the National Natural Science Foundation of China under Grant No. 90604025
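The three steps named in the abstract (term committees, core story sets, assignment of all stories) can be illustrated with a minimal Python sketch. The abstract does not say how term clusters are formed or how stories are scored, so everything concrete here is an assumption made for illustration: the greedy Jaccard grouping of terms, the thresholds `min_jaccard`, `min_df`, and `min_hits`, the overlap score, and all function names are hypothetical and are not the authors' algorithm. The toy stories are invented as well.

```python
from collections import Counter, defaultdict


def term_committees(stories, min_jaccard=0.6, min_df=2):
    """Greedily group terms that occur in nearly the same stories (assumed stand-in)."""
    postings = defaultdict(set)  # term -> ids of the stories that contain it
    for sid, terms in enumerate(stories):
        for term in set(terms):
            postings[term].add(sid)
    # Skip rare terms and terms found in every story (topic-wide, not event-specific).
    candidates = sorted(
        t for t, docs in postings.items() if min_df <= len(docs) < len(stories)
    )
    committees, used = [], set()
    for seed in candidates:
        if seed in used:
            continue
        group = [seed]
        for other in candidates:
            if other == seed or other in used:
                continue
            a, b = postings[seed], postings[other]
            if len(a & b) / len(a | b) >= min_jaccard:
                group.append(other)
        if len(group) >= 2:  # a committee needs at least two terms
            committees.append(group)
            used.update(group)
    return committees


def core_stories(stories, committee, min_hits=2):
    """Stories that contain at least min_hits terms of one committee."""
    members = set(committee)
    return [sid for sid, terms in enumerate(stories)
            if len(members & set(terms)) >= min_hits]


def assign_events(stories, committees, min_hits=2):
    """Label every story with the event whose core-story profile it overlaps most."""
    profiles = []
    for committee in committees:
        core = core_stories(stories, committee, min_hits)
        profiles.append(Counter(t for sid in core for t in set(stories[sid])))
    labels = []
    for terms in stories:
        scores = [sum(p[t] for t in set(terms)) / max(1, sum(p.values()))
                  for p in profiles]
        labels.append(max(range(len(scores)), key=scores.__getitem__)
                      if scores else None)
    return labels


# Invented toy topic (an earthquake) with two events: the quake itself and the relief effort.
stories = [
    ["quake", "city", "magnitude", "epicenter"],
    ["quake", "city", "magnitude", "epicenter", "damage"],
    ["quake", "city", "relief", "donation"],
    ["quake", "city", "relief", "donation", "volunteers"],
    ["quake", "city", "epicenter"],
]
committees = term_committees(stories)
labels = assign_events(stories, committees)
print(committees)
print(labels)
```

In this toy run the topic-wide terms "quake" and "city" are excluded from the committees, which reflects the difficulty the abstract points out: stories in the same topic share most of their vocabulary, so only terms concentrated in a subset of stories help separate events. The last story contains too few committee terms to be a core story and is assigned only in the final step.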
Language: Chinese
Content type: Conference paper
Source URL: [http://hdl.handle.net/123456789/70010]
Collection: Tsinghua University
Recommended citation
GB/T 7714
Zhang Kuo, Li JuanZi, Wu Gang, et al. Within-Topic Event Detection Based on Key Terms[C]. In: Proceedings of the 3rd National Conference on Information Retrieval and Content Security, Suzhou, Jiangsu, China, CNKI, Technical Committee on Information Retrieval and Content Security, Chinese Information Processing Society of China.