Extracting relations from the web via weakly supervised learning
Chen, Liwei ; Feng, Yansong ; Zhao, Dongyan
Journal: Jisuanji Yanjiu yu Fazhan/Computer Research and Development
2013
Abstract: In the era of big data, large-scale information extraction has become an important topic in natural language processing and information retrieval. In particular, weak supervision, a novel framework that requires no human involvement and can be easily adapted to new domains, is receiving increasing attention. Current work on weak supervision targets primarily English and uses conventional features such as lexical features based on word segments and dependency-based syntactic features. However, such lexical features often suffer from data sparsity, while syntactic features rely heavily on the availability of syntactic analysis tools. This paper proposes using n-gram features, which can relieve to some extent the data sparsity caused by lexical features. We also observe that n-gram features are important for multilingual relation extraction; in particular, they can compensate for syntactic features in languages where syntactic analysis tools are unreliable. To address the quality of the training data used in weakly supervised learning models, a bootstrapping approach, co-training, is introduced into the framework to improve this extraction paradigm. We study strategies for combining the outputs from different training views. Experimental results on both English and Chinese datasets show that the proposed approach effectively improves the performance of weak supervision in both languages and has the potential to work well in a multilingual scenario with more languages.
EI; 0; 9; 1825-1835; 50
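Language: English
Content type: Journal article
Source URL: [http://ir.pku.edu.cn/handle/20.500.11897/321355]
Collection: Institute of Computer Science and Technology
Recommended citation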
GB/T 7714: Chen, Liwei, Feng, Yansong, Zhao, Dongyan. Extracting relations from the web via weakly supervised learning[J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2013.
APA: Chen, Liwei, Feng, Yansong, & Zhao, Dongyan. (2013). Extracting relations from the web via weakly supervised learning. Jisuanji Yanjiu yu Fazhan/Computer Research and Development.
MLA: Chen, Liwei, et al. "Extracting relations from the web via weakly supervised learning". Jisuanji Yanjiu yu Fazhan/Computer Research and Development (2013).