Automatic Learning Common Definitional Patterns from Multi-domain Wikipedia Pages
Zhang, Jingsong (1); Wang, Yinglin (2); Yang, Dingyu (1)
2014
Keywords: definition extraction; definitional pattern; FIND-S algorithm; similarity priority; frequent pattern
DOI: 10.1109/ICDMW.2014.107
Pages: 251-258
Abstract: Automatic definition extraction has attracted wide interest in the NLP domain and in knowledge-based applications. One primary task of definition extraction is mining patterns from definitional sentences. Existing methods for extracting definitional patterns either rely on manual extraction by intuition or observation, or aim to mine intricate definitional patterns automatically. The manual method requires substantial human effort to identify definitional patterns because lexico-syntactic structures are diverse, and it inevitably performs poorly, especially when extracting from cross-domain corpora. The automatic method mainly targets precision in definition extraction, at the cost of lower recall of definitions. Neither is suitable for cross-domain definition extraction. To address these issues, this paper proposes a solution for automatically extracting definitional patterns from multi-domain definitional sentences in Wikipedia. Our method, FIND-SS, is a modification of the FIND-S algorithm and addresses the definition extraction problems of cross-domain corpora. FIND-SS adopts a "the more similar, the higher the priority" scheme to improve learning performance. It can accommodate some noisy information and does not require any pattern seeds for pattern learning. The experimental results indicate that our approach is significantly superior to previous methods.
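For readers unfamiliar with the FIND-S algorithm named in the abstract, the following is a minimal sketch of the classic textbook FIND-S procedure, not the authors' FIND-SS. It assumes, purely for illustration, that each definitional sentence has already been reduced to a fixed-length sequence of tokens or tags; the names `find_s`, `WILDCARD`, and the toy examples are hypothetical and do not come from the paper. The similarity-priority ordering described in the abstract is not implemented here.

```python
from typing import List, Optional, Sequence, Tuple

# Hypothetical wildcard symbol meaning "any token is accepted at this position".
WILDCARD = "?"

Example = Tuple[Sequence[str], bool]  # (token sequence, is it a definition?)


def find_s(examples: Sequence[Example]) -> Optional[List[str]]:
    """Classic FIND-S: return the most specific pattern covering all positives.

    Negative examples are ignored, as in the textbook algorithm.
    Returns None when there is no positive example.
    """
    hypothesis: Optional[List[str]] = None
    for tokens, is_positive in examples:
        if not is_positive:
            continue  # FIND-S never uses negative examples
        if hypothesis is None:
            # Start from the first positive example (most specific hypothesis).
            hypothesis = list(tokens)
            continue
        if len(tokens) != len(hypothesis):
            raise ValueError("this sketch assumes equal-length token sequences")
        # Minimal generalization: keep matching positions, relax the others.
        hypothesis = [h if h == t else WILDCARD for h, t in zip(hypothesis, tokens)]
    return hypothesis


# Toy data (illustrative only, not taken from the paper).
examples: List[Example] = [
    (("TERM", "is", "a", "NOUN"), True),
    (("TERM", "is", "an", "NOUN"), True),
    (("TERM", "has", "a", "NOUN"), False),
]
pattern = find_s(examples)
print(pattern)
```

On the toy data, the two positive sequences differ only in the third position, so that position is generalized to the wildcard while the others are kept. A real system would need to handle variable-length sentences and noisy input, which the abstract says FIND-SS is designed to tolerate.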
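Proceedings publisher: IEEE
Place of publication: 345 E 47TH ST, NEW YORK, NY 10017 USA
Language: English
WOS research area: Computer Science
WOS accession number: WOS:000389255100036
Content type: Conference paper
Source URL: [http://10.2.47.112/handle/2XS4QKH4/3049]
Collection: Shanghai University of Finance and Economics
Author affiliations: 1. Shanghai Jiao Tong Univ, Dept CSE, Shanghai, Peoples R China;
2. Shanghai Univ Finance & Econ, Dept CST, Shanghai, Peoples R China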
Recommended citation format
GB/T 7714
Zhang, Jingsong, Wang, Yinglin, Yang, Dingyu. Automatic Learning Common Definitional Patterns from Multi-domain Wikipedia Pages[C]. In: .