A robust approach to text line grouping in online handwritten Japanese documents
Zhou, Xiang-Dong; Wang, Da-Han; Liu, Cheng-Lin
Journal: PATTERN RECOGNITION
2009-09-01
Volume: 42, Issue: 9, Pages: 2077-2088
Keywords: Online handwritten documents; Text line grouping; MCE training; Temporal merge; Spatial merge
Abstract: In this paper, we present an effective approach for grouping text lines in online handwritten Japanese documents by combining temporal and spatial information. With decision functions optimized by supervised learning, the approach has few artificial parameters and utilizes little prior knowledge. First, the strokes in the document are grouped into text line strings according to off-stroke distances. Each text line string, which may contain multiple lines, is segmented by optimizing a cost function trained by the minimum classification error (MCE) method. At the temporal merge stage, over-segmented text lines (caused by stroke classification errors) are merged using a support vector machine (SVM) classifier that makes merge/non-merge decisions. Last, a spatial merge module corrects the segmentation errors caused by delayed strokes. Misclassified text/non-text strokes (stroke type classification precedes text line grouping) can be corrected at the temporal merge stage. To evaluate the performance of text line grouping, we provide a set of performance metrics covering multiple aspects. In experiments on a large number of free-form documents in the Tokyo University of Agriculture and Technology (TUAT) Kondate database, the proposed approach achieves an entity detection metric (EDM) rate of 0.8992 and an edit-distance rate (EDR) of 0.1114. For grouping of pure text strokes, the performance reaches an EDM of 0.9591 and an EDR of 0.0669. (C) 2008 Elsevier Ltd. All rights reserved.
WOS Headings: Science & Technology; Technology
WOS Subject Category: Computer Science, Artificial Intelligence; Engineering, Electrical & Electronic
WOS Research Area: Computer Science; Engineering
WOS Keywords: SPEECH RECOGNITION
Indexed in: SCI
Language: English
WOS Accession Number: WOS:000267089000036
Content type: Journal article
Source URL: http://ir.ia.ac.cn/handle/173211/3058
Collection: Institute of Automation_National Laboratory of Pattern Recognition_Pattern Analysis and Learning Group
Author affiliation: Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
Recommended citation
GB/T 7714
Zhou, Xiang-Dong, Wang, Da-Han, Liu, Cheng-Lin. A robust approach to text line grouping in online handwritten Japanese documents[J]. PATTERN RECOGNITION, 2009, 42(9): 2077-2088.
APA Zhou, Xiang-Dong, Wang, Da-Han, & Liu, Cheng-Lin. (2009). A robust approach to text line grouping in online handwritten Japanese documents. PATTERN RECOGNITION, 42(9), 2077-2088.
MLA Zhou, Xiang-Dong, et al. "A robust approach to text line grouping in online handwritten Japanese documents". PATTERN RECOGNITION 42.9 (2009): 2077-2088.
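
Illustrative sketch. The abstract states that strokes are first grouped into text line strings according to off-stroke distances. The Python below is a minimal sketch of one way such a first pass could look, not the authors' implementation. The `Stroke` type, the function names, the Euclidean pen-up to pen-down distance, and the fixed `max_gap` threshold are all assumptions made for illustration; the abstract does not specify how the distance or the decision rule is defined.

```python
from dataclasses import dataclass
from math import hypot
from typing import List, Tuple

Point = Tuple[float, float]


@dataclass
class Stroke:
    """Hypothetical stroke type: pen-down trajectory in writing order."""
    points: List[Point]


def off_stroke_distance(prev: Stroke, nxt: Stroke) -> float:
    """Euclidean distance from the pen-up point of `prev` to the
    pen-down point of `nxt` (an assumed definition)."""
    (x0, y0), (x1, y1) = prev.points[-1], nxt.points[0]
    return hypot(x1 - x0, y1 - y0)


def group_into_strings(strokes: List[Stroke], max_gap: float) -> List[List[Stroke]]:
    """Split a temporally ordered stroke sequence into text line strings.

    A new string starts whenever the off-stroke distance to the previous
    stroke exceeds `max_gap` (an assumed, hand-set threshold).
    """
    groups: List[List[Stroke]] = []
    for stroke in strokes:
        if groups and off_stroke_distance(groups[-1][-1], stroke) <= max_gap:
            groups[-1].append(stroke)   # close to the previous stroke: same string
        else:
            groups.append([stroke])     # first stroke or large pen jump: new string
    return groups
```

Each resulting string may still span several text lines, which matches the abstract's remark that the strings are segmented further in a later stage.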