Classification of forms with similar layouts based on Mixed Gaussian Weighted Mask | |
Wang, Simeng ; Gao, Liangcai ; Wang, Yuehan | |
2015 | |
英文摘要 | As an essential step of form processing, form classification has attracted much attention from researchers. However, for the forms with similar layout, most of the previous classification methods still suffer from two issues: huge variation among areas of user-filled-in data and insufficient discriminative identifiers in areas of preprinted data. In this paper, we propose a novel Mixed Gaussian Weighted Mask (MGWM) based method to identify forms with similar layouts by leveraging the multiple information extracted from areas of user-filled-in data, areas of preprinted data and dithering data of a form. The proposed method utilizes a combination of three Gaussian weighted masks to mitigate the impact of noise from areas of user-filled-in data, layout consistency and position dithering among form images respectively. Experimental results show that the proposed method achieves more than 85% classification accuracy on a number of forms and outperforms the state-of-the-art form classification method. ? 2015 IEEE.; EI; 111-115; 2015-November |
语种 | 英语 |
出处 | 13th International Conference on Document Analysis and Recognition, ICDAR 2015 |
DOI标识 | 10.1109/ICDAR.2015.7333736 |
内容类型 | 其他 |
源URL | [http://ir.pku.edu.cn/handle/20.500.11897/436471] ![]() |
专题 | 计算机科学技术研究所 |
推荐引用方式 GB/T 7714 | Wang, Simeng,Gao, Liangcai,Wang, Yuehan. Classification of forms with similar layouts based on Mixed Gaussian Weighted Mask. 2015-01-01. |
个性服务 |
查看访问统计 |
相关权益政策 |
暂无数据 |
收藏/分享 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论