Grammar-Induced Wavelet Network for Human Parsing

Title	Grammar-Induced Wavelet Network for Human Parsing
Authors	Xiaomei Zhang; Yingying Chen; Ming Tang; Zhen Lei; Jinqiao Wang
Journal	IEEE TRANSACTIONS ON IMAGE PROCESSING
Publication date	2022-06
Volume	31
Pages	4502-4514
Document subtype	SCI
Abstract	Most existing human parsing methods still face a challenge: how to effectively extract an accurate foreground from similar or cluttered scenes. In this paper, we propose a Grammar-induced Wavelet Network (GWNet) to address this challenge. GWNet consists mainly of two modules: a blended grammar-induced module and a wavelet prediction module. We design the blended grammar-induced module to exploit the relationships among different human parts and the inherent hierarchical structure of the human body by means of grammar rules, applied in both cascaded and parallel manners. In this way, conspicuous parts, which are easily distinguished from the background, can amend the segmentation of inconspicuous ones, improving foreground extraction. We also design a Part-aware Convolutional Recurrent Neural Network (PCRNN) to pass the messages generated by the grammar rules. To further improve performance, we propose a wavelet prediction module that captures the basic structure and the edge details of a person by decomposing features into low-frequency and high-frequency components. The low-frequency component represents smooth structures, and the high-frequency components describe fine details. We conduct extensive experiments to evaluate GWNet on the PASCAL-Person-Part, LIP, and PPSS datasets. GWNet obtains state-of-the-art performance on these human parsing datasets.
Content type	Journal article
Source URL	[http://ir.ia.ac.cn/handle/173211/57134]
Collection	State Key Laboratory of Multimodal Artificial Intelligence Systems
Corresponding author	Yingying Chen
Recommended citation (GB/T 7714)	Xiaomei Zhang, Yingying Chen, Ming Tang, et al. Grammar-Induced Wavelet Network for Human Parsing[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022(31): 4502-4514.
APA	Xiaomei Zhang, Yingying Chen, Ming Tang, Zhen Lei, & Jinqiao Wang. (2022). Grammar-Induced Wavelet Network for Human Parsing. IEEE TRANSACTIONS ON IMAGE PROCESSING(31), 4502-4514.
MLA	Xiaomei Zhang, et al. "Grammar-Induced Wavelet Network for Human Parsing". IEEE TRANSACTIONS ON IMAGE PROCESSING. 31(2022): 4502-4514.
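
Illustrative note on the abstract. The abstract describes decomposing features into one low-frequency component (smooth structure) and several high-frequency components (fine details). The following is a minimal sketch of that general idea only, not the authors' wavelet prediction module: it assumes a single-level 2D Haar transform applied to a plain NumPy array, whereas the record does not state which wavelet GWNet uses or how the decomposition is wired into the network. The function names `haar_dwt2` and `haar_idwt2` are hypothetical.

```python
import numpy as np


def haar_dwt2(x):
    """Single-level 2D Haar transform (an assumed wavelet, for illustration).

    Returns one low-frequency band and three high-frequency bands,
    each half the height and width of the input.
    """
    x = np.asarray(x, dtype=float)
    if x.ndim != 2 or x.shape[0] % 2 or x.shape[1] % 2:
        raise ValueError("expected a 2D array with even height and width")
    a = x[0::2, 0::2]  # top-left sample of each 2x2 block
    b = x[0::2, 1::2]  # top-right
    c = x[1::2, 0::2]  # bottom-left
    d = x[1::2, 1::2]  # bottom-right
    ll = (a + b + c + d) / 2.0  # block average: smooth structure
    lh = (a + b - c - d) / 2.0  # top row minus bottom row
    hl = (a - b + c - d) / 2.0  # left column minus right column
    hh = (a - b - c + d) / 2.0  # diagonal difference
    return ll, (lh, hl, hh)


def haar_idwt2(ll, highs):
    """Invert haar_dwt2, recovering the original array exactly."""
    lh, hl, hh = highs
    h, w = ll.shape
    x = np.empty((2 * h, 2 * w))
    x[0::2, 0::2] = (ll + lh + hl + hh) / 2.0
    x[0::2, 1::2] = (ll + lh - hl - hh) / 2.0
    x[1::2, 0::2] = (ll - lh + hl - hh) / 2.0
    x[1::2, 1::2] = (ll - lh - hl + hh) / 2.0
    return x


# A 4x4 map with a vertical edge between columns 0 and 1.
img = np.tile(np.array([0.0, 1.0, 1.0, 1.0]), (4, 1))
ll, (lh, hl, hh) = haar_dwt2(img)
```

In this toy input only the left-minus-right band `hl` is nonzero, and only at the blocks that contain the edge, while `ll` holds a half-resolution smooth version of the map. Because the transform is invertible, no information is lost by splitting the signal this way.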