Self-Supervised Feature Augmentation for Large Image Object Detection

doi:10.1109/TIP.2020.2993403

CORC > 自动化研究所 > 中国科学院自动化研究所 > 模式识别国家重点实验室 > 多媒体计算与图形学团队

	Self-Supervised Feature Augmentation for Large Image Object Detection
	Pan, Xingjia 1,1; Tang, Fan 5; Dong, Weiming 1; Gu, Yang 2; Song, Zhichao 3; Meng, Yiping 3; Xu, Pengfei 3; Oliver, Deussen 6; Xu, Changsheng 1,4
刊名	IEEE Transactions on Image Processing
	2020-05-14
卷号	29 期号:0 页码:6745-6758
关键词	object detection large image self-supervise feature augmentation
DOI	10.1109/TIP.2020.2993403
英文摘要	Input scale plays an important role in modern detection frameworks, and an optimal training scale for images exists empirically. However, the optimal one usually cannot be reached in facing extremely large images under the memory constraint. In this study, we explore the scale effect inside the object detection pipeline and find that feature upsampling with the introduction of high-resolution information benefits the detection. Compared with direct input upscaling, feature upsampling trades a small performance loss for a large amount of memory savings. From these observations, we propose a self-supervised feature augmentation network, which takes downsampled images as inputs and aims to generate comparable features with the ones when feeding upscaled images to networks. We present a guided feature upsampling module, which takes downsampled images as inputs, to learn upscaled feature representations with the supervision of real large features acquired from upscaled images. In a self-supervised learning manner, we can introduce detailed information of images to the network. For an efficient feature upsampling, we design a residualized sub-pixel convolution block based on a sub-pixel convolution layer, which involves considerable information in upsampling process. Experiments on Mapillary Vistas Dataset (MVD), Cityscapes, and COCO are conducted to demonstrate the effectiveness of our method. On the MVD and Cityscapes detection benchmarks, in which the images are extremely large, our method surpasses current approaches. On COCO, the proposed method obtains comparable results to existing methods but with higher efficiency.
语种	英语
内容类型	期刊论文
源URL	[http://ir.ia.ac.cn/handle/173211/41615]
专题	自动化研究所_模式识别国家重点实验室_多媒体计算与图形学团队
通讯作者	Dong, Weiming
作者单位	1.NLPR, Institute of Automation, Chinese Academy of Sciences 2.Moment.ai 3.Didi Chuxing 4.University of Chinese Academy of Sciences 5.Jilin University 6.University of Konstanz
推荐引用方式 GB/T 7714	Pan, Xingjia,Tang, Fan,Dong, Weiming,et al. Self-Supervised Feature Augmentation for Large Image Object Detection[J]. IEEE Transactions on Image Processing,2020,29(0):6745-6758.
APA	Pan, Xingjia.,Tang, Fan.,Dong, Weiming.,Gu, Yang.,Song, Zhichao.,...&Xu, Changsheng.(2020).Self-Supervised Feature Augmentation for Large Image Object Detection.IEEE Transactions on Image Processing,29(0),6745-6758.
MLA	Pan, Xingjia,et al."Self-Supervised Feature Augmentation for Large Image Object Detection".IEEE Transactions on Image Processing 29.0(2020):6745-6758.