| Self-Supervised Feature Augmentation for Large Image Object Detection |
| Pan, Xingjia1,1; Tang, Fan5; Dong, Weiming1; Gu, Yang2; Song, Zhichao3; Meng, Yiping3; Xu, Pengfei3; Oliver, Deussen6; Xu, Changsheng1,4
|
刊名 | IEEE Transactions on Image Processing
|
| 2020-05-14
|
卷号 | 29期号:0页码:6745-6758 |
关键词 | object detection
large image
self-supervise
feature augmentation
|
DOI | 10.1109/TIP.2020.2993403
|
英文摘要 | Input scale plays an important role in modern
detection frameworks, and an optimal training scale for images
exists empirically. However, the optimal one usually cannot be
reached in facing extremely large images under the memory
constraint. In this study, we explore the scale effect inside the
object detection pipeline and find that feature upsampling with
the introduction of high-resolution information benefits the detection.
Compared with direct input upscaling, feature upsampling
trades a small performance loss for a large amount of memory
savings. From these observations, we propose a self-supervised
feature augmentation network, which takes downsampled images
as inputs and aims to generate comparable features with the
ones when feeding upscaled images to networks. We present a
guided feature upsampling module, which takes downsampled
images as inputs, to learn upscaled feature representations with
the supervision of real large features acquired from upscaled
images. In a self-supervised learning manner, we can introduce
detailed information of images to the network. For an efficient
feature upsampling, we design a residualized sub-pixel convolution
block based on a sub-pixel convolution layer, which involves
considerable information in upsampling process. Experiments on
Mapillary Vistas Dataset (MVD), Cityscapes, and COCO are
conducted to demonstrate the effectiveness of our method. On the
MVD and Cityscapes detection benchmarks, in which the images
are extremely large, our method surpasses current approaches.
On COCO, the proposed method obtains comparable results to
existing methods but with higher efficiency. |
语种 | 英语
|
内容类型 | 期刊论文
|
源URL | [http://ir.ia.ac.cn/handle/173211/41615] |
专题 | 自动化研究所_模式识别国家重点实验室_多媒体计算与图形学团队
|
通讯作者 | Dong, Weiming |
作者单位 | 1.NLPR, Institute of Automation, Chinese Academy of Sciences 2.Moment.ai 3.Didi Chuxing 4.University of Chinese Academy of Sciences 5.Jilin University 6.University of Konstanz
|
推荐引用方式 GB/T 7714 |
Pan, Xingjia,Tang, Fan,Dong, Weiming,et al. Self-Supervised Feature Augmentation for Large Image Object Detection[J]. IEEE Transactions on Image Processing,2020,29(0):6745-6758.
|
APA |
Pan, Xingjia.,Tang, Fan.,Dong, Weiming.,Gu, Yang.,Song, Zhichao.,...&Xu, Changsheng.(2020).Self-Supervised Feature Augmentation for Large Image Object Detection.IEEE Transactions on Image Processing,29(0),6745-6758.
|
MLA |
Pan, Xingjia,et al."Self-Supervised Feature Augmentation for Large Image Object Detection".IEEE Transactions on Image Processing 29.0(2020):6745-6758.
|
修改评论