Super Sparse 3D Object Detection
Fan, Lue1,2; Yang, Yuxue1,3; Wang, Feng5; Wang, Naiyan5; Zhang, Zhaoxiang1,3,4
刊名IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE
2023-10-01
卷号45期号:10页码:12490-12505
关键词Feature extraction Detectors Point cloud compression Three-dimensional displays Proposals Object detection Laser radar 3D object detection autonomous driving instance segmentation LiDAR point clustering sparse temporal fusion waymo open dataset
ISSN号0162-8828
DOI10.1109/TPAMI.2023.3286409
通讯作者Zhang, Zhaoxiang(zhaoxiang.zhang@ia.ac.cn)
英文摘要As the perception range of LiDAR expands, LiDAR-based 3D object detection contributes ever-increasingly to the long-range perception in autonomous driving. Mainstream 3D object detectors often build dense feature maps, where the cost is quadratic to the perception range, making them hardly scale up to the long-range settings. To enable efficient long-range detection, we first propose a fully sparse object detector termed FSD. FSD is built upon the general sparse voxel encoder and a novel sparse instance recognition (SIR) module. SIR groups the points into instances and applies highly-efficient instance-wise feature extraction. The instance-wise grouping sidesteps the issue of the center feature missing, which hinders the design of the fully sparse architecture. To further enjoy the benefit of fully sparse characteristic, we leverage temporal information to remove data redundancy and propose a super sparse detector named FSD++. FSD++ first generates residual points, which indicate the point changes between consecutive frames. The residual points, along with a few previous foreground points, form the super sparse input data, greatly reducing data redundancy and computational overhead. We comprehensively analyze our method on the large-scale Waymo Open Dataset, and state-of-the-art performance is reported. To showcase the superiority of our method in long-range detection, we also conduct experiments on Argoverse 2 Dataset, where the perception range (200 m) is much larger thanWaymo Open Dataset (75 m).
资助项目National Natural Science Foundation of China[2018AAA0100400] ; [61836014] ; [U21B2042] ; [62072457] ; [62006231]
WOS关键词TIME
WOS研究方向Computer Science ; Engineering
语种英语
出版者IEEE COMPUTER SOC
WOS记录号WOS:001197545600001
资助机构National Natural Science Foundation of China
内容类型期刊论文
源URL[http://ir.ia.ac.cn/handle/173211/58101]  
专题自动化研究所_智能感知与计算研究中心
通讯作者Zhang, Zhaoxiang
作者单位1.Chinese Acad Sci CASIA, Inst Automat, Natl Lab Pattern Recognit NLPR, Ctr Res Intelligent Percept & Comp CRIPAC, Beijing 100190, Peoples R China
2.Univ Chinese Acad Sci UCAS, Sch Future Technol, Beijing 100049, Peoples R China
3.Univ Chinese Acad Sci UCAS, Sch Artificial Intelligence, Beijing 100049, Peoples R China
4.Chinese Acad Sci HKISI CAS, Hong Kong Inst Sci & Innovat, Ctr Artificial Intelligence & Robot, Hong Kong, Peoples R China
5.Tusimple, Beijing 100020, Peoples R China
推荐引用方式
GB/T 7714
Fan, Lue,Yang, Yuxue,Wang, Feng,et al. Super Sparse 3D Object Detection[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE,2023,45(10):12490-12505.
APA Fan, Lue,Yang, Yuxue,Wang, Feng,Wang, Naiyan,&Zhang, Zhaoxiang.(2023).Super Sparse 3D Object Detection.IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE,45(10),12490-12505.
MLA Fan, Lue,et al."Super Sparse 3D Object Detection".IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 45.10(2023):12490-12505.
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace