CoDRMA: Collaborative Depth Refinement via Dual-Mask and Dual-Attention for Bird’s Eye View Collaborative 3D Object Detection | |
Yang,Kang2; Wang, Yongcai2; Han, Yunjun1; Jia,Qingshan3 | |
2024 | |
会议日期 | 2024年8月28 |
会议地点 | Bari,Italy |
英文摘要 | In collaborative perception, camera-based approach is more informative and economical than Lidar-based approach. However, currently, camera-based methods still have a significant performance gap compared to the Lidar-based approach due to the difficulty and uncertainty involved in depth estimation. This paper introduces a strategy that refines depth estimation using foreground and background information, which empowers accurate Bird’s Eye View (BEV) collaborative 3D detection by multi-agents. Our strategy encompasses two stages: Initially, we introduce the Dual-Mask to enhance depth estimation and employ Bird’s Eye View (BEV) paradigms for integrating multi-viewpoint data, facilitating a comprehensive scene analysis. In the second stage, we generate pseudo-images by fusing depth and masks as auxiliary messages. A Dual- Attention scheme is proposed, which leverages multi-agent communication to augment auxiliary insights and further refine depth estimations. By refining the depth information twice, our method effectively improves BEV-based collaboration 3D object detection accuracy especially the occlused and long distance objects. Experiments on the OPV2V dataset show that our method achieves state-of-the-art performance in 3D object detection task among known camera-based methods, narrowing the gap with Lidar-based methods. Codes will be made available. |
内容类型 | 会议论文 |
源URL | [http://ir.ia.ac.cn/handle/173211/57370] |
专题 | 自动化研究所_复杂系统管理与控制国家重点实验室_先进控制与自动化团队 |
通讯作者 | Han, Yunjun |
作者单位 | 1.Institute of Automation, Chinese Academy of Science 2.School of Information, Renmin University of China 3.Department of Automation, Tsinghua University |
推荐引用方式 GB/T 7714 | Yang,Kang,Wang, Yongcai,Han, Yunjun,et al. CoDRMA: Collaborative Depth Refinement via Dual-Mask and Dual-Attention for Bird’s Eye View Collaborative 3D Object Detection[C]. 见:. Bari,Italy. 2024年8月28. |
个性服务 |
查看访问统计 |
相关权益政策 |
暂无数据 |
收藏/分享 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论