In this paper we introduce BEVFusion, an efficient and generic multi-task multi-sensor fusion framework. It unifies multimodal features in the shared bird’s-eye view (BEV) representation space, which nicely preserves both geometric and semantic information.
2022: Zhijian Liu, Haotian Tang, Alexander Amini, Xinyu Yang, Huizi Mao, D. Rus, Song Han
Ranked #1 on 3D Object Detection on nuScenes
https://arxiv.org/pdf/2205.13542v1.pdf
view more