The Engineer - 3D-mapping technique could improve navigation for autonomous vehicles

Because the technique works effectively with limited computational resources, it holds promise for improving the navigation of autonomous vehicles.

“Most autonomous vehicles use powerful AI programs called vision transformers to take 2D images from multiple cameras and create a representation of the 3D space around the vehicle,” said Tianfu Wu, corresponding author of a paper on the work and an associate professor of electrical and computer engineering at North Carolina State University. “However, while each of these AI programs takes a different approach, there is still substantial room for improvement.

Wu continued: “Our technique, called Multi-View Attentive Contextualization [MvACon], is a plug-and-play supplement that can be used in conjunction with these existing vision transformer AIs to improve their ability to map 3D spaces. The vision transformers aren’t getting any additional data from their cameras, they’re just able to make better use of the data.”

Related content

MonoXiver helps AI turn 2D into 3D imagery for autonomous vehicle cameras

MvACon modifies Patch-to-Cluster attention (PaCa), which Wu and his collaborators released last year. PaCa allows transformer AIs to more efficiently and effectively identify objects in an image.

“The key advance here is applying what we demonstrated with PaCa to the challenge of mapping 3D space using multiple cameras,” said Wu.

To test the performance of MvACon, the researchers used it in conjunction with three vision transformers, namely BEVFormer, the BEVFormer DFA3D variant, and PETR that each collected 2D images from six different cameras. In all three instances, MvACon is said to have significantly improved the performance of each vision transformer.

“Performance was particularly improved when it came to locating objects, as well as the speed and orientation of those objects,” said Wu. “And the increase in computational demand of adding MvACon to the vision transformers was almost negligible.

“Our next steps include testing MvACon against additional benchmark datasets, as well as testing it against actual video input from autonomous vehicles. If MvACon continues to outperform the existing vision transformers, we’re optimistic that it will be adopted for widespread use.”

The team’s paper - Multi-View Attentive Contextualization for Multi-View 3D Object Detection - will be presented on June 20 at the IEEE/CVF Conference on Computer Vision and Pattern Recognition, which is being held in Seattle, Washington.

Engineering publications brought to you by Mark Allen

Job of the week

Senior Cyber Security Engineer

Electronics

Automotive

Robotics & UAVs

3D-mapping technique could improve navigation for autonomous vehicles

Related content

Related Articles

Latest Comments

McMurtry Spéirling defies gravity using fan downforce

UK steam locomotive is first to operate with digital ETCS signalling system

UK steam locomotive is first to operate with digital ETCS signalling system

Engineering publications brought to you by Mark Allen

Senior Cyber Security Engineer

Electronics

Automotive

Robotics & UAVs

3D-mapping technique could improve navigation for autonomous vehicles

Related content

Related Articles

MonoXiver helps AI turn 2D into 3D imagery for autonomous vehicle cameras

New technique helps autonomous vehicles navigate complex traffic situations

AI program coverts 2D into 3D

Latest Comments

McMurtry Spéirling defies gravity using fan downforce

UK steam locomotive is first to operate with digital ETCS signalling system

UK steam locomotive is first to operate with digital ETCS signalling system