Autonomous Driving Algorithm Research: BEV Drives Algorithm Revolution, AI Large Model Promotes Algorithm Iteration
The core of the autonomous driving algorithm technical framework is divided into three parts: environment perception, decision planning, and control execution.
Environment perception: convert sensor data into machine language of the scenario where the vehicle is located, which can include object detection, recognition and tracking, environment modeling, motion estimation, etc.;
Decision planning: Based on the output results of perception algorithm, the final behavioral action instructions are given, including behavioral decisions (vehicle following, stopping and overtaking), action decisions (car steering, speed, etc.), path planning, etc.;
Control actuation: according to the output results of decision-making level, the underlying modules are mobilized to issue instructions to the core control components such as accelerator and brake, and promote vehicle to drive according to the planned route.
BEV drives algorithm revolution
In recent years, BEV perception has received extensive attention. BEV model mainly provides a unified space to facilitate the fusion of various tasks and sensors. It has following advantages:
BEV unifies the multimodal data processing dimension and makes multimodal fusion easier
The BEV perception system converts the information obtained from multiple cameras or radars to a bird's-eye view, and then do tasks such as object detection and instance segmentation, which can more intuitively display the dimension and direction of objects in BEV space.
In 2022, Peking University & Ali proposed a fusion framework of LiDAR and vision - BEVFusion. The processing of radar point clouds and image processing are carried out independently, using neural networks to encode, project to a unified BEV space, and then merge the two in BEV space.
Realize timing information fusion and build 4D space
In the 4D space, the perception algorithm can better complete the perception tasks such as speed measurement, and can transmit the results of motion prediction to the decision and control module.
PhiGent Robotics proposed BEVDet4D in 2022, which is a version based on BEVDet to increase timing fusion. BEVDet4D extends BEVDet by retaining intermediate BEV features of past frames, and then fuses features by aligning and splicing with the current frame, so that time clues can be obtained by querying two candidate features.
Imagine occluded objects to realize object prediction
In the BEV space, the algorithm can predict the occluded area based on prior knowledge, and imagine whether there are objects in the occluded area.
FIERY, proposed by Wayve in cooperation with the University of Cambridge in 2021, is an end-to-end road dynamic object instance prediction algorithm that does not rely on high-precision maps and is only based on aerial views of monocular cameras.
Promoting development of an end-to-end autonomous driving framework
In the BEV space, perception and prediction can be directly optimized end-to-end through neural networks in a unified space, and the results can be obtained at the same time. Not only the perception module, but also the BEV-based planning decision-making module is also the direction of academic research.
In 2022, autonomous driving team of Shanghai Artificial Intelligence Laboratory and the team of associate professor Yan Junchi of Shanghai Jiao Tong University collaborated on paper ST-P3 to propose a spatiotemporal feature learning solution that can simultaneously provide a set of more representative features for perception, prediction and planning tasks.
AI large model drives algorithm iteration
After 2012, deep learning algorithms are widely applied in autonomous driving field. In order to support larger and more complex AI computing needs, AI large models with the characteristics of "huge data, huge computing power, and huge algorithms" were born, which accelerated the iteration speed of algorithms.
Large Model and Intelligent Computing Center
In 2021, HAOMO.AI launched research and landing attempts on large-scale Transformer model, and then gradually applied it on a large scale in projects including multi-modal perception data fusion and cognitive model training. In December 2021, HAOMO.AI released autonomous driving data intelligence system MANA (Chinese name "Snow Lake"), which integrates perception, cognition, labeling, simulation, computing and other aspects. In January 2023, HAOMO.AI together with Volcano Engine unveiled MANA OASIS, a supercomputing center with a total computing power of 670 PFLOPS. After deploying HAOMO.AI's training platform, OASIS can run various applications including cloud large-scale model training, vehicle-side model training, annotation, and simulation. With the help of MANA OASIS, the five major models of HAOMO.AI have ushered in a new appearance and upgrade.
In August 2022, based on Alibaba Cloud intelligent computing platform, Xpeng Motors built an autonomous driving intelligent computing center "Fuyao", which is dedicated to training of autonomous driving models. In October 2022, Xpeng also announced the introduction of Transformer large model.
In November 2022, Baidu released Wenxin Big Model. Leveraging more than 1 billion parameters, it recognizes thousands of objects, helping to enlarge the scope of semantic recognition. At present, it is mainly used in three aspects: distance vision, multimodality and data mining.
Automotive DMS/OMS (Driver/Occupant Monitoring System) Research Report, 2023-2024
In-cabin Monitoring study: installation rate increases by 81.3% in first ten months of 2023, what are the driving factors?
ResearchInChina released "Automotive DMS/OMS (Driver/Occupant Monitoring Sys...
Automotive Functional Safety and Safety Of The Intended Functionality (SOTIF) Research Report, 2024
As intelligent connected vehicles boom, the change in automotive EEA has been accelerated, and the risks caused by electronic and electrical failures have become ever higher. As a result, functional s...
Autonomous Driving Map Industry Report,2024
As the supervision of HD map qualifications tightens, issues such as map collection cost, update frequency, and coverage stand out. Amid the boom of urban NOA, the "lightweight map" intelligent drivin...
Automotive Vision Industry Research Report, 2023
From January to September 2023, 48.172 million cameras were installed in new cars in China, a like-on-like jump of 34.1%, including:
9.209 million front view cameras, up 33.0%; 3.875 million side vi...
Automotive Voice Industry Report, 2023-2024
The automotive voice interaction market is characterized by the following:
1. In OEM market, 46 brands install automotive voice as a standard configuration in 2023.
From 2019 to the first nine month...
Two-wheeler Intelligence and Industry Chain Research Report, 2023
In recent years, two-wheelers have headed in the direction of intelligent connection and intelligent driving, which has been accompanied by consumption upgrade, and mature applications of big data, ar...
Commercial Vehicle Telematics Industry Report, 2023-2024
The market tends to be more concentrated in leading companies in terms of hardware.
The commercial vehicle telematics industry chain covers several key links such as OEMs, operators, terminal device ...
Automotive Camera Tier2 Suppliers Research Report, 2023
1. Automotive lens companies: "camera module segment + emerging suppliers" facilitates the rise of Chinese products.
In 2023, automotive lens companies still maintain a three-echelon pattern. The fir...
China Passenger Car Navigate on Autopilot (NOA) Industry Report, 2023
Intelligent driving is evolving from L2 to L2+ and L2++, and Navigate on Autopilot (NOA) has become a layout focus in the industry. How is NOA advancing at present? What are hotspots in the market? Wh...
Automotive Telematics Service Providers (TSP) and Application Services Research Report, 2023-2024
From January to September 2023, the penetration of telematics in passenger cars in China hit 77.6%, up 12.8 percentage points from the prior-year period. The rising penetration of telematics provides ...
Passenger Car Intelligent Chassis and Chassis Domain Controller Research Report, 2023
Passenger Car Intelligent Chassis and Chassis Domain Controller Research Report, 2023, released by ResearchInChina combs through three integration trends of brake-by-wire, steer-by-wire, and active su...
Automotive Smart Cockpit Design Trend Report, 2023
As the most intuitive window to experience automotive intelligent technology, intelligent cockpit is steadily moving towards the deep end of “intelligence”, and automakers have worked to deploy intell...
China Automotive Multimodal Interaction Development Research Report, 2023
China Automotive Multimodal Interaction Development Research Report, 2023 released by ResearchInChina combs through the interaction modes of mainstream cockpits, the application of interaction modes i...
Automotive Smart Surface Research Report, 2023
Market status: vehicle models with smart surfaces boom in 2023
From 2018 to 2023, there were an increasing number of models equipped with smart surfaces, up to 52,000 units in 2022 and 256,000 units ...
Passenger Car Intelligent Steering Industry Report, 2023
Passenger Car Intelligent Steering Industry Report, 2023 released by ResearchInChina combs through and studies the status quo of passenger car intelligent steering and the product layout of OEMs, supp...
Automotive High-precision Positioning Research Report, 2023-2024
Autonomous driving is rapidly advancing from highway NOA to urban NOA, and poses ever higher technical requirements for high-precision positioning, highlighting the following:
1. Higher accuracy: urb...
New Energy Vehicle Thermal Management System Research Report, 2023
Thermal management system research: the mass production of CO? heat pumps, integrated controllers and other innovative products accelerates
Thermal management of new energy vehicles coordinates the c...
Commercial Vehicle Intelligent Chassis Industry Report, 2023
Commercial Vehicle Intelligent Chassis Industry Report, 2023, released by ResearchInChina, combs through and researches status quo and related product layout of OEMs and suppliers, and predicts future...