Autonomous Driving Algorithm Research: BEV Drives Algorithm Revolution, AI Large Model Promotes Algorithm Iteration
The core of the autonomous driving algorithm technical framework is divided into three parts: environment perception, decision planning, and control execution.
Environment perception: convert sensor data into machine language of the scenario where the vehicle is located, which can include object detection, recognition and tracking, environment modeling, motion estimation, etc.;
Decision planning: Based on the output results of perception algorithm, the final behavioral action instructions are given, including behavioral decisions (vehicle following, stopping and overtaking), action decisions (car steering, speed, etc.), path planning, etc.;
Control actuation: according to the output results of decision-making level, the underlying modules are mobilized to issue instructions to the core control components such as accelerator and brake, and promote vehicle to drive according to the planned route.
BEV drives algorithm revolution
In recent years, BEV perception has received extensive attention. BEV model mainly provides a unified space to facilitate the fusion of various tasks and sensors. It has following advantages:
BEV unifies the multimodal data processing dimension and makes multimodal fusion easier
The BEV perception system converts the information obtained from multiple cameras or radars to a bird's-eye view, and then do tasks such as object detection and instance segmentation, which can more intuitively display the dimension and direction of objects in BEV space.
In 2022, Peking University & Ali proposed a fusion framework of LiDAR and vision - BEVFusion. The processing of radar point clouds and image processing are carried out independently, using neural networks to encode, project to a unified BEV space, and then merge the two in BEV space.

Realize timing information fusion and build 4D space
In the 4D space, the perception algorithm can better complete the perception tasks such as speed measurement, and can transmit the results of motion prediction to the decision and control module.
PhiGent Robotics proposed BEVDet4D in 2022, which is a version based on BEVDet to increase timing fusion. BEVDet4D extends BEVDet by retaining intermediate BEV features of past frames, and then fuses features by aligning and splicing with the current frame, so that time clues can be obtained by querying two candidate features.

Imagine occluded objects to realize object prediction
In the BEV space, the algorithm can predict the occluded area based on prior knowledge, and imagine whether there are objects in the occluded area.
FIERY, proposed by Wayve in cooperation with the University of Cambridge in 2021, is an end-to-end road dynamic object instance prediction algorithm that does not rely on high-precision maps and is only based on aerial views of monocular cameras.

Promoting development of an end-to-end autonomous driving framework
In the BEV space, perception and prediction can be directly optimized end-to-end through neural networks in a unified space, and the results can be obtained at the same time. Not only the perception module, but also the BEV-based planning decision-making module is also the direction of academic research.
In 2022, autonomous driving team of Shanghai Artificial Intelligence Laboratory and the team of associate professor Yan Junchi of Shanghai Jiao Tong University collaborated on paper ST-P3 to propose a spatiotemporal feature learning solution that can simultaneously provide a set of more representative features for perception, prediction and planning tasks.

AI large model drives algorithm iteration
After 2012, deep learning algorithms are widely applied in autonomous driving field. In order to support larger and more complex AI computing needs, AI large models with the characteristics of "huge data, huge computing power, and huge algorithms" were born, which accelerated the iteration speed of algorithms.
Large Model and Intelligent Computing Center
In 2021, HAOMO.AI launched research and landing attempts on large-scale Transformer model, and then gradually applied it on a large scale in projects including multi-modal perception data fusion and cognitive model training. In December 2021, HAOMO.AI released autonomous driving data intelligence system MANA (Chinese name "Snow Lake"), which integrates perception, cognition, labeling, simulation, computing and other aspects. In January 2023, HAOMO.AI together with Volcano Engine unveiled MANA OASIS, a supercomputing center with a total computing power of 670 PFLOPS. After deploying HAOMO.AI's training platform, OASIS can run various applications including cloud large-scale model training, vehicle-side model training, annotation, and simulation. With the help of MANA OASIS, the five major models of HAOMO.AI have ushered in a new appearance and upgrade.

In August 2022, based on Alibaba Cloud intelligent computing platform, Xpeng Motors built an autonomous driving intelligent computing center "Fuyao", which is dedicated to training of autonomous driving models. In October 2022, Xpeng also announced the introduction of Transformer large model.

In November 2022, Baidu released Wenxin Big Model. Leveraging more than 1 billion parameters, it recognizes thousands of objects, helping to enlarge the scope of semantic recognition. At present, it is mainly used in three aspects: distance vision, multimodality and data mining.

AI/AR Glasses Industry Research Report, 2025
ResearchInChina released the " AI/AR Glasses Industry Research Report, 2025", which deeply explores the field of AI smart glasses, sorts out product R&D and ecological layout of leading domestic a...
Global and China Passenger Car T-Box Market Report 2025
T-Box Research: T-Box will achieve functional upgrades given the demand from CVIS and end-to-end autonomous driving
ResearchInChina released the "Global and China Passenger Car T-Box Market Report 20...
Automotive Microcontroller Unit (MCU) Industry Report, 2025
Research on automotive MCUs: the independent, controllable supply chain for automotive MCUs is rapidly maturing
Mid-to-high-end MCUs for intelligent vehicle control are a key focus of domestic produc...
Automotive LiDAR Industry Report, 2024-2025
In early 2025, BYD's "Eye of God" Intelligent Driving and Changan Automobile's Tianshu Intelligent Driving sparked a wave of mass intelligent driving, making the democratization of intelligent driving...
Software-Defined Vehicles in 2025: SOA and Middleware Industry Research Report
Research on automotive SOA and middleware: Development towards global SOA, cross-domain communication middleware, AI middleware, etc.
With the implementation of centrally integrated EEAs, OEM softwar...
Global and Chinese OEMs’ Modular and Common Technology Platform Research Report, 2025
Modular platforms and common technology platforms of OEMs are at the core of current technological innovation in automotive industry, aiming to enhance R&D efficiency, reduce costs, and accelerate...
Research Report on the Application of AI in Automotive Cockpits, 2025
Cockpit AI Application Research: From "Usable" to "User-Friendly," from "Deep Interaction" to "Self-Evolution"
From the early 2000s, when voice recognition and facial monitoring functions were first ...
Analysis on Li Auto’s Layout in Electrification, Connectivity, Intelligence and Sharing, 2024-2025
Mind GPT: The "super brain" of automotive AI Li Xiang regards Mind GPT as the core of Li Auto’s AI strategy. As of January 2025, Mind GPT had undergone multip...
Automotive High-precision Positioning Research Report, 2025
High-precision positioning research: IMU develops towards "domain controller integration" and "software/hardware integrated service integration"
According to ResearchInChina, in 2024, the penetration...
China Passenger Car Digital Chassis Research Report, 2025
Digital chassis research: Local OEMs accelerate chassis digitization and AI
1. What is the “digital chassis”?
Previously, we mostly talked about concepts such as traditional chassis, ch...
Automotive Micromotor and Motion Mechanism Industry Report, 2025
Automotive Micromotor and Motion Mechanism Research: More automotive micromotors and motion mechanisms are used in a single vehicle, especially in cockpits, autonomous driving and other scenarios.
Au...
Research Report on AI Foundation Models and Their Applications in Automotive Field, 2024-2025
Research on AI foundation models and automotive applications: reasoning, cost reduction, and explainability
Reasoning capabilities drive up the performance of foundation models.
Since the second ha...
China's New Passenger Cars and Suppliers' Characteristics Research Report, 2024-2025
Trends of new cars and suppliers in 2024-2025: New in-vehicle displays are installed, promising trend of AI and cars is coming
ResearchInChina releases the China's New Passenger Cars and Suppli...
Global and China Skateboard Chassis Industry Report, 2024-2025
Skateboard chassis research: already used in 8 production models, and larger-scale production expected beyond 2025
Global and China Skateboard Chassis Industry Report, 2024-2025 released by ResearchI...
Two-wheeler Intelligence and Industry Chain Research Report, 2024-2025
Research on the two-wheeler intelligence: OEMs flock to enter the market, and the two-wheeler intelligence continues to improve
This report focuses on the upgrade of two-wheeler intelligence, analyz...
Automotive MEMS (Micro Electromechanical System) Sensor Research Report, 2025
Automotive MEMS Research: A single vehicle packs 100+ MEMS sensors, and the pace of product innovation and localization are becoming much faster.
MEMS (Micro Electromechanical System) is a micro devi...
Intelligent Vehicle Cockpit-driving Integration (Cockpit-driving-parking) Industry Report, 2024-2025
Cockpit-driving integration is gaining momentum, and single-chip solutions are on the horizon
The Intelligent Vehicle Cockpit-driving Integration (Cockpit-driving-parking) Industry Repor...
Automotive TSP and Application Service Research Report, 2024-2025
TSP Research: In-vehicle connectivity services expand in the direction of cross-domain integration, all-scenario integration and cockpit-driving integration
TSP (Telematics Service Provider) is mainl...