Automotive AI Agent Product Development and Commercialization Research Report, 2024

Search by Type

Report DataBase News

Abstract

Selected Charts

Related Reports

Related Companies

Automotive AI Agent product development: How to enable “cockpit endorser” via foundation models?

According to OPEN AI’s taxonomy of AI (a total of 5 levels), AI Agent is at L3 in the AI development path:

AGENT 商业化 1.png

Limited by interaction modes and tool usage capabilities, popular foundation models in 2023 can only reach L2 (Reasoners) at most. In contrast, developing the automotive AI system by building automotive agents is a more appropriate goal: Agent improves weak links of application of foundation models in scenarios by way of calling active intelligent features and multiple tools/foundation models, further improving cockpit intelligence level.

Agent is the endorser of emotional cockpits

"Emotional cockpit" has been nothing new for multiple years, but actually realizing it still starts with the introduction of foundation models in vehicles. Under specific triggering conditions, voice assistant chats with the user through preset emotional corpus, but it cannot adapt to human dialogue logic in real chat scenarios. After being applied to vehicles, Agent integrated with multiple foundation model bases can recognize the environment more accurately, and more tool library interfaces further enhances its generalization capability to cope with chat and Q&A in diversified scenarios, truly realizing the warm companionship of the "cockpit endorser".

AGENT 商业化 2.png

The design of mainstream emotional interaction scenarios focuses on emotion recognition, user memory, and behavior arrangement. Some OEMs and Tier1s have also launched technologies or products to enhance the emotional value of Agents:

AGENT 商业化 3.png

For example, Xiaoai Tongxue’s "emotional dialogue system" is built in three steps:

AGENT 商业化 4.png

The mixed strategy dredging model is composed of three important components: mental state-enhanced encoder, mixed strategy learning module, and multi-factor-aware decoder.

AGENT 商业化 5.png

The Institute of Digital Games at University of Malta proposes the Affectively Framework, establishes an emotional model, and adopts behavior reward and affective reward mechanisms in the training process to help Agents better understand human emotions and interact with humans more naturally.

AGENT 商业化 6.png

Sore points that need to be solved to improve user experience

Imagine that an intelligent cockpit can not only understand and execute instructions given by the car owner, but also predict the owner's needs, just like a thoughtful personal assistant. Will this make car owners more excited? Compared to buying a traditional car and having to explore each function on one’s own, everyone wants a cockpit "endorser" which can help manage all cockpit functions as they just say a few words. Agent is a time-saving and trouble-free solution.

Currently most Agents introduced in vehicles still serve as an assistant and a companion listing functions for specific scenarios. Yet compared with foundation models, Agents feature greater potential, motivated autonomy, and outstanding tool-using capabilities, more fit with the label of "active intelligence", and can even make up for the limitations of foundation models in practical applications.

AGENT 商业化 7.png

There is however still a long way to go in technology development to make automotive agents truly "active and intelligent" and meet users' experience value. Agent needs to be more precise in active perception, data processing, state recognition, etc., accurately understand the environment, judge real needs of people in the car, and then adopt corresponding strategies. Wherein, one of challenges lies in Agent's accurate judgment of user needs. Compared with passive interaction in normal circumstances, active intention recognition lacks voice commands. In the process of environment/personnel/vehicle state recognition, it may not be possible to obtain a description that is extremely close to the current scenario through vector feature matching, and the preset solution may not satisfy the real intentions of people in the car.

At present, most recommended functions are just to execute preset instructions. This limits "active and intelligent" capabilities of Agent and leads to frequent sore points in the reasoning process. For example, if Agent fails to accurately understand the current scenario, it may not make recommendations as expected, for instance, recommending music or navigation at a wrong time. The final result is to affect user experience and make the Agent become a "guessing machine" to users.

AGENT 商业化 8.png

In addition, Agent also has shortcomings in perception when receiving voice commands. According to ResearchInChina’s incomplete statistics on sore points in automotive agent use cases of some car owners, the most frequent sore points are wake-up failure, recognition error, and false wake-up.

Among the 120 cases, wake-up failure, recognition error, and false wake-up are mentioned 19, 18, and 17 times respectively, namely, accounting for 16%, 15%, and 14%. Other sore points also include unavailability of see-and-speak, semantic clarification and continuous commands, inability to recognize dialects, and delayed response, totaling 89 sore points in voice link, or 74.2% of the total in this statistical survey.

Furthermore, a range of problems caused by unreasonable Agent architecture/scenario design also include irrational scenario triggering conditions, secondary wake-up of foundation models, failure of long/short-term memory, and recommended actions made autonomously according to owners' habits but failing to meet expectations, which respectively reflect limitations of Agent in scenario setting, architecture deployment, memory module, and reflection module.

In summary, sore points of users are concentrated in the perception and reasoning links:
Perception: wake-up failure, false wake-up, recognition error, unavailability of see-and-speak, delayed response, etc.
Reasoning: object recognition error, autonomous recommendation failing to meet user expectations, etc.

Quick-response multi-agent framework

To enable all the functions of the "endorser" in cockpit, it is very critical to design the service framework of Agent in diversified scenarios. Agent framework is relatively flexible in construction. The simplest "receiver + executer" architecture can be used, or a more complex multi-agent architecture can be built. Its design principle is very simple: as long as it can solve user problems in a specific scenario, it is a good framework design. As a qualified "cockpit endorser", automotive Agent not only needs to act as an independent thinker, make decisions and solve problems on its own, but also quickly and freely adopts human behavior patterns, acting as a human.

A typical example is NIO Nomi. It uses a multi-agent architecture, calling different tools in different scenarios, and using multiple agents with different functions to perform specific duties and jointly complete the process of understanding needs, making decisions, executing tasks, and reflecting on iterations. The multi-agent architecture allows Nomi to not only make quick response, but also react more naturally like a human. Its seamless integration with other vehicle functions brings smoother experiences.

Compared with single-agent systems, multi-agent systems are more suitable for executing complex instructions. They are like a small community in which each "agent" has its own tasks, but can cooperate to complete more complex tasks. For example, one agent is responsible for understanding your instructions, another is responsible for making a decision, and there are special agents to perform tasks. This design makes automotive AI Agent systems more flexible and allows them to handle more diverse tasks. For example, the Commonwealth Scientific and Industrial Research Organization (CSIRO) of Australia proposed a multi-agent system that uses both collaboration agents and execution agents:

The entire Agent framework is divided into 6 modules, namely, Understanding & Interaction, Reasoning, Tool Use, Multi-Agent Collaboration, Reflection, and Alignment. It embraces mainstream Agent design patterns, and covers the entire process from active perception, reasoning and decision, tool calling to generation and execution, reflection and iteration, and alignment with human values. This framework features a multi-agent system where different Agents can play different roles (distribution/decision/actuation) in the entire process, making best use of each Agent to improve task execution efficiency.

In addition, in diversified scenarios, Agent deployment methods and tool calling capabilities also affect whether or not user needs can be quickly and accurately executed. Take NIO Nomi as an example:

Nomi Agents are deployed at the end and cloud sides. End-side model and NomiGPT are deployed at the end and cloud sides, respectively. Deeply integrated with SkyOS, the end-side model can call atomic capabilities in time and schedule resources (data, vehicle control hardware/software, etc.) across domains to speed up response. NomiGPT on the cloud connects more cloud tool resource interfaces to further enhance Nomi Agents’ capability of calling tools. Nomi Agents’ architecture is arranged in SkyOS middleware layer. Combining with SkyOS, it makes the process of calling atomic APIs, hardware/software and data more natural, coordinated, and faster.

1 Overview of Automotive AI Agent
1.1 Definition of Agent
1.2 Development History of Agent
1.3 Foundation Models Regain Vitality Using the Agent Concept
1.4 Differences between Foundation Models, Agents, and AIGC
1.5 Automotive AI Agent Product Definition
1.6 Automotive AI Agent based on Multi-agent System: Module Design
1.6 Automotive AI Agent based on Multi-agent System: Component Functions
1.6 Automotive AI Agent based on Multi-agent System: Component Characteristics (1)
1.6 Automotive AI Agent based on Multi-agent System: Component Characteristics (2)
1.6 Automotive AI Agent based on Multi-agent System: Component Characteristics (3)
1.6 Automotive AI Agent based on Multi-agent System: Component Characteristics (4)
1.6 Automotive AI Agent based on Multi-agent System: Component Characteristics (5)
1.6 Automotive AI Agent based on Multi-agent System: Component Characteristics (6)
1.6 Automotive AI Agent based on Multi-agent System: Component Characteristics (7)
1.6 Automotive AI Agent based on Multi-agent System: Component Characteristics (8)
1.7 Automotive AI Agent Reference Architecture (by Functional Module and Component)
1.7 Automotive AI Agent Reference Architecture (by Deployment Level)
1.8 Agent Architecture Case (1): Original Diagram of NIO (Nomi) Architecture
1.8 Agent Architecture Case (1): Original Diagram of NIO (Nomi) Deployment
1.8 Agent Architecture Case (1): NIO (Nomi) Module Design
1.8 Agent Architecture Case (1): NIO (Nomi) Module Design - Multimodal Perception
1.8 Agent Architecture Case (1): NIO (Nomi) Module Design - Command Distribution
1.8 Agent Architecture Case (1): NIO (Nomi) Module Design - Scenario Customization and Creation Process
1.8 Agent Architecture Case (1): Highlights of NIO (Nomi) (1)
1.8 Agent Architecture Case (1): Highlights of NIO (Nomi) (2)
1.8 Agent Architecture Case (1): Highlights of NIO (Nomi) (3)
1.8 Agent Architecture Case (2): Original Diagram of Li Auto (Lixiang Tongxue) Architecture
1.8 Agent Architecture Case (2): Li Auto (Lixiang Tongxue) Module Design
1.8 Agent Architecture Case (2): Li Auto (Lixiang Tongxue) Supporting Facilities - Data/Training Platform
1.8 Agent Architecture Case (2): Li Auto (Lixiang Tongxue) Supporting Facilities - Reasoning Engine
1.8 Agent Architecture Case (3): Original Diagram of Xiaomi (Xiaoai Tongxue) Architecture
1.8 Agent Architecture Case (3): Xiaomi (Xiaoai Tongxue) Module Design
1.8 Agent Architecture Case (4): Zeekr Agent Module Design
1.8 Agent Architecture Case (5): Original Diagram of Neta Agent Architecture Deployment
1.8 Agent Architecture Case (5): Neta Agent Module Design
1.8 Agent Architecture Case (6): Original Diagram of BAIC Agent Architecture Deployment
1.8 Agent Architecture Case (6): BAIC Agent Module Design
1.8 Agent Architecture Case (7): Huawei (Pangu Agent) Module Design
1.8 Agent Architecture Case (8): Original Diagram of AISpeech Agent Architecture Deployment
1.8 Agent Architecture Case (8): AISpeech Agent Module Design
1.8 Agent Architecture Case (9): Original Diagram of Lenovo Agent Architecture Deployment
1.8 Agent Architecture Case (10): Original Diagram of Zhipu Agent Architecture Deployment
1.8 Agent Architecture Case (10): Zhipu Agent Module Design
1.8 Agent Architecture Case (11): Original Diagram of Tinnove Agent Architecture Deployment
1.8 Agent Architecture Case (11): Tinnove Agent Module Design
1.9 Agent Architecture Design Process: Framework Selection
1.9 Agent Architecture Design Process: Tool Calling Method
1.10 Comparison of Automotive AI Agent Architecture

2 Key Issues in Development of Automotive AI Agent Products - User Sore Points and Technical Difficulties
2.1 Classification of Automotive AI Agent Scenario: Typical Commands in Different Scenarios
2.1 Classification of Automotive AI Agent Scenario: Case (1) NIO
2.1 Classification of Automotive AI Agent Scenario: Case (2) Li Auto
2.1 Classification of Automotive AI Agent Scenario: Case (3) Xiaomi
2.2 Automotive AI Agent Scenario Design Case (1) Q&A Scenario
2.2 Automotive AI Agent Scenario Design Case (2) Q&A Scenario
2.2 Automotive AI Agent Scenario Design Case (3) Mobility Scenario
2.2 Automotive AI Agent Scenario Design Case (4) Chat Scenario
2.2 Automotive AI Agent Scenario Design Case (5) Chat Scenario
2.2 Automotive AI Agent Scenario Design Case (6) Chat Scenario
2.2 Automotive AI Agent Scenario Design Case (7) Q&A/Office Scenario
2.3 User Sore Points in Different Agent Usage Scenarios: Summary
2.4 User Sore Points (1): Vehicle Control Scenario
2.4 User Sore Points (2): Mobility Scenario
2.4 User Sore Points (3): Q&An Scenario
2.4 User Sore Points (4): Entertainment Scenario
2.5 Agent Technical Difficulties
2.6 Agent Technology Case: Intent Recognition (Case 1)
2.6 Agent Technology Case: Intent Recognition (Case 2)
2.6 Agent Technology Case: Intent Recognition (Case 3)
2.6 Agent Technology Case: Intent Recognition (Case 4)
2.6 Agent Technology Case: Reasoning Acceleration (Case 1)
2.6 Agent Technology Case: Reasoning Acceleration (Case 2)
2.6 Agent Technology Case: Reasoning Acceleration (Case 3)
2.6 Agent Technology Case: Streaming Voice (Case 1)
2.6 Agent Technology Case: Streaming Voice (Case 2)
2.6 Agent Technology Case: Streaming Voice (Case 3)
2.6 Agent Technology Case: Emotional Interaction (Case 1)
2.6 Agent Technology Case: Emotional Interaction (Case 2)
2.6 Agent Technology Case: Emotional Interaction (Case 3)
2.7 Agent Technology Trends (1): Two Keys to Achieving Active Intelligence
2.7 Agent Technology Trends (2):
2.7 Agent Technology Trends (3): Two Mainstream Design Methods for Emotional Anthropomorphism

3 OEMs’ AI Agent Investment, Development, and Operation
3.1 Comparison of Automotive AI Agent Development Support
3.2 OEMs’ Planning for Automotive AI Agents
3.3 Comparison between Three Automotive AI Agent Development Modes: Advantages/Disadvantages
3.3 Comparison between Three Automotive AI Agent Development Modes: Cost
3.4 Position Setting of OEMs’ AI Agent Team
3.4 Case of OEMs’ AI Agent Team Position Setting (1): Positions Recruited by Chery AI Agent Team
3.4 Case of OEMs’ AI Agent Team Position Setting (2): Positions Recruited by Geely AI Agent Team
3.4 Case of OEMs’ AI Agent Team Position Setting (3): Positions Recruited by Li Auto AI Agent Team
3.4 Case of OEMs’ AI Agent Team Position Setting (4): Positions Recruited by NIO AI Agent Team
3.4 Case of OEMs’ AI Agent Team Position Setting (5): Positions Recruited by Xiaomi AI Agent Team
3.5 AI Agent Development Cycle and Operation Mode
3.6 AI Agent Business: OEMs’ Profit Model
3.6 AI Agent Business: Suppliers’ Profit Model
3.6 AI Agent Business: Suppliers’ Charging Standards
3.7 Commercial Development Trends of Automotive AI Agents (1)
3.7 Commercial Development Trends of Automotive AI Agents (2)

4 Automotive AI Agent Suppliers and Their Supply Relationships
4.1 Cockpit Base Foundation Model: Model Configurations
4.1 Cockpit Base Foundation Model: Selection Reference Factors
4.2 Cockpit Base Foundation Model Suppliers (1)
4.2 Cockpit Base Foundation Model Suppliers (2)
4.2 Cockpit Base Foundation Model Suppliers (3)
4.2 Cockpit Base Foundation Model Suppliers (4)
4.2 Cockpit Base Foundation Model Suppliers (5)
4.2 Cockpit Base Foundation Model Suppliers (6)
4.2 Cockpit Base Foundation Model Suppliers (7)
4.2 Cockpit Base Foundation Model Suppliers (8)
4.2 Cockpit Base Foundation Model Suppliers (9)
4.2 Cockpit Base Foundation Model Suppliers (10)
4.3 Industry Chain of Vector Database Suppliers
4.4 Comparison between Vector Database Products: Chinese Vector Databases
4.4 Comparison between Vector Database Products: Foreign Vector Databases
4.5 Vector Database Supplier Cases (1)
4.5 Vector Database Supplier Cases (2)
4.5 Vector Database Supplier Cases (3)
4.5 Vector Database Supplier Cases (4)
4.5 Vector Database Supplier Cases (5)
4.5 Vector Database Supplier Cases (6)
4.5 Vector Database Supplier Cases (7)
4.5 Vector Database Supplier Cases (8)
4.6 Comparison between Voice ASR Module Suppliers
4.7 ASR Module Supplier Cases (1)
4.7 ASR Module Supplier Cases (2)
4.7 ASR Module Supplier Cases (3)
4.7 ASR Module Supplier Cases (4)
4.7 ASR Module Supplier Cases (5)
4.7 ASR Module Supplier Cases (6)
4.7 ASR Module Supplier Cases (7)
4.7 ASR Module Supplier Cases (8)
4.7 ASR Module Supplier Cases (9)
4.8 Cockpit Data Collection Sensors: Mainstream Configurations/Data Collection Regulations
4.9 Sensor Data Processing Cases (1)
4.9 Sensor Data Processing Cases (2)
4.9 Sensor Data Processing Cases (3)
4.9 Sensor Data Processing Cases (4)

Next-generation Central and Zonal Communication Network Topology and Chip Industry Research Report, 2025

The automotive E/E architecture is evolving towards a "central computing + zonal control" architecture, where the central computing platform is responsible for high-computing-power tasks, and zonal co...

Vehicle-road-cloud Integration and C-V2X Industry Research Report, 2025

Vehicle-side C-V2X Application Scenarios: Transition from R16 to R17, Providing a Communication Base for High-level Autonomous Driving, with the C-V2X On-board Explosion Period Approaching In 2024, t...

Intelligent Cockpit Patent Analysis Report, 2025

Patent Trend: Three Major Directions of Intelligent Cockpits in 2025 This report explores the development trends of cutting-edge intelligent cockpits from the perspective of patents. The research sco...

Smart Car Information Security (Cybersecurity and Data Security) Research Report, 2025

Research on Automotive Information Security: AI Fusion Intelligent Protection and Ecological Collaboration Ensure Cybersecurity and Data Security At present, what are the security risks faced by inte...

New Energy Vehicle 800-1000V High-Voltage Architecture and Supply Chain Research Report, 2025

Research on 800-1000V Architecture: to be installed in over 7 million vehicles in 2030, marking the arrival of the era of full-domain high voltage and megawatt supercharging. In 2025, the 800-1000V h...

Foreign Tier 1 ADAS Suppliers Industry Research Report 2025

Research on Overseas Tier 1 ADAS Suppliers: Three Paths for Foreign Enterprises to Transfer to NOA Foreign Tier 1 ADAS suppliers are obviously lagging behind in the field of NOA. In 2024, Aptiv (2.6...

VLA Large Model Applications in Automotive and Robotics Research Report, 2025

ResearchInChina releases "VLA Large Model Applications in Automotive and Robotics Research Report, 2025": The report summarizes and analyzes the technical origin, development stages, application cases...

OEMs’ Next-generation In-vehicle Infotainment (IVI) System Trends Report, 2025

ResearchInChina releases the "OEMs’ Next-generation In-vehicle Infotainment (IVI) System Trends Report, 2025", which sorts out iterative development context of mainstream automakers in terms of infota...

Autonomous Driving SoC Research Report, 2025

High-level intelligent driving penetration continues to increase, with large-scale upgrading of intelligent driving SoC in 2025 In 2024, the total sales volume of domestic passenger cars in China was...

China Passenger Car HUD Industry Report, 2025

ResearchInChina released the "China Passenger Car HUD Industry Report, 2025", which sorts out the HUD installation situation, the dynamics of upstream, midstream and downstream manufacturers in the HU...

ADAS and Autonomous Driving Tier 1 Suppliers Research Report, 2025 – Chinese Companies

ADAS and Autonomous Driving Tier 1 Suppliers Research Report, 2025 – Chinese Companies Research on Domestic ADAS Tier 1 Suppliers: Seven Development Trends in the Era of Assisted Driving 2.0 In the ...

Automotive ADAS Camera Report, 2025

①In terms of the amount of installed data, installations of side-view cameras maintain a growth rate of over 90%From January to May 2025, ADAS cameras (statistical scope: front-view, side-view, surrou...

Body (Zone) Domain Controller and Chip Industry Research Report,2025

Body (Zone) Domain Research: ZCU Installation Exceeds 2 Million Units, Evolving Towards a "Plug-and-Play" Modular Platform The body (zone) domain covers BCM (Body Control Module), BDC (Body Dom...

Automotive Cockpit Domain Controller Research Report, 2025

Cockpit domain controller research: three cockpit domain controller architectures for AI Three layout solutions for cockpit domain controllers for deep AI empowerment As intelligent cockpit tran...

China Passenger Car Electronic Control Suspension Industry Research Report, 2025

Electronic control suspension research: air springs evolve from single chamber to dual chambers, CDC evolves from single valve to dual valves ResearchInChina released "China Passenger Car Elect...

Automotive XR Industry Report, 2025

Automotive XR industry research: automotive XR application is still in its infancy, and some OEMs have already made forward-looking layout The Automotive XR Industry Report, 2025, re...

Intelligent Driving Simulation and World Model Research Report, 2025

1. The world model brings innovation to intelligent driving simulation In the advancement towards L3 and higher-level autonomous driving, the development of end-to-end technology has raised higher re...

Autonomous Driving Map (HD/LD/SD MAP, Online Reconstruction, Real-time Generative Map) Industry Report 2025

Research on Autonomous Driving Maps: Evolve from Recording the Past to Previewing the Future with "Real-time Generative Maps" "Mapless NOA" has become the mainstream solution for autonomous driving s...

Abstract

Table of Contents

Selected Charts

Related Reports

Related Companies