Cockpit AI Agent: Autonomous scenario creation becomes the first step to personalize cockpits
In AI Foundation Models’ Impacts on Vehicle Intelligent Design and Development Research Report, 2024, ResearchInChina mentioned that the core of an AI Agent uses a large language model (LLM) as its core computing engine (LLM OS). In the AI service framework, the LLM acts as AI core and the Agent acts as AI APP. With the help of reasoning and generation capabilities of AI foundation model, the Agent can create more cockpit scenarios, while further improving current multimodal interaction, voice processing and other technologies in the cockpit.
![座舱 AI AGENT 1_副本.png](/UpLoads/Article/2024H2/座舱%20AI%20AGENT%201_副本.png)
Compared with the traditional OS, AI OS is programmed in natural language, which is more suitable for human operation and convenient for customization and personalized human-computer interaction. The Agent responds to user prompts or instructions in natural language, and can perform a lot of tasks according to its interaction with physical or digital environment.
![座舱 AI AGENT 2_副本.png](/UpLoads/Article/2024H2/座舱%20AI%20AGENT%202_副本.png)
Foundation models spawn Agent customized scenarios
Since AI foundation models were installed in cars, they have been widely used in multimodal interaction scenarios. OEMs have started from voice assistants to further improve intelligent processing of user commands:
Before foundation models got on cars, the wake-up rate and effective instructions of voice assistants in cockpits are at a low level;
After foundation models were available in cars, not only has voice recognition become more intelligent, but the application of expression monitoring and gesture control have also been improved, with effective commands and daily active rates increasing significantly.
AI foundation models further enhance the user experience of smart cockpits through strong context processing and reasoning capabilities. Please refer to the following dialogue:
(Mobility scenario)
--**Tongxue! (the name of a voice assistant)
-I am here!
-There seems to be a prairie ahead. Please take a picture for me!
-Wait a moment, and I will capture the beautiful scenariory of the prairie for you. I'll let you know when the photo is taken. Is there anything else I can help you with?
-Send it to Moments immediately. I want my friends to have a look at this beautiful scenariory.
-Ok, according to your authorized account, please choose the material and filter style.
-Just this one, with a caption.
-Ok, three captions have been generated for you. Which one do you like?
-Just this one.
(You enter the office and receive a phone call)
-There is a phone call from a customer, and I have transferred it to the IVI for you. Would you like to answer it?
-Yes, and turn on the office mode.
-Ok, I will generate an avatar of the customer for you and start the conversation. Do you have any questions or things to discuss? I will record the conversation and provide you with the minutes of the meeting after it is over.
(The avatar is generated)
-Now you can start the conversation.
The above scenarios will not be materialized in the cockpit until 2024 when foundation models are installed on vehicles by some OEMs.
For example, IM L6 has built Carlog and City Drive scenarios to enable the AI foundation models to proactively recommend food and attractions and allow users to post them on social media:
Carlog: Actively perceive the scenario during driving through AI vision foundation model, mobilize four cameras to take photos, automatically save and edit them, and support one-click share in Moments.
City Drive: Cooperate with Volcengine to model nearby food, scenic spots and landmarks in real time in the digital screen, and push them according to users' habits and preferences.
![座舱 AI AGENT 3.png](/UpLoads/Article/2024H2/座舱%20AI%20AGENT%203.png)
![座舱 AI AGENT 4.png](/UpLoads/Article/2024H2/座舱%20AI%20AGENT%204.png)
The applicability of foundation models in various scenarios has stimulated users' demand for intelligent agents that can uniformly manage cockpit functions. In 2024, OEMs such as NIO, Li Auto, and Hozon successively launched Agent frameworks, using voice assistants as the starting point to manage functions and applications in cockpits.
Agent service frameworks can not only manage cockpit functions in a unified way, but also provide more abundant scenario modes according to customers' needs and preferences, especially supporting customized scenarios for users, which accelerates the advent of the cockpit personalization era.
![座舱 AI AGENT 5_副本.png](/UpLoads/Article/2024H2/座舱%20AI%20AGENT%205_副本.png)
For example, NIO’s NOMI GPT allows users to set an AI scenario with just one sentence:
![座舱 AI AGENT 6.png](/UpLoads/Article/2024H2/座舱%20AI%20AGENT%206.png)
Core competence of cockpit Agents
AI Agents in the era of foundation models are based on LLMs, whose powerful reasoning expands the applicable scenarios of AI Agents that can improve the thinking capability of foundation models through feedback obtained during operation. In the cockpit, the Agent capability paradigm can be roughly divided into "Understanding" + "Planning" + "Tool Use" + "Reflection".
![座舱 AI AGENT 7_副本.png](/UpLoads/Article/2024H2/座舱%20AI%20AGENT%207_副本.png)
When Agents first get on cars, cognitive and planning abilities are more important. The understanding of task goals and the choice of implementation paths directly determine the accuracy of performance results, which in turn affect the scenario utilization rate of Agents.
For example, in Xiaomi's voice interaction process, semantic understanding is the difficulty of the entire automotive voice processing process. XiaoAi handles semantic parsing through a semantic parsing model.
![座舱 AI AGENT 8_副本.png](/UpLoads/Article/2024H2/座舱%20AI%20AGENT%208_副本.png)
After the mass production of Agents, the personalized cockpits that support users to customize scenario modes become the highlight, and Reflection becomes the most important core competence at this stage, so it is necessary to build an Agentic Workflow that is constantly learning and optimizing.
For example, Lixiang Tongxue offered by Li Auto supports the creation of one-sentence scenarios. It is backed by Mind GPT's built-in memory network and online reinforcement learning capabilities. Mind GPT can remember personalized preferences and habits based on historical conversations. When similar scenarios recur, it can automatically set scenario parameters through historical data to fit the user's original intentions.
![座舱 AI AGENT 9_副本.png](/UpLoads/Article/2024H2/座舱%20AI%20AGENT%209_副本.png)
At the AI OS architecture setting level, we take SAIC Z-One as an example:
Z-One accesses the LLM kernel (LLM OS) at the kernel layer, which controls the interfaces of AI OS SDK and ASF with the original microkernel respectively, in which AI OS SDK receives the scheduling of the LLM to promote the Agent service framework of the application layer. The Z-One AI OS architecture highly integrates AI with CPU. Through SOA atomic services, AI is then connected to the vehicle's sensors, actuators and controllers. This architecture, based on a terminal-cloud foundation model, can enhance the computing power of the terminal-side foundation model and reduce operational latency.
Application Difficulty of Cockpit AI Agents
Agents connect to users and execute commands. In the application process, in addition to the technical difficulties of putting foundation models on cars, they also face scenario difficulties. In the process of command reception-semantic analysis-intention reasoning-task execution, the accuracy of the performance results and the delay in human-computer interaction directly affect the user's riding experience.
![座舱 AI AGENT 10_副本.png](/UpLoads/Article/2024H2/座舱%20AI%20AGENT%2010_副本.png)
Humanization of interaction
For example, in the "emotional consultant" scenario, Agents should resonate emotionally with car owners and perform anthropomorphism. Generally, there are three forms of anthropomorphism of AI Agents: physical anthropomorphism, personality anthropomorphism, and emotional anthropomorphism.
![座舱 AI AGENT 11_副本.png](/UpLoads/Article/2024H2/座舱%20AI%20AGENT%2011_副本.png)
NIO's NOMI GPT uses "personality anthropomorphism" and "emotional anthropomorphism":
![座舱 AI AGENT 12_副本.png](/UpLoads/Article/2024H2/座舱%20AI%20AGENT%2012_副本.png)
Foundation model performance
In the "encyclopedia question and answer" scenario, Agents may be unable to answer the user's questions, especially open questions, accurately because of LLM illusion after semantic analysis, database search, answer generation and the like.
Current solutions include advanced prompting, RAG+knowledge graph, ReAct, CoT/ToT, etc., which cannot completely eliminate “LLM illusion”. In the cockpit, external databases, RAG, self-consistency and other methods are more often used to reduce the frequency of “LLM illusion”.
Some foundation model manufacturers have improved the above solutions. For example, Meta has proposed to reduce “LLM illusion” through Chain-of-Verification (CoVe). This method breaks down fact-checking into more detailed sub-questions to improve response accuracy and is consistent with the human-driven fact-checking process. It can effectively improve the FACTSCORE indicator in long-form generation tasks.
CoVe includes four steps: query, plan verification, execute verification and final verified response.
![座舱 AI AGENT 13_副本.png](/UpLoads/Article/2024H2/座舱%20AI%20AGENT%2013_副本.png)
Chinese OEMs (Passenger Car) Going Overseas Report, 2024--Germany
Keywords of Chinese OEMs going to Germany: electric vehicles, cost performance, intelligence, ecological construction, localization
The European Union's temporary tariffs on electric vehicles in Chi...
Analysis on DJI Automotive’s Autonomous Driving Business, 2024
Research on DJI Automotive: lead the NOA market by virtue of unique technology route.
In 2016, DJI Automotive’s internal technicians installed a set of stereo sensors + vision fusion positioning syst...
BYD’s Layout in Electrification, Connectivity, Intelligence and Sharing and Strategy Analysis Report, 2023-2024
Insight: BYD deploys vehicle-mounted drones, and the autonomous driving charging robot market is expected to boom.
BYD and Dongfeng M-Hero make cross-border layout of drones.
In recent years,...
Great Wall Motor’s Layout in Electrification, Connectivity, Intelligence and Sharing and Strategy Analysis Report, 2023-2024
Great Wall Motor (GWM) benchmarks IT giants and accelerates “Process and Digital Transformation”.
In 2022, Great Wall Motor (GWM) hoped to use Haval H6's huge user base to achieve new energy transfo...
Cockpit AI Agent Research Report, 2024
Cockpit AI Agent: Autonomous scenario creation becomes the first step to personalize cockpits
In AI Foundation Models’ Impacts on Vehicle Intelligent Design and Development Research Report, 2024, Res...
Leading Chinese Intelligent Cockpit Tier 1 Supplier Research Report, 2024
Cockpit Tier1 Research: Comprehensively build a cockpit product matrix centered on users' hearing, speaking, seeing, writing and feeling.
ResearchInChina released Leading Chinese Intelligent Cockpit ...
Global and China Automotive Wireless Communication Module Market Report, 2024
Communication module and 5G research: 5G module installation rate reaches new high, 5G-A promotes vehicle application acceleration
5G automotive communication market has exploded, and 5G FWA is evolv...
ADAS and Autonomous Driving Tier 1 Suppliers Research Report, 2024 – Chinese Companies
ADAS Tier1s Research: Suppliers enter intense competition while exploring new businesses such as robotics
In China's intelligent driving market, L2 era is dominated by foreign suppliers. Entering era...
Automotive Gateway Industry Report, 2024
Automotive gateway research: 10BASE-T1S and CAN-XL will bring more flexible gateway deployment solutions
ResearchInChina released "Automotive Gateway Industry Report, 2024", analyzing and researching...
Global and China Electronic Rearview Mirror Industry Report, 2024
Research on electronic rearview mirrors: electronic internal rearview mirrors are growing rapidly, and electronic external rearview mirrors are facing growing pains
ResearchInChina released "Global a...
Next-generation Zonal Communication Network Topology and Chip Industry Research Report, 2024
The in-vehicle communication architecture plays a connecting role in automotive E/E architecture. With the evolution of automotive E/E architecture, in-vehicle communication technology is also develop...
Autonomous Delivery Industry Research Report, 2024
Autonomous Delivery Research: Foundation Models Promote the Normal Application of Autonomous Delivery in Multiple Scenarios
Autonomous Delivery Industry Research Report, 2024 released by ResearchInCh...
Global Autonomous Driving Policies & Regulations and Automotive Market Access Research Report, 2024
Intelligent driving regulations and vehicles going overseas: research on regional markets around the world and access strategies. "Going out”: discussion about regional markets aroun...
China Passenger Car HUD Industry Report, 2024
HUD research: AR-HUD accounted for 21.1%; LBS and optical waveguide solutions are about to be mass-produced. The automotive head-up display system (HUD) uses the principle of optics to display s...
Ecological Domain and Automotive Hardware Expansion Research Report, 2024
Automotive Ecological Domain Research: How Will OEM Ecology and Peripheral Hardware Develop? Ecological Domain and Automotive Hardware Expansion Research Report, 2024 released by ResearchInChina ...
C-V2X and CVIS Industry Research Report, 2024
C-V2X and CVIS Research: In 2023, the OEM scale will exceed 270,000 units, and large-scale verification will start.The pilot application of "vehicle-road-cloud integration” commenced, and C-V2X entere...
Automotive Intelligent Cockpit Platform Configuration Strategy and Industry Research Report, 2024
According to the evolution trends and functions, the cockpit platform has gradually evolved into technical paths such as cockpit-only, cockpit integrated with other domains, cockpit-parking integratio...
Analysis on Huawei's Electrification, Connectivity, Intelligence and Sharing,2023-2024
Analysis on Huawei's Electrification, Connectivity, Intelligence and Sharing: Comprehensive layout in eight major fields and upgrade of Huawei Smart Selection
The “Huawei Intelligent Driving Business...