Xuan Cai - Academic Homepage

Journal

Text2Scenario: Text-Driven Scenario Generation for Autonomous Driving Test

Leverages a Large Language Model to autonomously generate simulation test scenarios from natural language inputs, extracting scenario components from a hierarchical repository and fabricating executable test scenarios via Domain Specific Language.

Xuan Cai, Xuesong Bai, Zhiyong Cui, Danmu Xie, Daocheng Fu, Haiyang Yu, Yilong Ren

Automotive Innovation, 2025 | doi: 10.1007/s42154-025-00374-8

PDF

Autonomous driving (AD) testing constitutes a critical methodology for assessing performance benchmarks prior to product deployment. The creation of segmented scenarios within a simulated environment is acknowledged as a robust and effective strategy; however, the process of tailoring these scenarios often necessitates laborious and time-consuming manual efforts, thereby hindering the development and implementation of AD technologies. In response to this challenge, Text2Scenario is introduced, a framework that leverages a Large Language Model (LLM) to autonomously generate simulation test scenarios that closely align with user specifications, derived from their natural language inputs. Specifically, an LLM, equipped with a meticulously engineered input prompt scheme functions as a text parser for test scenario descriptions. The LLM extracts from a hierarchically organized scenario repository the components that most accurately reflect the user's preferences. Subsequently, by exploiting the precedence of scenario components, the process involves sequentially matching and linking scenario representations within a Domain Specific Language corpus, ultimately fabricating executable test scenarios.

Conference

Vulnerability-aware and Curiosity-driven Adversarial Reinforcement Learning Policy for Safety-Critical Scenario Generation

Proposes a vulnerability-aware and curiosity-driven adversarial RL policy that uses a surrogate network to fit the victim's value function and random network distillation for exploration, significantly enhancing policy exposure in learning-based AVs.

Xuan Cai, Zhiyong Cui, Xuesong Bai, Ruimin Ke, Haiyang Yu, Yilong Ren, Zechang Ye

2025 IEEE Intelligent Vehicles Symposium (IV), Cluj-Napoca, Romania

PDF Code

Autonomous vehicles (AVs) face significant threats to their safe operation in complex traffic environments. Adversarial policy for scenario generation has been established as a robust paradigm for enhancing AV resilience against adversarial perturbations through proactive exposure to synthetically engineered safety-critical scenarios. However, adversarial policies in existing methodologies often get stuck in a loop of over-exploiting established vulnerabilities, resulting in poor exploration for AVs. To overcome the limitations, we introduce a pioneering framework termed the vulnerability-aware and curiosity-driven adversarial reinforcement learning policy. Specifically, during the traffic vehicle attacker training phase, a surrogate network is employed to fit the value function of the AV victim, providing dense information about the victim's inherent vulnerabilities. Subsequently, random network distillation is used to characterize the novelty of the scenario, constructing an intrinsic reward to guide the attacker in exploring unexplored territories.

Journal

Adversarial Stress Test for Autonomous Vehicle via Series Reinforcement Learning Tasks With Reward Shaping

Introduces an evolving series RL framework for adversarial policy training, integrating RSS and DTW theories to shape the reward function, expediting the exploration of vulnerability-revealing scenarios for both black-box and white-box AV systems.

Xuan Cai, Xuesong Bai, Zhiyong Cui, Peng Hang, Haiyang Yu, Yilong Ren

IEEE Transactions on Intelligent Vehicles, Vol. 10, No. 2, pp. 832–845, 2025

PDF Code

Testing is a pivotal phase for uncovering potential vulnerabilities in autonomous vehicles (AVs) to develop a secure autonomy system. However, existing methods often lack consideration for efficiently exploring multiple vulnerability-revealing cases, particularly under adversarial game scenarios. We introduce an evolving series reinforcement learning (RL) framework for adversarial policy training, integrating Responsibility Sensitive Safety (RSS) and Dynamic Time Warping (DTW) theories to shape the reward function to steer the evolving direction of the subsequent series agents for exploring vulnerability-revealing attack scenarios uncharted in the refined buffered repository. Our method undertakes adversarial stress tests for both black-box and white-box AV systems under test in driving tasks that engage in games with traffic vehicles and pedestrians. The results indicate that our approach expedites the exploration of additional scenarios blamed for the AV, outperforming the baselines in the vulnerability-revealing accident and scenario diversity.

Journal

An explicit State-of-Charge planning solution for plug-in hybrid electric vehicle based on low-granularity prior-knowledge

Devises an explicit SoC planning method requiring only sparse traffic prior-knowledge, drawing inspiration from optimal charge depletion behavior, and develops a hierarchical predictive energy management framework integrating SoC planning and power split.

Xuan Cai, Wei Zhou, Zhiyong Cui, Xuesong Bai, Fan Liu, Haiyang Yu, Yilong Ren

Energy, Vol. 313, 133990, 2024

PDF

The intervention of batteries in hybrid electric vehicles, when paired with an effective Energy Management Strategy (EMS), substantially improves fuel efficiency and reduces emissions in comparison to conventional internal combustion engine vehicles. The evolution of Intelligent Transportation Systems (ITS) has facilitated the possibility of predictive energy management (PEM) predicated on State-of-Charge (SoC) planning. Nevertheless, prevalent methodologies frequently encounter challenges in balancing optimization with real-time applicability. To address these limitations, we have devised an explicit SoC planning method that necessitates sparse traffic prior-knowledge, drawing inspiration from the optimal charge depletion behavior. This innovative method strategically determines the average SoC depletion rate for each anticipated driving road segment by integrating theoretical predictions of optimal depletion rate with experienced constraints. The results of the simulation experiments reveal that the SoC trajectories and fuel consumption generated by this method are in close approximation to theoretically optimal benchmarks.

Journal

Decoding the optimal charge depletion behavior in energy domain for predictive energy management of series plug-in hybrid electric vehicle

Reveals four aggregated optimal charge depletion behaviors through rigorous PMP-based analytical derivations, providing fundamental understanding on how and why optimal charge depletion rates behave in different driving conditions.

Wei Zhou, Xuan Cai, Yaoqi Chen, Junqiu Li, Xiaoyan Peng

Applied Energy, Vol. 316, 119098, 2022

PDF

A critical issue for designing predictive energy management (PEM) strategy of Plug-in Hybrid Electric Vehicles is the planning of optimal global charge trajectory. Existing planning methods have flaws in terms of optimality or computational efficiency due to their lack of in-depth consideration about optimal charge depletion behaviors. To address this issue, rigorous theoretical analysis on the aggregated local and global optimal charge depletion behaviors in energy domain is conducted by combining Pontryagin's Minimum Principle-based analytical derivations and some qualitative reasoning. Fundamental understanding on how the optimal charge depletion rates behave in different driving conditions and why they exhibit such behaviors is provided. The theoretical analysis is further validated through model-in-the-loop tests using an experimentally validated high-fidelity vehicle simulator.

Xuan Cai (蔡翾)

About Me

Education

Publications

Patents

Awards

News