Learning to Stabilize Plasma: Provable Imitation Learning for Nuclear Fusion Control

Posted: 2026-05-18

Time: 11:00-12:00, May 26, 2026 (Tue)

Venue: RM 1-202, FIT Building

Abstract:

Maintaining the stability of magnetically confined plasma is a central challenge on the path toward practical nuclear fusion. When the plasma is modelled by the kinetic Vlasov–Poisson equations, the control problem is particularly difficult due to nonlinearity, sensitivity to initial conditions, and partial observability. Recent advances in AI have shown considerable promise for plasma control, yet the theoretical foundations and principled algorithmic methodologies remain underexplored.

In this talk, we discuss recent advances in machine learning for plasma control. Starting from an expert controller designed for a fully observed model, we develop algorithms that learn feedback policies operating solely on experimentally available measurements. We study both offline and online imitation learning algorithms, revealing a new tradeoff between adaptivity and stability. Offline behavior cloning adapts to the complexity of the initial distribution, but inevitably suffers from exponential error compounding. Online algorithms, by contrast, can achieve long-term stability with only polynomial error compounding. Our theory highlights the advantages of learning-based control in adapting to unknown initial conditions while maintaining long-time stability. Empirical results on simulated plasma systems further validate the effectiveness of our methods in stabilizing plasma over long time horizons.

This work builds a bridge between statistical learning theory and the control of complex physical systems, and represents a step toward theoretically grounded, AI-assisted control strategies for fusion energy. Joint work with Xiaofan Xia and Qin Li.

Speaker bio:

Wenlong Mou is an Assistant Professor in the Department of Statistical Sciences at the University of Toronto. He received his Ph.D. in Electrical Engineering and Computer Sciences from UC Berkeley in 2023. Before joining Berkeley, he earned a B.Sc. in Computer Science and a B.A. in Economics from Peking University.

His research interests include reinforcement learning theory, post-training methods for deep generative models, and the interplay between reinforcement learning and continuous control. He is particularly interested in developing theory and algorithms that use reinforcement learning to control real-world systems such as fusion plasma. His work has been published in leading journals and conferences in machine learning, statistics, and applied mathematics. His research has been recognized by the INFORMS Applied Probability Society, where he was named a Best Student Paper finalist.

Speaker: 牟文龙 (Wenlong Mou) [University of Toronto]