Zhanxing Zhu

Machine learning researcher.

profile_pic.png

ECS, University of Southampton

Southampton, SO17 1BJ, UK

Email: z.zhu@soton.ac.uk

I am an Associate Professor at Vision, Learning and Control Group (VLC), School of Electrical and Computer Science (ECS), University of Southampton, UK. I am now closely affiliated with UKRI AI Centre for Doctoral Training in AI for Sustainability. Previously I obtained Ph.D on machine learning from School of Informatics, University of Edinburgh, UK.

I have been focusing on machine learning, particularly, deep learning, broadly covering its theory, methodology and application. Together with my students and collaborators, we attempt to rigorously reveal the underlying mechanism of why deep learning works or not, and inspired by our theoretical understanding and empirical observation, we develop robust, fast and generalizable models and algorithms to boost its applicability in various challenging scenarios and interdisciplinary tasks, e.g. AI4Science and AI4Engineering. More information is shown in my Google Scholar profile.

Research Interests:

Ph.D Studentships. I’m interested in supervising motivated students in the area of AI and machine learning, ranging from theory, algorithms and various applications. Please get in touch to discuss the options and potential topics. You can also check out the UKRI AI Centre for Doctoral Training in AI for Sustainability which has opportunities for 70 PhD students in the area of AI and environmental sustainability.

news

May 01, 2026 3 ICML papers accepted, with one Spotlight! 😉 Two of them are about the fundamental understanding of the Transformer architecture and its training dynamics. The other is about private training of SGD with adaptive clipping strategies.
Jan 29, 2026 4 papers accepted by ICLR 2026 😉! Two are for highly skillful AI models for simulation deformable objects, one for mixture of depth-recurrrent architecture design for improving LLM resasoning, and the other for analyzing training dynamics of semi-supervised learning.

selected publications

  1. ICML
    Transformers with RL or SFT Provably Learn Sparse Boolean Functions, But Differently
    Bochen Lyu, Yiyang Jia, Xiaohao Cai, and Zhanxing Zhu
    In International Conference for Machine Learning (ICML), 2026
  2. ICML
    Modelling Attention with Aitchison Geometry: Token Distinguishability and Temperature Scaling
    Sam Hilton-Jones, Tim Norman, and Zhanxing Zhu
    In International Conference for Machine Learning (ICML), 2026
  3. TPAMI
    Analyzing the Implicit Bias of Adversarial Training from a Generalized Margin Perspective
    Bochen Lyu and Zhanxing Zhu
    IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025
  4. ICLR
    A Solvable Attention for Neural Scaling Laws
    Bochen Lyu, Di Wang, and Zhanxing Zhu
    In International Conference on Learning Representation (ICLR), 2025
  5. NeurIPS
    Spherical Motion Dynamics: Learning Dynamics of Normalized Neural Network using SGD and Weight Decay
    Ruosi Wan, Zhanxing Zhu, Xiangyu Zhang, and Jian Sun
    Advances in Neural Information Processing Systems (NeurIPS), 2021
  6. ICML
    The Anisotropic Noise in Stochastic Gradient Descent: Its Behavior of Escaping from Sharp Minima and Regularization Effects
    Zhanxing Zhu, Jingfeng Wu, Bing Yu, Lei Wu, and Jinwen Ma
    In International Conference on Machine Learning (ICML), 2019
  7. IJCAI
    Spatio-temporal graph convolutional neural network: A deep learning framework for traffic forecasting
    Bing Yu, Haoteng Yin, and Zhanxing Zhu
    In International Joint Conference of Artificial Intelligence (IJCAI), 2018