Zhanxing Zhu

Machine learning researcher.


ECS, University of Southampton

Southampton, SO17 1BJ, UK

Email: z.zhu@soton.ac.uk

I am an Associate Professor in the Vision, Learning and Control Group (VLC), School of Electronics and Computer Science (ECS), University of Southampton, UK. I am now closely affiliated with the UKRI AI Centre for Doctoral Training in AI for Sustainability. Previously, I obtained my Ph.D in machine learning from the School of Informatics, University of Edinburgh, UK.

My research focuses on machine learning, particularly deep learning, broadly covering its theory, methodology and applications. Together with my students and collaborators, I aim to rigorously reveal the underlying mechanisms of why deep learning works or fails. Inspired by our theoretical understanding and empirical observations, we develop robust, fast and generalizable models and algorithms to boost its applicability in various challenging scenarios and interdisciplinary tasks, e.g. ML4Science. More information can be found in my Google Scholar profile.

Research Interests:

Ph.D Studentships. I’m interested in supervising motivated students in AI and machine learning, on topics ranging from theory and algorithms to applications. Please get in touch to discuss the options and potential topics. You can also check out the UKRI AI Centre for Doctoral Training in AI for Sustainability, which has opportunities for 70 PhD students in the area of AI and environmental sustainability.

Selected Publications

  1. ICLR
    A Solvable Attention for Neural Scaling Laws
    Bochen Lyu, Di Wang, and Zhanxing Zhu
    In International Conference on Learning Representations (ICLR), 2025
  2. NeurIPS
    Implicit Bias of (Stochastic) Gradient Descent for Rank-1 Linear Neural Network
    Bochen Lyu and Zhanxing Zhu
    In Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS), 2023
  3. ICLR
    Implicit Bias of Adversarial Training for Deep Neural Networks
    Bochen Lv and Zhanxing Zhu
    In International Conference on Learning Representations (ICLR), 2022
  4. NeurIPS
    Spherical Motion Dynamics: Learning Dynamics of Normalized Neural Network using SGD and Weight Decay
    Ruosi Wan, Zhanxing Zhu, Xiangyu Zhang, and Jian Sun
    In Advances in Neural Information Processing Systems (NeurIPS), 2021
  5. NeurIPS
    You only propagate once: Accelerating adversarial training via maximal principle
    Dinghuai Zhang, Tianyuan Zhang, Yiping Lu, Zhanxing Zhu, and Bin Dong
    In Advances in Neural Information Processing Systems (NeurIPS), 2019
  6. ICML
    The Anisotropic Noise in Stochastic Gradient Descent: Its Behavior of Escaping from Sharp Minima and Regularization Effects
    Zhanxing Zhu, Jingfeng Wu, Bing Yu, Lei Wu, and Jinwen Ma
    In International Conference on Machine Learning (ICML), 2019
  7. IJCAI
    Spatio-temporal graph convolutional neural network: A deep learning framework for traffic forecasting
    Bing Yu, Haoteng Yin, and Zhanxing Zhu
    In International Joint Conference on Artificial Intelligence (IJCAI), 2018