Xiang Zheng
Research Assistant Professor
Hong Kong Institute of AI for Science (HKAI-Sci)
City University of Hong Kong
Email: xiang.zheng [at] cityu.edu.hk
Google Scholar | GitHub | CityU Scholar | CV
I am a Research Assistant Professor at the Hong Kong Institute of AI for Science (HKAI-Sci), City University of Hong Kong. I also work closely with Prof. Xingjun Ma at the Institute of Trustworthy Embodied AI, Fudan University. My research is positioned at the intersection of Reinforcement Learning, Trustworthy AI, Generative AI (LLMs, diffusion models, and AI agents), and Robot Learning. I am particularly passionate about developing robust and efficient reinforcement learning algorithms to enable Trustworthy Decision Making in real-world systems. I have been honored to receive the prestigious CityU Presidential PhD Scholarship.
I received my Ph.D. from the Department of Computer Science at City University of Hong Kong in 2024, under the guidance of Prof. Cong Wang. Prior to that, I earned my Master’s degree in Control Science and Engineering from the Department of Automation at Tsinghua University in 2019, where I was advised by Prof. Tao Zhang. I hold dual Bachelor’s degrees in Automation and Mathematics from the Shen Yuan Honors College (formerly the School of Advanced Engineering) at Beihang University, where I graduated in 2016.
Latest News
| Date | News |
|---|---|
| Feb 01, 2026 | Will join HKAI-Sci as Research Assistant Professor in late Feb. |
| Jan 15, 2026 | Our survey on large model and agent safety is published in Foundations and Trends® in Privacy and Security. |
| Jan 23, 2025 | Our work on reinforced defense for VLMs is accepted by ICLR’25. |
| Dec 14, 2024 | Our work on RL-based auditing for LLMs is accepted by AAAI’25. |
| Apr 17, 2024 | Our work on intrinsic motivation for RL is accepted by IJCAI’24. |
Selected Publications
- [FnT P&S] Safety at Scale: A Comprehensive Survey of Large Model and Agent Safety. Foundations and Trends in Privacy and Security, 2026
- [ICLR’25] BlueSuffix: Reinforced Blue Teaming for Vision-Language Models Against Jailbreak Attacks. In International Conference on Learning Representations (ICLR), 2025
- [AAAI’25] CALM: Curiosity-Driven Auditing for Large Language Models. In AAAI Conference on Artificial Intelligence (AAAI), 2025
- [IJCAI’24] Constrained intrinsic motivation for reinforcement learning. In International Joint Conference on Artificial Intelligence (IJCAI), 2024
- [DSN’24] Toward evaluating robustness of reinforcement learning with adversarial policy. In Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN), 2024
- [AST] Reinforcement learning with prior policy guidance for motion planning of dual-arm free-floating space robot. Aerospace Science and Technology (AST), 2023
- [RA-L] Collision-free trajectory planning for a 6-DoF free-floating space robot via hierarchical decoupling optimization. IEEE Robotics and Automation Letters (RA-L), 2022
- [IROS’21] A multi-target trajectory planning of a 6-DoF free-floating space robot via reinforcement learning. In IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2021
- [CVPR’20] Clean-label backdoor attacks on video recognition models. In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020
Selected Awards and Honors
- IJCAI Travel Grant, IJCAI Organization, 2024
- Conference Grant, City University of Hong Kong, 2024
- DSN Student Travel Grant, DSN Student Travel Awards Committee, 2024
- Research Activities Fund, City University of Hong Kong, 2022
- Institutional Research Tuition Grant, City University of Hong Kong, 2020
- CityU Presidential PhD Scholarship (HK$1.56m for world-class PhD candidates), City University of Hong Kong, 2020
- NII MOU Research Activities Grant, National Institute of Informatics, Japan, 2018
- CSC Scholarship, China Scholarship Council, 2016
- Stars of Advanced Engineering, the highest honor at the Shen Yuan Honors College, Beihang University, 2015
Professional Activities
Program Committee Member
- ICML 2026, CVPR 2026, ICLR 2026, AAAI 2026, ICRA 2026, MM 2025, ICLR 2025, AAAI 2025
- IEEE TDSC, IEEE TSC, IEEE TC
- NeurIPS 2025, ICNP 2025, ESORICS 2022, AsiaCCS 2022, RAID 2021, IEEE IoT-J
Research & Work Experience
Visiting Researcher @ Xi'an Jiaotong University
- Supervised by Prof. Chao Shen
- Sep 2022 - Aug 2023, Xi'an, China
- May 2020 - Aug 2020, Xi'an, China
- Supervised by Prof. Chao Shen & Prof. Xingjun Ma
- Nov 2019 - Feb 2020, Xi'an, China
- Supervised by Prof. Tetsunari Inamura
- Feb 2018 - May 2018, Tokyo, Japan
- Supervised by Prof. Elias Aboutanios
- Feb 2016 - Jun 2016, Sydney, Australia
Teaching Assistant
- CS5293, Topics on Information Security, Semester B 2023/24, Code for Tutorial
- CS4394, Information Security and Management, Semester A 2023/24
- CS4293 / CS5293, Topics in Cybersecurity / Topics on Information Security, Semester B 2021/22
- CS4394 / CS5294, Information Security and Management, Semester A 2021/22
- CS4293 / CS6290, Topics in Cybersecurity / Privacy-enhancing Technologies, Semester B 2020/21
- CS2310 / CS2311, Computer Programming, Semester A 2020/21