Embodied AI + Reasoning @ NVIDIA
Hi! I am Shun Zhang (张舜), a senior GenAI engineer on the NVIDIA Cosmos team.
My research interests lie in reinforcement learning (RL) and language models. I am interested in RL-inspired self-evolving agents that plan, acquire reusable skills, and improve through interaction. I am also interested in value alignment, particularly enabling agents to proactively infer and adapt to human users’ goals rather than reactively following instructions.
Aug 2025 - Present
NVIDIA
Santa Clara, CA
Post-training of vision-language models.
Jun 2024 - Jan 2025
Asari AI
San Francisco, CA
Developed an AI agent that plans, verifies, and discovers new skills and knowledge.
Jun 2022 - Jun 2024
MIT-IBM Watson AI Lab
Research on reinforcement learning and post-training of language models, with a focus on code generation and reinforcement learning from human feedback.
Aug 2020 - Jun 2022
IBM Research
Research on meta-reinforcement learning and AI for scientific discovery.
Sep 2015 - Apr 2020
University of Michigan
Ann Arbor, MI
Advisors: Prof. Satinder Singh and Prof. Ed Durfee.
Research on value alignment and AI safety in reinforcement learning.
Aug 2015
University of Texas at Austin
Austin, TX
Undergraduate and master's research advisors: Prof. Peter Stone and Prof. Dana Ballard.
arXiv, 2024
Conference on Neural Information Processing Systems (NeurIPS), 2023
International Conference on Learning Representations (ICLR), 2023
International Conference on Machine Learning (ICML), 2022
Ph.D. Dissertation, 2020
AAAI Conference on Artificial Intelligence (AAAI), 2020
International Joint Conference on Artificial Intelligence (IJCAI), 2018
International Conference on Automated Planning and Scheduling (ICAPS), 2017