Guanzhi Wang

I am a Research Scientist at NVIDIA. My research interests lie in the area of foundation models, robotics, and embodied agents.

Featured Publications * Equal contribution, † Equal advising
All Publications
NitroGen: A Foundation Model for Generalist Gaming Agents
arXiv preprint, 2026
We present NitroGen, a vision-action foundation model trained on 40,000 hours of gameplay videos across 1,000+ games that demonstrates strong cross-game generalization.
GR00T N1: An Open Foundation Model for Generalist Humanoid Robots
arXiv preprint, 2025
We present GR00T N1, an open Vision-Language-Action foundation model for generalist humanoid robots.
Voyager: An Open-Ended Embodied Agent with Large Language Models
Transactions on Machine Learning Research (TMLR), 2024
We introduce Voyager, the first LLM-powered embodied lifelong learning agent in Minecraft that continuously explores the world, acquires diverse skills, and makes novel discoveries without human intervention.
MineDojo: Building Open-Ended Embodied Agents with Internet-Scale Knowledge
Neural Information Processing Systems (NeurIPS) Datasets and Benchmarks Track, 2022
Outstanding Paper Award
We introduce MineDojo, a new framework based on the popular Minecraft game for building generally capable, open-ended embodied agents.
DreamGen: Unlocking Generalization in Robot Learning through Video World Models
Conference on Robot Learning (CoRL), 2025
We introduce DreamGen, a 4-stage pipeline for training robot policies that generalize across behaviors and environments through neural trajectories.
iGibson
iGibson 1.0: a Simulation Environment for Interactive Tasks in Large Realistic Scenes
International Conference on Intelligent Robots and Systems (IROS), 2021
We present iGibson, a novel simulation environment for developing interactive robotic agents in large-scale realistic scenes.
Teaching

Caltech CS148: Large Language and Vision Models (Spring 2024)

Teaching Assistant

Caltech CS101: 3D Deep Learning (Winter 2024)

Teaching Assistant

Stanford CS231n: ConvNet for Visual Recognition (Spring 2021)

Teaching Assistant

Academic Services

Conference Reviewer: CVPR 2025, ICML 2024, ICLR 2024, NeurIPS 2023, NeurIPS 2022, ICLR 2022, ICCV 2021, CVPR 2021, ECCV 2020