GameNGen: Diffusion World Model for Game Simulation

Published: January 01, 2026

Trained a world model to simulate Super Mario Bros gameplay end-to-end using latent diffusion. The system learns a compressed representation of game state via a VAE, generates plausible next frames through a diffusion model, and trains PPO agents entirely within the simulated environment — no real game engine needed at inference time.

Key components: VAE encoder/decoder for latent state compression · Latent diffusion model for next-frame generation · CNN-based reward model estimating reward signal from visual observations · PPO agents trained within the world model loop · Adversarial initialization for diverse rollout coverage

Stack: Python, PyTorch, diffusion models, VAE, PPO, computer vision

GitHub →

Share on

Bluesky Facebook LinkedIn Mastodon X (formerly Twitter)

Aravind Kannappan

Share on