Project Icon

Super-mario-bros-PPO-pytorch

Enhance Gaming AI using Proximal Policy Optimization with Python

Product DescriptionThe project applies the Proximal Policy Optimization (PPO) algorithm to train an AI agent to play Super Mario Bros, completing 31 out of 32 levels. Building on the A3C method, this shows marked performance improvements. It allows training and testing of models with customizable learning rates for optimal results. A Dockerfile facilitates a seamless setup for training and testing, although there may be rendering issues. This framework is ideal for those exploring AI-centered game development and performance optimization.
Project Details