ViZDoom - Enhance AI with Visual Reinforcement Learning in Doom

Understanding ViZDoom

Introduction

ViZDoom is an innovative platform that allows developers to create AI bots that play the classic game Doom using only visual information—essentially seeing what appears on the screen. This project is mainly focused on advancing research in machine visual learning, particularly in the field of deep reinforcement learning. ViZDoom serves as an excellent tool for studying how AI agents can interact with their environment based solely on visual cues.

Technology and Features

ViZDoom operates using the ZDoom engine, which ensures the smooth running of Doom’s game mechanics. It is designed to support multiple platforms, including Linux, macOS, and Windows, which makes it versatile and accessible to a wide range of users. Here are some notable features:

Programming Support: It offers APIs for both Python and C++, making it accessible to developers familiar with these languages.
Integration with Gymnasium/Gym: ViZDoom provides environment wrappers compatible with Gymnasium, facilitating easy integration with reinforcement learning frameworks.
Create Custom Scenarios: Users have access to visual editors, a scripting language, and various examples, allowing them to create and customize their own game scenarios.
Multiplayer Modes: It supports both asynchronous and synchronous single-player and multiplayer modes.
Performance and Customization: ViZDoom can run at up to 7000 frames per second in synchronous mode, with custom resolutions and rendering settings available.
Unique Features: The platform grants access to depth and audio buffers, automatic labeling of game objects, and the option to record episodes for later analysis.

Application and Use Cases

ViZDoom is highly regarded in the AI research community for its applications in reinforcement learning. It allows researchers to explore different learning techniques, such as apprenticeship learning and learning from demonstrations. Although the primary focus is on Python and C++, Julia, Lua, and Java bindings are also available in other branches, albeit not actively maintained.

Getting Started with ViZDoom

Installation by Platform

Linux

To install ViZDoom on Linux, users simply run:

pip install vizdoom

It supports both x86-64 and AArch64 architectures, with wheels available for Python 3.8+.

macOS

For macOS, installation is just as straightforward:

pip install vizdoom

Support is extended to both Intel and Apple Silicon CPUs.

Windows

Windows installations are also supported:

pip install vizdoom

Currently, only the x86-64 architecture is supported.

Customizing the Experience

While ViZDoom cannot redistribute original Doom graphics, it allows users who own original Doom or Doom 2 games to use their own graphics. By placing specific files in the working directory, users can enhance their visual experience.

Contributing and Community

ViZDoom encourages contributions from the community. Whether it's bug fixes, new scenarios, or feature ideas, all are welcome. The project is maintained out of passion, and contributors are vital to its ongoing success.

Licensing

The ViZDoom code is under the MIT license, while the ZDoom engine incorporates various third-party codes with differing licenses.

ViZDoom presents a fascinating intersection between classic gaming and cutting-edge AI research, making it an invaluable tool for developers and researchers eager to explore visual reinforcement learning.