Artificial Twin | DIAMBRA AI Tournament Platform

A Competitive AI Platform for Deep Reinforcement Learning

Artificial Twin partnered with DIAMBRA to architect and develop an advanced platform for training, deploying, and evaluating intelligent agents through Deep Reinforcement Learning (DeepRL) in arcade-style competitive simulations. The collaboration focused on delivering a robust, modular system capable of supporting large-scale experimentation and community participation.

DIAMBRA offered a novel combination of research-grade training environments and consumer-facing esports-style engagement, creating a hybrid product with applications across AI research, education, and competitive development. By bridging the gap between cutting-edge reinforcement learning techniques and accessible game-like interfaces, it opened the doors to a broader ecosystem of contributors and learners.

The result was a mature product ecosystem with real-time tournaments, leaderboards, and a community-driven approach to reinforcement learning experimentation. In December 2024, the project was successfully acquired, validating the commercial viability of deeply technical, open-source-led innovation and its ability to attract both talent and strategic interest.

Trained Agent on Street Figher III

Project Objectives

The client’s core goal was to build a public-facing AI arena that allowed researchers and independent developers to train agents in structured environments, benchmark their performance in real-time competitions, and engage with a gamified leaderboard and achievement system. The platform also provided standardized tools to support experimentation across single-agent, multi-agent, imitation, and offline learning setups, lowering the barrier to entry for those exploring reinforcement learning in competitive contexts.

This vision required a seamless blend of research infrastructure and real-time interactive systems. Artificial Twin was brought on board as the end-to-end AI technology provider, responsible for the architecture, implementation, and deployment of the full system stack. This included environment and simulator design, multi-agent orchestration and evaluation logic, training pipelines and starter kits, CI/CD infrastructure for user-submitted agents, as well as tournament hosting and live streaming integrations. The result was a tightly integrated ecosystem that supported both academic exploration and community-driven innovation.

Arcade-style RL Environments

Artificial Twin engineered a set of high-quality arcade game environments, designed for both performance and research flexibility:

Built in C++ for real-time responsiveness, with gRPC-based bindings for Python integration
Gymnasium API-compatible, allowing drop-in usage with major RL libraries like Stable Baselines 3 and Ray RLlib
Multi-modal observation support (raw pixels, structured state data)
Fully Dockerized for consistent deployment across Linux, Windows, and macOS
Designed for diverse learning paradigms:
- Single-agent (AI vs COM)
- Multi-agent adversarial/self-play (AI vs AI)
- Human-agent interaction
- Imitation learning

These environments became the foundation of DIAMBRA Arena, providing plug-and-play capabilities for advanced RL experimentation.

Agent Training & League Support

Artificial Twin developed a complete training stack under the DIAMBRA Agents repository, offering modular and reproducible training pipelines, primarily based on PPO, alongside customizable reward shaping and network architectures. The framework was lightweight enough to run efficiently on consumer hardware, while remaining powerful and extensible for complex experiments.

It supported advanced techniques such as league training, enabling scalable self-play dynamics in population-based reinforcement learning setups. Extensive logging and evaluation tools were also integrated to ensure reliable benchmarking across runs.

The stack was designed with accessibility in mind, allowing both seasoned researchers and independent developers to get started quickly without sacrificing flexibility. As a result, users were able to prototype, test, and iterate on new reinforcement learning agents with minimal overhead, dramatically accelerating the experimentation and deployment cycle.

Evaluation & Tournament Infrastructure

A distinguishing feature of the DIAMBRA platform was its competitive AI evaluation system, fully designed and built by Artificial Twin:

Automated orchestration of AI-vs-AI and AI-vs-COM match rollouts
Centralized leaderboard computation across users, games, and metrics
Support for unlockable achievements and skill progression
Full episode logging with automatic Twitch and YouTube live-streaming, enhancing user engagement and visibility
Configurable tournament formats, from round-robin to elimination brackets
Scalable backend to handle peak-time matches and live broadcasts
Replay and match analysis tools to support post-game breakdowns and learning

The infrastructure enabled transparent, repeatable benchmarking in a format that echoed esports and coding competitions, making DIAMBRA one of the few RL platforms that emphasized *public performance as a first-class feature*. It helped create an engaging feedback loop where users could track their agents’ progress and visibility in real time.

Strategic Value & Outcomes

The platform served as a strategic differentiator in the reinforcement learning space by bridging the gap between academic research tools and developer-friendly gamified platforms. It significantly lowered the barrier to entry for complex RL experimentation and introduced a unique model for community-led agent competitions similar in spirit to Kaggle but focused on interactive intelligence. This approach created a dynamic ecosystem where both hobbyists and professionals could contribute, compete, and innovate on a level playing field.

The project culminated in a successful acquisition in December 2024 highlighting its commercial readiness, strong open-source adoption and a viable business model centered on competitive RL agent development. The platform demonstrated how high-performance, interactive AI systems could generate sustainable interest and value across research, education, and entertainment domains.

Artificial Twin’s technical leadership was central to DIAMBRA’s journey from concept to acquisition, delivering a production-grade system that blended AI innovation with product-market alignment and community-centric design.

Selected References

To explore DIAMBRA in more depth, here are the official resources developed and maintained throughout the project. They include the core codebase, documentation, and the research paper that guided the technical foundations.

Paper on DIAMBRA Platform (arXiv)
Github Repositories:

Documentation

League Training — TLeague customization for DIAMBRA Arena. Agents evolve through competitive matchups in a multi-agent reinforcement learning loop driven by TLeague’s population-based self-play and DIAMBRA’s fighting game environments.

Some additional videos

Trained Agent on Dead Or Alive ++

Trained Agent on Tekken Tag Tournament

Reward Engineering

Reward Hacking