A Gymnasium environment for the game Balatro for use with reinforcement learning (RL) libraries such as TorchRL and Stable Baselines3. Also includes RL training and evaluation scripts.
In these videos, the policy is performing rollouts of game hands to learn from. All gameplay is completely automated on a live game client using the balatrobot API and requires no human input. Full training speed (512x) to accelerate rollouts is accomplished using the HandyBalatro mod.
MedalTVBalatro20251201100130899.mp4
MedalTVBalatro20251201101800555.mp4
Coming soon
Coming soon
Coming soon
- BalatroBot: https://coder.github.io/balatrobot/installation/
- Improve observation space definition to account for complex hand space embedding.
- Add engineered features to observation space such as
has_flush. - Add basic features to observation space such as
hands_left. - Refactor network architecture to include multi-head output for shop decisions.
- Improve README Evaluation, Installation, Usage, and Requirements sections.
The project has an associated feature board that you can view here.