Fast experiment on T4 GPU. Self play training on Dark Hex (Colab notebook) [P]

Reddit r/MachineLearning / 4/28/2026

💬 OpinionDeveloper Stack & InfrastructureSignals & Early TrendsTools & Practical Usage

Key Points

  • The article shares an informal experiment using an NVIDIA T4 GPU to run self-play training for the Dark Hex game/agent.
  • It presents visualizations comparing two training iterations (1800 vs 1900) where the agent plays against itself.
  • The author provides a Google Colab notebook so readers can reproduce and run the experiment on their own.
  • The focus is on quickly iterating and observing agent learning dynamics rather than announcing a new model or benchmark.
  • The post is distributed via Reddit’s Machine Learning community and is framed as a “fun” experiment and demo.

Last week I run a fun experiment on Dark Hex. Here's a visualization of two iterations (1800 vs 1900) of agent playing agains each other 😃

Here's my colab notebook if you like to run it yourself
https://colab.research.google.com/drive/1-rm_Bh8CNaM861We97ZoicfgKxz0xOSi?usp=sharing

submitted by /u/asmonix
[link] [comments]