Original Research
Self-play with PPO, search, and board-state encoders
The original project focused on training and evaluating Blokus Duo agents with policy and value modeling, then comparing how search changed the quality of move selection.
Blokus Duo ML Agents
The original Blokus Duo work was a research-heavy self-play project. This route turns that work into something visitors can actually use: a browser-friendly board, several agent personalities, and a ruleset small enough to finish in a few minutes without losing the diagonal-placement strategy that made the project interesting.
Original Research
The original project focused on training and evaluating Blokus Duo agents with policy and value modeling, then comparing how search changed the quality of move selection.
Website Adaptation
The web demo distills those ideas into a reduced 9x9 ruleset with static policy tables. That makes the page fast, deployable on GitHub Pages, and still useful as a tactical showcase.
Browser Demo
Each profile uses a different precomputed move-ranking table. They are not live neural nets in the browser, but they let the site demonstrate how different policy preferences change openings, space control, and endgame shape.
Playable Web Adaptation
This browser version keeps the diagonal-only expansion rule from Blokus while shrinking the board to 9x9 and limiting each side to six pieces. The agent profiles are static move-ranking tables designed to mimic different self-play personalities without shipping full model inference into the site.
Match Setup
Prefers long diagonals that race toward the opponent corner and force awkward replies.
Turn 1
Blue opens from the top-left corner. Pick a piece, rotate it, and place on a highlighted anchor.
Previewing Seed from A1. Click a dotted anchor to place it.