Skip to main content

One doc tagged with "policy"

View all tags

Investigating a Local Minimum

I've been training an AlphaZero-style agent to learn, on its own, how to play a form of turn-based TETR.IO, a kind of competitive