Category: Cool Stuff
-
Unit-Vector Gradients: The Geometry of a Self-Stabilizing Universe
Over the last several months I’ve been searching for the ultimate learning algorithm that could reliably learn strategic plays in adversarial, sparse reward settings like foosball, air hockey and table tennis as part of an ongoing development of Infinite MR Arcade. One of my many Frankenstein architectures is a “tabular” Q learning with neural encoded…
