In which we cover DeepMind's new paper on AlphaCode, a paper showing the Bellman error is not a good surrogate for value error, and more!
Gradient Update #18: DeepMind's AlphaCode is…
In which we cover DeepMind's new paper on AlphaCode, a paper showing the Bellman error is not a good surrogate for value error, and more!