Gradient Update #18: DeepMind's AlphaCode is…

In which we cover DeepMind's new paper on AlphaCode, a paper showing the Bellman error is not a good surrogate for value error, and more!

Read →