Archives for sharding


“The new infrastructure reduces the training time from eight hours down to merely one hour compared to a strong baseline.” The current state-of-the-art reinforcement learning techniques require many iterations over many samples from the environment to learn a target task. For instance, the game Dota learns from batches of 2 million frames every 2 seconds.…
The post Google Teases Large Scale Reinforcement Learning Infrastructure appeared first on Analytics India Magazine.

