DeepMind recently introduced a new meta-learning approach that generates a reinforcement learning algorithm known as Learned Policy Gradient (LPG). According to the researchers, automating the discovery of update rules from data could lead to more efficient algorithms that could also be better adapted to specific environments.    That one technique of machine learning which can be…

The post Behind DeepMind’s Framework That Discovers New Reinforcement Learning Algorithms appeared first on Analytics India Magazine.