In Reinforcement learning, the generalization of the agents is benchmarked on the environments they have been trained on. In a supervised learning setting, this would mean testing the model using the training dataset. OpenAI has open-sourced Procgen-benchmark emphasizing the generalization for RL agents as they struggle to generalize in new environments. Procgen consists of 16…

The post Generalization in Reinforcement Learning – Exploration vs Exploitation appeared first on Analytics India Magazine.