Some Reinforcement Learning: The Greedy and Explore-Exploit Algorithms for the Multi-Armed Bandit Framework in Python
This article discusses the multi-armed bandit problem and a few algorithms for solving it. The problem appeared as a lab assignment in the edX course DAT257x: Reinforcement Learning Explained by Microsoft, and the problem description is taken from the assignment itself.
The Problem Statement and Some Theory
Given a set of actions with some unknown reward distributi…
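To make the setup concrete, here is a minimal sketch of a bandit loop, not the course's lab code. It assumes Bernoulli rewards (each action pays 1 with a fixed hidden probability), which is only one possible reward distribution, and the names `run_bandit`, `true_means`, and `epsilon` are hypothetical and chosen for illustration. Setting `epsilon` to 0 gives a purely greedy agent, while a small positive value mixes in random exploration.

```python
import random


def run_bandit(true_means, epsilon, steps, seed=0):
    """Play a Bernoulli bandit with an epsilon-greedy agent (illustrative sketch).

    true_means: hidden success probability of each arm (assumed Bernoulli rewards).
    epsilon:    probability of picking a random arm instead of the best-looking one.
    steps:      number of pulls.
    Returns (estimates, counts, total_reward).
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    counts = [0] * n_arms        # how many times each arm was pulled
    estimates = [0.0] * n_arms   # running mean reward observed for each arm
    total_reward = 0.0

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: pick an arm uniformly at random.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pick the arm with the highest estimate
            # (ties go to the lowest index).
            arm = max(range(n_arms), key=lambda a: estimates[a])

        # The environment draws a 0/1 reward from the arm's hidden distribution.
        reward = 1.0 if rng.random() < true_means[arm] else 0.0

        # Incremental update of the sample mean for the chosen arm.
        counts[arm] += 1
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        total_reward += reward

    return estimates, counts, total_reward


if __name__ == "__main__":
    est, cnt, total = run_bandit([0.2, 0.5, 0.8], epsilon=0.1, steps=1000)
    print("estimates:", [round(e, 2) for e in est])
    print("pull counts:", cnt)
    print("total reward:", total)
```

The incremental mean update avoids storing every reward. With `epsilon=0` the agent can lock onto whichever arm it tried first, which is the usual motivation for adding some exploration.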