1/20/2022 – Interdisciplinary Science Seminar
Title: Markov chains, optimal control, and reinforcement learning Abstract: Markov decision processes are a model for several artificial intelligence problems, such as games (chess, Go…) or robotics. At each timestep, an agent has to choose an action, then receives a reward, and then the agent’s environment changes (deterministically or stochastically) in response to the agent’s action. […]