Reward Machines: Structuring Reward Function Specifications and Reducing Sample Complexity in Reinforcement Learning
- Sheila McIlraith | University of Toronto
- Reinforcement Learning Day 2019