2023
f-Policy Gradients: A General Framework for Goal Conditioned RL using f-Divergences
Siddhant Agarwal, Ishan Durugkar, Peter Stone, Amy Zhang
arXiv / project page
Imitation from Arbitrary Experience: A Dual Unification of Reinforcement and Imitation Learning Methods
Harshit Sikchi, Amy Zhang, Scott Niekum
arXiv / project page