Machine Intelligence through Decision-making and Interaction

2025
Fast Adaptation with Behavioral Foundation Models
Harshit Sikchi, Andrea Tirinzoni, Ahmed Touati, Yingchen Xu, Anssi Kanervisto, Scott Niekum, Amy Zhang, Alessandro Lazaric, Matteo Pirotta
arxiv /
RL Zero: Zero-Shot Language to Behaviors without any Supervision
Harshit Sikchi, Siddhant Agarwal, Pranaya Jajoo, Samyak Parajuli, Caleb Chuck, Max Rudolph, Peter Stone, Amy Zhang, Scott Niekum
arxiv / project page
Proto Successor Measure: Representing the Behavior Space of an RL Agent
Siddhant Agarwal, Harshit Sikchi, Peter Stone, Amy Zhang
arxiv /
CRESTE: Scalable Mapless Navigation with Internet Scale Priors and Counterfactual Guidance
Arthur Zhang, Harshit Sikchi, Amy Zhang, Joydeep Biswas
RSS 2025 / project page
An Optimal Discriminator Weighted Imitation Perspective for Reinforcement Learning
Haoran Xu, Shuozhe Li, Harshit Sikchi, Scott Niekum, Amy Zhang
ICLR 2025 / project page
Reevaluating Policy Gradient Methods for Imperfect-Information Games
Max Rudolph, Nathan Lichtle, Sobhan Mohammadpour, Alexndre Bayen, J. Zico Kolter, Amy Zhang, Eugene Vinitsky, Sam Sokota
arxiv / project page
Augmented Conditioning Is Enough For Effective Training Image Generation
Jiahui Chen, Amy Zhang, Adriana Romero-Soriano
arxiv
2024
Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement Learning
Liyuan Mao*, Haoran Xu*, Weinan Zhang, Xianyuan Zhan, Amy Zhang
NeurIPS 2024 / project page
A Dual Approach to Imitation Learning from Observations with Offline Datasets
Harshit Sikchi, Caleb Chuck, Amy Zhang, Scott Niekum
arxiv / project page
AMAGO-2: Breaking the Multi-Task Barrier in Meta-Reinforcement Learning with Transformers
Jake Grigsby, Justin Sasek, Samyak Parajuli, Daniel Adebi, Amy Zhang, Yuke Zhu
arxiv / project page
EC-Diffuser: Multi-Object Manipulation via Entity-Centric Behavior Generation
Carl Qi, Dan Haramati, Tal Daniel, Aviv Tamar, Amy Zhang
arxiv / project page
SkiLD: Unsupervised Skill Discovery Guided by Factor Interactions
Zizhao Wang, Jiaheng Hu, Caleb Chuck, Stephen Chen, Roberto Martín-Martín, Amy Zhang, Scott Niekum, Peter Stone
arxiv / project page
Automated Discovery of Functional Actual Causes in Complex Environments
Caleb Chuck, Sankaran Vaidyanathan, Stephen Giguere, Amy Zhang, David Jensen, Scott Niekum
arxiv / project page
Automated Discovery of Functional Actual Causes in Complex Environments
Caleb Chuck, Sankaran Vaidyanathan, Stephen Giguere, Amy Zhang, David Jensen, Scott Niekum
arxiv / project page
Learning Action-based Representations Using Invariance
Max Rudolph, Caleb Chuck, Kevin Black, Misha Lvovsky, Scott Niekum, Amy Zhang
arxiv / project page
Score Models for Offline Goal-Conditioned Reinforcement Learning
Harshit Sikchi, Rohan Chitnis, Ahmed Touati, Alborz Geramifard, Amy Zhang, Scott Niekum
arxiv / project page
2023
Robot Air Hockey: A Manipulation Testbed for Robot Learning with Reinforcement Learning
Caleb Chuck, Carl Qi, Michael J Munje, Shuozhe Li, Max Rudolph, Chang Shi, Siddhant Agarwal, Harshit Sikchi, Abhinav Peri, Sarthak Dayal, Evan Kuo, Kavan Mehta, Anthony Wang, Peter Stone, Amy Zhang, Scott Niekum
arxiv / project page
f-Policy Gradients: A General Framework for Goal Conditioned RL using f-Divergences
Siddhant Agarwal, Ishan Durugkar, Peter Stone, Amy Zhang
arxiv / project page
Dual RL: Unification and new methods for reinforcement and imitation learning
Harshit Sikchi, Qinqing Zheng, Amy Zhang, Scott Niekum
arxiv / project page