Abhishek Gupta

I am an assistant professor in computer science and engineering at the Paul G. Allen School at the University of Washington. I lead the Washington Embodied Intelligence and Robotics Development (WEIRD) lab.

Previously, I was a post-doctoral scholar at MIT, collaborating with Russ Tedrake and Pulkit Agarwal.

I spent 6 wonderful years completing my PhD in machine learning and robotics at BAIR at UC Berkeley, where I was advised by Professor Sergey Levine and Professor Pieter Abbeel. Previously, I completed my bachelors degree also at UC Berkeley.

My main research goal is to develop algorithms which enable robotic systems to learn how to perform complex tasks in a variety of unstructured environments like offices and homes. To that end, I work towards building deep reinforcement learning algorithms that can learn in the real world, with and around humans. Recently our work has focused on deployment time reinforcement learning, learning on deployment directly in human-centric environments under the following themes

  • Human-in-the-loop RL for targeted exploration
  • Constructing and leveraging simulation as a tool to bootstrap safe RL
  • Robustness, generalization and fast adaptation for robotic control policies
More generally, I have been interested in the problems of human in the loop reinforcement learning, reward specification, continual real world data collection and learning, fast adaptation with meta learning for robotics, offline reinforcement learning for robotics, multi-task and meta-learning and dexterous manipulation with robotic hands and studying generalization and extrapolation for policies and models. I am also excited about a broader space of problems including algorithms for assistive robotics, safe exploration, robustness and compositionality in deep learning, and all things embodied intelligence.

For prospective students: I am looking for highly motivated Ph.D students and postdoctoral researchers to join our group. For Ph.D. students, I highly encourage you to apply to the UW CSE Ph.D program through the Allen school, and list me as an advisor of interest. I am very open to coadvising requests as well, please mention this in your application. I ask that you do not email me directly with regard to PhD admissions until after you are admitted, as I will not be able to reply to emails from individual applicants. Rest assured I will give your application a read! For postdoctoral scholar applications, please send me an email with your CV and a statement of your interests.

Email  /  CV  /  GitHub  /  Google Scholar  /  Ph.D. Thesis

Workshop Papers, Submissions and Pre-prints

Ecological Reinforcement Learning
John D Co-Reyes, Suvansh Sanjeev, Glen Berseth, Abhishek Gupta, Sergey Levine
arXiv Preprint
paper

Unsupervised meta-learning for reinforcement learning
Abhishek Gupta*, Benjamin Eysenbach*, Chelsea Finn, Sergey Levine
arXiv preprint, best paper at LLARLA workshop at ICML 2018
paper / blog

Accelerating online reinforcement learning with offline datasets
Ashvin Nair*, Abhishek Gupta*, Murtaza Dalal, Sergey Levine
arXiv preprint
paper

Learning latent state representation for speeding up exploration
Giulia Vezzani, Abhishek Gupta, Lorenzo Natale, Pieter Abbeel
arXiv preprint
paper



Publications

2024

CCIL: Continuity-Based Data Augmentation for Corrective Imitation Learning
Liyiming Ke, Yunchu Zhang, Abhay Deshpande, Siddhartha Srinivasa, Abhishek Gupta
ICLR 2024
paper

Modeling Boundedly Rational Agents with Latent Inference Budgets
Athul Paul Jacob, Abhishek Gupta, Jacob Andreas
ICLR 2024
paper

Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised Learning
Zhaoyi Zhou, Chuning Zhu, Runlong Zhou, Qiwen Cui, Abhishek Gupta, Simon Shaolei Du
ICLR 2024
paper

ASID: Active Exploration for System Identification and Reconstruction in Robotic Manipulation
Marius Memmel, Andrew Wagenmaker, Chuning Zhu, Dieter Fox, Abhishek Gupta
ICLR 2024

Learning to Grasp in Clutter with Interactive Visual Failure Prediction
Michael Murray, Abhishek Gupta, Maya Cakmak
ICRA 2024

Lifelong Robot Learning with Human Assisted Language Planners
Zichen Zhang, Yunshuang Li, Osbert Bastani, Abhishek Gupta, Dinesh Jayaraman, Yecheng Jason Ma, Luca Weihs
ICRA 2024
paper

Rank2Reward: Learning Shaped Reward Functions from Passive Video
Daniel Yang, Davin Tjia, Jacob Berg, Dima Damen, Pulkit Agrawal, Abhishek Gupta
ICRA 2024

Universal Visual Decomposer: Long-Horizon Manipulation Made Easy
Zichen Zhang, Yunshuang Li, Osbert Bastani, Abhishek Gupta, Dinesh Jayaraman, Yecheng Jason Ma, Luca Weihs
ICRA 2024
paper

SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning
Jianlan Luo, Zheyuan Hu, Charles Xu, You Liang Tan, Jacob Berg, Archit Sharma, Stefan Schaal, Chelsea Finn, Abhishek Gupta, Sergey Levine
ICRA 2024
paper

2023

Autonomous Robotic Reinforcement Learning with Asynchronous Human Feedback
Max Balsells I Pamies, Marcel Torne Villasevil, Zihan Wang, Samedh Desai, Pulkit Agrawal, Abhishek Gupta
CoRL 2023
paper

Fighting Uncertainty with Gradients: Offline Reinforcement Learning via Diffusion Score Matching
HJ Suh, Glen Chou, Hongkai Dai, Lujie Yang, Abhishek Gupta, Russ Tedrake
CoRL 2023
paper

REBOOT: Reuse Data for Bootstrapping Efficient Real-World Dexterous Manipulation
Zheyuan Hu, Aaron Rovinsky, Jianlan Luo, Vikash Kumar, Abhishek Gupta, Sergey Levine
CoRL 2023
paper

Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets
Zhang-Wei Hong, Aviral Kumar, Sathwik Karnik, Abhishek Bhandwaldar, Akash Srivastava, Joni Pajarinen, Romain Laroche, Abhishek Gupta, Pulkit Agrawal
NeurIPS 2023
paper

Breadcrumbs to the Goal: Goal-Conditioned Exploration from Human-in-the-Loop Feedback
Marcel Torne, Max Balsells, Zihan Wang, Samedh Desai, Tao Chen, Pulkit Agrawal, Abhishek Gupta
NeurIPS 2023
paper

RePo: Resilient Model-Based Reinforcement Learning by Regularizing Posterior Predictability
Chuning Zhu, Max Simchowitz, Siri Gadipudi, Abhishek Gupta
NeurIPS 2023 (Spotlight)
paper

Self-Supervised Reinforcement Learning that Transfers using Random Features
Boyuan Chen, Chuning Zhu, Pulkit Agrawal, Kaiqing Zhang, Abhishek Gupta
NeurIPS 2023
paper

Tackling Combinatorial Distribution Shift: A Matrix Completion Perspective
Max Simchowitz, Abhishek Gupta, Kaiqing Zhang
COLT 2023
paper

Guiding Pretraining in Reinforcement Learning with Large Language Models
Yuqing Du, Olivia Watkins, Zihan Wang, Cédric Colas, Trevor Darrell, Pieter Abbeel, Abhishek Gupta, Jacob Andreas
ICML 2023
paper

GenAug: Retargeting behaviors to unseen situations via Generative Augmentation
Zoey Chen, Sho Kiami, Abhishek Gupta*, Vikash Kumar*
RSS 2023 (Best Systems Paper Finalist)
paper

Cherry-picking with reinforcement learning
Yunchu Zhang, Liyiming Ke, Abhay Deshpande, Abhishek Gupta, Siddhartha Srinivasa
RSS 2023
paper

Learning to Extrapolate: A Transductive Approach
Aviv Netanyahu*, Abhishek Gupta*, Max Simchowitz, Kaiqing Zhang, Pulkit Agrawal
ICLR 2023
paper

TactoFind: A Tactile Only System for Object Retrieval
Sameer Pai, Tao Chen, Megha Tippur, Edward Adelson, Abhishek Gupta*, Pulkit Agrawal*
ICRA 2023
paper

Demonstration-Bootstrapped Autonomous Practicing via Multi-Task Reinforcement Learning
Abhishek Gupta, Corey Lynch, Brandon Kinman, Garrett Peake, Sergey Levine, Karol Hausman
ICRA 2023
paper

Dexterous Manipulation from Images: Autonomous Real-World RL via Substep Guidance
Kelvin Xu*, Zheyuan Hu*, Ria Doshi, Aaron Rovinsky, Vikash Kumar, Abhishek Gupta, Sergey Levine
ICRA 2023
paper

2022

Learning Robust Real-World Dexterous Grasping Policies via Implicit Shape Augmentation
iuyu Chen, Karl Van Wyk, Yu-Wei Chao, Wei Yang, Arsalan Mousavian, Abhishek Gupta, Dieter Fox
CoRL 2022
paper

Unpacking Reward Shaping: Understanding the Benefits of Reward Engineering on Sample Complexity
Abhishek Gupta*, Aldo Pacchiano*, Simon Zhai, Sham Kakade, Sergey Levine
NeurIPS 2022
paper

Distributionally Adaptive Meta Reinforcement Learning
Anurag Ajay*, Abhishek Gupta*, Dibya Ghosh, Sergey Levine, Pulkit Agrawal
NeurIPS 2022
paper

Autonomous Reinforcement Learning: Formalism and Benchmarking
Archit Sharma, Kelvin Xu, Nikhil Sardana, Abhishek Gupta, Karol Hausman, Sergey Levine, Chelsea Finn
ICLR 2022
paper

2021

Teachable Reinforcement Learning via Advice Distillation
Olivia Watkins, Trevor Darrell, Pieter Abbeel, Jacob Andreas, Abhishek Gupta
NeurIPS 2021
paper

Persistent Reinforcement Learning via Subgoal Curricula
Archit Sharma, Abhishek Gupta, Sergey Levine, Karol Hausman, Chelsea Finn
NeurIPS 2021
paper

Adaptive risk minimization: A meta-learning approach for tackling group shift
Marvin Zhang, Henrik Marklund, Nikita Dhawan, Abhishek Gupta, Sergey Levine, Chelsea Finn
NeurIPS 2021
paper / blog

Which Mutual-Information Representation Learning Objectives are Sufficient for Control?
Kate Rakelly, Abhishek Gupta, Carlos Florensa, Sergey Levine
NeurIPS 2021
paper

Fully Autonomous Real-World Reinforcement Learning for Mobile Manipulation
Charles Sun, Jedrzej Orbik, Coline Devin, Brian Yang, Abhishek Gupta, Glen Berseth, Sergey Levine
CoRL 2021
paper

Learning to reach goals via iterated supervised learning
Dibya Ghosh*, Abhishek Gupta*, Ashwin Reddy, Justin Fu, Coline Devin, Benjamin Eysenbach, Sergey Levine
ICLR 2021 (Oral)
paper / blog

MURAL: Meta-Learning Uncertainty-Aware Rewards for Outcome-Driven Reinforcement Learning
Kevin Li*, Abhishek Gupta*, Ashwin D Reddy, Vitchyr Pong, Aurick Zhou, Justin Yu, Sergey Levine
ICML 2021
paper / website

Reset-Free Reinforcement Learning via Multi-Task Learning: Learning Dexterous Manipulation Behaviors without Human Intervention
Abhishek Gupta*, Justin Yu*, Tony Z. Zhao*, Vikash Kumar*, Aaron Rovinsky, Kelvin Xu, Thomas Devlin, Sergey Levine
ICRA 2021
paper / website

2020

The ingredients of real-world robotic reinforcement learning
Henry Zhu*, Justin Yu*, Abhishek Gupta*, Dhruv Shah, Kristian Hartikainen, Avi Singh, Vikash Kumar, Sergey Levine
ICLR 2020 (spotlight)
paper / blog

Discor: Corrective feedback in reinforcement learning via distribution correction
Aviral Kumar, Abhishek Gupta, Sergey Levine
NeurIPS 2020 (spotlight)
paper / blo

Gradient surgery for multi-task learning
Tianhe Yu, Saurabh Kumar, Abhishek Gupta, Sergey Levine, Karol Hausman, Chelsea Finn
NeurIPS 2020
paper

2019

Unsupervised curricula for visual meta-reinforcement learning
Allan Jabri, Kyle Hsu, Benjamin Eysenbach, Abhishek Gupta, Alexei Efros, Sergey Levine, Chelsea Finn
NeurIPS 2019 (spotlight)
paper

ROBEL: RObotics BEnchmarks for Learning with low-cost robots
Michael Ahn, Henry Zhu, Kristian Hartikainen, Hugo Ponte, Abhishek Gupta, Sergey Levine, Vikash Kumar
CoRL 2019
paper / blog

Relay policy learning: Solving long-horizon tasks via imitation and reinforcement learning
Abhishek Gupta, Vikash Kumar, Corey Lynch, Sergey Levine, Karol Hausman
CORL 2019
paper / website

Guided meta-policy search
Russell Mendonca, Abhishek Gupta, Rosen Kralev, Pieter Abbeel, Sergey Levine, Chelsea Finn
NeurIPS 2019 (spotlight)
paper

Dexterous Manipulation with Deep Reinforcement Learning: Efficient, General, and Low-Cost
Henry Zhu*, Abhishek Gupta*, Aravind Rajeswaran, Sergey Levine, Vikash Kumar
ICRA 2019
paper / blog

Diversity is all you need: Learning skills without a reward function
Benjamin Eysenbach, Abhishek Gupta, Julian Ibarz, Sergey Levine
ICLR 2019
paper / video

Guiding policies with language via meta-learning
John D Co-Reyes, Abhishek Gupta, Suvansh Sanjeev, Nick Altieri, John DeNero, Pieter Abbeel, Sergey Levine
ICLR 2019
paper

Learning actionable representations with goal-conditioned policies
Dibya Ghosh, Abhishek Gupta, Sergey Levine
ICLR 2019
paper

Automatically composing representation transformations as a means for generalization
Michael B. Chang, Abhishek Gupta, Sergey Levine, Thomas Griffith
ICLR 2019
paper

2018

Self-consistent trajectory autoencoder: Hierarchical reinforcement learning with trajectory embeddings
John D Co-Reyes*, YuXuan Liu*, Abhishek Gupta*, Benjamin Eysenbach, Pieter Abbeel, Sergey Levine
ICML 2018
paper

Imitation from observation: Learning to imitate behaviors from raw video via context translation
YuXuan Liu*, Abhishek Gupta*, Pieter Abbeel, Sergey Levine
ICRA 2018
paper / video

Meta-reinforcement learning of structured exploration strategies
Abhishek Gupta, Russell Mendonca, YuXuan Liu, Pieter Abbeel, Sergey Levine
NeurIPS 2018 (spotlight)
paper / code

Learning complex dexterous manipulation with deep reinforcement learning and demonstrations
Aravind Rajeswaran*, Vikash Kumar*, Abhishek Gupta, Giulia Vezzanni, John Schulman, Emanuel Todorov, Sergey Levine
RSS 2018
paper / video

2017

Learning modular neural network policies for multi-task and multi-robot transfer
Abhishek Gupta*, Coline Devin*, Trevor Darrell, Pieter Abbeel, Sergey Levine
ICRA 2017
paper / video

Learning invariant feature spaces to transfer skills with reinforcement learning
Abhishek Gupta*, Coline Devin*, Yuxuan Liu, Pieter Abbeel, Sergey Levine
ICLR 2017
paper / video

2016

Learning dexterous manipulation for a soft robotic hand from human demonstrations
Abhishek Gupta, Clemens Eppner, Sergey Levine, Pieter Abbeel
IROS 2016
paper / video

Guided search for task and motion plans using learned heuristics
Rohan Chitnis, Dylan Hadfield-Menell, Abhishek Gupta, Siddhart Srivastava, Edward Groshev, Christopher Lin, Pieter Abbeel
ICRA 2016
paper / video

2015

Learning from multiple demonstrations using trajectory-aware non-rigid registration with applications to deformable object manipulation
Alex Lee, Abhishek Gupta, Henry Lu, Sergey Levine, Pieter Abbeel
IROS 2015
paper

Learning force-based manipulation of deformable objects from multiple demonstrations
Alex X. Lee, Henry Lu, Abhishek Gupta, Sergey Levine, Pieter Abbeel
ICRA 2015
paper

Tractability of planning with loops
Siddharth Srivastava, Shlomo Zilberstein, Abhishek Gupta, Pieter Abbeel, Stuart Russell
AAAI 2015
paper / video


Website template from Jon Barron.
Last updated January 2021.