FacebookFacebook group TwitterTwitter
ICVSS From Perception to Action

Robot Learning In The Wild: Continual Improvement by Watching and Practicing

Deepak Pathak

Carnegie Mellon University, USA

Abstract

How can we train a robot that can generalize to perform thousands of tasks in thousands of environments? This poses a chicken and egg problem: to train robots for generalization, we need large amounts of robotic data from diverse environments, but it is impractical to collect such data unless we can deploy robots that generalize. Passive human videos on the internet can help alleviate this issue by providing diverse scenarios to pretrain robotic skills. However, just watching humans is not enough, the robot needs to learn and improve by autonomously practicing in the real world and adapting its learning to new scenarios. We will unify these three mechanisms -- learning by watching others (social learning), practicing by exploration (curiosity), and adapting already learned skills in real-time (adaptation) -- to define a continually adaptive robotic framework. I will demonstrate the potential of this framework for scaling up robot learning via case studies of controlling dextrous robotic hands from monocular vision, dynamic-legged robots walking from vision on unseen challenging hikes, and mobile manipulators performing lots of diverse manipulation tasks in the wild.

Bio

Deepak Pathak is a faculty in the School of Computer Science at Carnegie Mellon University. He received his Ph.D. from UC Berkeley and his research spans computer vision, machine learning, and robotics. He is a recipient of Okawa research award, IIT Kanpur Young Alumnus award, CoRL22 Paper Award and faculty awards from Google, Samsung, Sony and GoodAI. Deepak's research has been featured in popular press outlets, including The Economist, The Wall Street Journal, Forbes, Quanta Magazine, Washington Post, CNET, Wired, and MIT Technology Review among others. Earlier, he received his Bachelor's from IIT Kanpur with a Gold Medal in Computer Science. He co-founded VisageMap Inc. later acquired by FaceFirst Inc.