Presenter (15mins)
Himangi Mittal <hmittal@andrew.cmu.edu>
Learning a Unified Policy for Whole-Body Control of�Manipulation and Locomotion
Zipeng Fu*, Xuxin Cheng*, Deepak Pathak�CoRL 2022 (Oral)
Problem Statement/Motivation
Legged-only robots have achieved impressive performance in the last decade in challenging outdoor and indoor terrain.
Kumar, Ashish, et al. "Rma: Rapid motor adaptation for legged robots." arXiv preprint arXiv:2107.04034 (2021).
Problem Statement/Motivation
However, legged-only robots have strong limitations in what they can achieve.
Pick and place objects
Pressing button
Everyday tasks require some form of manipulation.
Attaching an arm can significantly increase the ability of the legged robots to several mobile manipulation tasks
Attaching an arm can significantly increase the ability of the legged robots to several mobile manipulation tasks
Not an easy task!
Challenges (1)
High-DoF control
Challenges (2)
Conflicting policies
Challenges (3)
Dependency
Challenges (4)
Cost ($$$$$) and Hardware
Spot Arm from Boston Dynamics
(has pre-designed controllers, cannot be changed)
ANYmal robot with a custom arm (ANYBotics)
Challenges (4)
Cost ($$$$$) and Hardware
Spot Arm from Boston Dynamics
(has pre-designed controllers, cannot be changed)
ANYmal robot with a custom arm (ANYBotics)
Expensive ($100K)!!
Related Work (1)
Go Fetch! - Dynamic Grasps using Boston Dynamics Spot with External�Robotic Arm (Simon Zimmermann, Roi Poranne, Stelian Coros)
Related Work (2)
Combining Learning-based Locomotion Policy with Model-based�Manipulation for Legged Mobile Manipulators (Yuntao Ma, Farbod Farshidian, Takahiro Miki, Joonho Lee, Marco Hutter)
Related Work (2)
ALMA - Articulated Locomotion and Manipulation for a Torque-Controllable Robot (C. Dario Bellicoso, Koen Kr ̈amer, Markus St ̈auble, Dhionis Sako, Fabian Jenelten, Marko Bjelonic, Marco Hutter)
Related Work (3)
RMA: Rapid Motor Adaptation for Legged Robots
(Ashish Kumar, Zipeng Fu, Deepak Pathak, Jitendra Malik)
Has
Has a similar online real-time adaptation module which works on a diverse set of environment configurations.
Contributions and Proposed Solution
Contributions and Proposed Solution
Input
Contributions and Proposed Solution
Input
Contributions and Proposed Solution
Input
Output
Contributions and Proposed Solution
Contributions and Proposed Solution
Hardware setup
The robot platform is comprised of a Unitree Go1 quadraped with 12 actuatable DoFs, and a robot arm which is the 6-DoF Interbotix WidowX 250s with a parallel gripper. We mount the arm on top of the quadruped. The RealSense D435 provides RGB visual information and is mounted close to the gripper of WidowX. Both power of Go1 and WidowX are provided by Go1’s onboard battery. Neural network inference is also done onboard of Go1. Our robot system uses only onboard computation and power so it is fully untethered.
Advantage Mixing for locomotion and manipulation
: policy
: target leg joint position
: state
:Advantage function of locomotion
:Advantage function of manipulation
: target arm joint position
: beta curriculum parameter linearly increasing from 0 to 1
Regularized Online Adaptation for Sim-to-Real Transfer
: parameters of the policy
: parameters of the encoder
: parameters of the adaptation module
: Encoder
: stop-gradient
: adaptation module
Evaluation Questions
Baselines and Metrics
Baselines
Metrics
Dataset (Simulation Environment)
Results (1) (Simulation Environment)
Results (2) (Simulation Environment)
Results (3) (Simulation Environment)
Results (4) (Simulation Environment)
Results (1) (Real-World Environment)
Dataset (Real-World Environment)
Results (2) (Real-World Environment)
Results (2) (Real-World Environment)
Results (3) (Real-World Environment)
Results (Videos)
Strengths
Strength 1
Both the real-world and simulation experiments demonstrate that the proposed learning-based method is able to perform well on complex tasks that require coordination of arms and legs.
Strength 2
Technical strengths
Strength 3
Proposes a low-cost hardware setup for academic research labs. Reduces the cost from $100K to $6K.
Weaknesses
Himangi Mittal <hmittal@andrew.cmu.edu>
Weakness 1
Generalization of the method to other type of object interactions :
Weakness 2
High input dimension : Although it does not impact training time, but training is prone to error if multiple dimensions have external noise.
Weakness 3
Adaptation of online module to changing dynamics when a few degree of freedom is lost
TL;DR/Summary: Key Insights
QnA (1mins)
5 Discussion Points – send to TA, don’t include in slides
Sim2Real Table Tennis
Unified Whole-Body Control of Manipulation and Locomotion
4. If a legged robot has two arms : a). What additional challenges would need to be addressed? b). Are there any added benefits of adding extra arms? If yes, what can the benefits be?
5. What are the different scenarios which can be tested to analyze the robustness of the robot (for example, slippery/snowy/rough terrains, how much weight can the arm hold and the robot can still maintain its pose and move)?