REINFORCEMENT LEARNING OF MOTION FROM VIDEOS
Xuefei Li
Advisor: Stephen J. Guy
1
Introduction
It lacks generality;
It requires considerable effort to construct, and the range of motions is limited to the space of all possible reactions
2
Introduction
1. Motion capture (mocap)
A mocap system uses tracking cameras or non-optical approaches that measure inertial or mechanical motion. It can recreate complex movement and realistic physical interactions in a physically accurate manner.
However, mocap requires specific hardware and special software to obtain and process the data, which makes a large dataset expensive to collect.
3
Introduction
2. In-the-wild video
Easy to obtain; an abundant and flexible source of motions to learn from.
However, often low quality, inaccurate, and unstable.
4
Introduction
5
Related work
6
OpenPose
7
Testing the Crazy Uptown Funk flashmob in Sydney video sequence with OpenPose
Challenges of 3D pose estimation
Temporal information of motions and body shape would be helpful
8
VideoPose3D
9
HMR: Human Mesh Recovery
10
VIBE: Video Inference for Human Body Pose and Shape Estimation
11
Pose transfer
12
Figure panels: pose format of COCO, pose format of SMPL, humanoid
Reinforcement Learning (RL)
13
Implementation
14
Pose estimation
15
Motion reconstruction
16
Motion reconstruction
17
Motion reconstruction
18
Motion reconstruction
19
Motion imitation
Transformed into:
root position (3D), root rotation (4D), chest rotation (4D), neck rotation (4D), right hip rotation (4D), right knee rotation (1D), right ankle rotation (4D), right shoulder rotation (4D), right elbow rotation (1D), left hip rotation (4D), left knee rotation (1D), left ankle rotation (4D), left shoulder rotation (4D), left elbow rotation (1D)
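The layout listed above can be sketched as a flat vector with named slices. This is a minimal illustration, not the project's actual code: the names `POSE_LAYOUT`, `pose_slices`, and `split_pose` are hypothetical, and it assumes the components are concatenated in exactly the order given on the slide.

```python
# Hypothetical sketch: one pose frame as a flat vector, with each
# component's size taken from the list on the slide.
POSE_LAYOUT = [
    ("root_position", 3),
    ("root_rotation", 4),
    ("chest_rotation", 4),
    ("neck_rotation", 4),
    ("right_hip_rotation", 4),
    ("right_knee_rotation", 1),
    ("right_ankle_rotation", 4),
    ("right_shoulder_rotation", 4),
    ("right_elbow_rotation", 1),
    ("left_hip_rotation", 4),
    ("left_knee_rotation", 1),
    ("left_ankle_rotation", 4),
    ("left_shoulder_rotation", 4),
    ("left_elbow_rotation", 1),
]


def pose_slices(layout=POSE_LAYOUT):
    """Return ({name: slice}, total_length) for the flat pose vector."""
    slices = {}
    start = 0
    for name, size in layout:
        slices[name] = slice(start, start + size)
        start += size
    return slices, start


def split_pose(frame):
    """Split a flat pose frame into named components."""
    slices, total = pose_slices()
    if len(frame) != total:
        raise ValueError(f"expected {total} values, got {len(frame)}")
    return {name: list(frame[s]) for name, s in slices.items()}


if __name__ == "__main__":
    slices, total = pose_slices()
    print("values per frame:", total)
    print("right knee index range:", slices["right_knee_rotation"])
```

Summing the listed sizes gives 43 values per frame under this assumed ordering.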
Simulation environment: PyBullet
State: relative positions, rotations, linear and angular velocities of each link with respect to the root
Action: target orientations for PD controllers at each joint
Policy: determines which action to apply at each timestep in order to reproduce the desired motion
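To illustrate what "target orientations for PD controllers" means, here is a toy sketch of a PD controller driving a single 1D joint (such as a knee or elbow) toward a target angle. It is a standalone example and does not use PyBullet; the gains `kp` and `kd`, the unit inertia, and the time step are assumed values chosen only so the example settles quickly.

```python
def pd_torque(target_angle, angle, velocity, kp=100.0, kd=20.0):
    """PD control law: pull toward the target, damp the current velocity."""
    return kp * (target_angle - angle) - kd * velocity


def simulate_joint(target_angle, steps=2000, dt=0.001, inertia=1.0):
    """Integrate a single revolute joint under PD control.

    Uses semi-implicit Euler: update the velocity first, then the angle.
    Returns the joint angle after `steps` time steps.
    """
    angle, velocity = 0.0, 0.0
    for _ in range(steps):
        torque = pd_torque(target_angle, angle, velocity)
        velocity += (torque / inertia) * dt
        angle += velocity * dt
    return angle


if __name__ == "__main__":
    # The policy's action would supply the target; here it is fixed by hand.
    print(simulate_joint(target_angle=0.5))
```

In this framing the policy never outputs torques directly: it outputs targets, and the low-level PD loop converts them into torques at every simulation step.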
20
Motion imitation
21
Motion imitation
22
Test on mocap data after training with 8 workers for 1 day
Test on in-the-wild video after training with 8 workers for 1 day
Motion imitation
23