1 of 101

F1TENTH Autonomous Racing

�Pose Representation and Coordinate Transformation

Safe Autonomy Lab

University of Pennsylvania

Rahul Mangharam

University of Pennsylvania

rahulm@seas.upenn.edu

2 of 101

Acknowledgements

Except where otherwise noted, this work is licensed under

https://creativecommons.org/licenses/by-nc-sa/4.0/

This course is a collaborative development with significant contributions from:

Hongrui Zheng (lead), Matthew O’Kelly (lead), Johannes Betz (lead), Joseph Auckley, Madhur Behl, Luca Carlone, Jack Harkins, Paril Jain, Kuk Jang, Sertac Karaman, Dhruv Karthik, Nischal KN, Thejas Kesari, Matthew Lebermann, Kim Luong, Yash Pant, Varundev Shukla, Nitesh Singh, Siddharth Singh, and many others.

We are grateful for learning from each other

3 of 101

Frames of Reference

4 of 101

What we’ll cover today

Coordinate Frames and Transformations: why?
Reference Frames and Rigid Body Transformations
ROS specific frames and transformations using tf2_ros

https://imgs.xkcd.com/comics/matrix_transform.png

5 of 101

Frames and Transformations on

Autonomous Vehicles

6 of 101

Why do we need Frame Transformations on AV (1)

Reason 1:

Sensors provides measurement in the frame of reference specific to that sensor

www.autonomousvehicletech.com

RGBD

7 of 101

Tesla

Cameras! Lots of cameras!

8 of 101

Waymo

LIDARS! Lots of LIDARS!

9 of 101

Uber

Cameras and LIDAR

10 of 101

Navigation requires fusing multiple sensor perspectives

Each sensor

is processed

individually

11 of 101

Navigation requires fusing multiple sensor perspectives

Multiple sensor outputs are combined to provide a combined Birds Eye View

Input semantic segmentation images and predicted BEV map in Cam2BEV (source)

12 of 101

Why do we need Frame Transformations on AV (2)

Reason 2:

Mapping is usually done by tying submaps together

https://github.com/googlecartographer/cartographer_ros

13 of 101

Coordinate Frames

14 of 101

What is a Coordinate Frame?

A set of orthogonal axes attached to a body that serves to describe position of points relative to that body

axes intersect at a point known as the origin

In many robotics problems,

the first step is to assign a coordinate frame to all objects of interest

15 of 101

Why do we need Coordinate Frames?

Compact representation of points
Compact representation of orientations

Absolute coordinates in World Frame Relative coordinates between robots

16 of 101

Why do we need Coordinate Frames?

pose = position & rotation

Coordinates have no meaning without specifying coordinate frame

17 of 101

Why do we need Coordinate Frames?

Compact representation of points
Compact representation of orientations
Understand and relate quantities observed by

diﬀerent robots and sensors
at diﬀerent time steps

Rigid body transformations transform points from one frame to another

time t

time t+1

18 of 101

Right-handed Coordinate Frames

Direction of axes is important!

There is a common mnemonic:

positive x axis points along your index finger, positive y axis points along your middle finger.
In so-called "right-handed" coordinate systems, positive z-axis points along the thumb of your right hand.
In so-called "left-handed" coordinate systems, positive z-axis points along the thumb of your left hand.

robotics always uses right-handed coordinate systems

left-handed systems are sometimes used in graphics

right- handed

left- handed

19 of 101

Transformations and Frames

20 of 101

Transformations and Frames: Map Frame

– where are you with respect to the map – coordinates from origin

Map frame

21 of 101

Transformations and Frames: Sensor Frame

The – how does the world look from the sensor

Does this tell you anything about where obstacles are in the map?

Does this tell you anything about where we are in the map?

We must link frames together

Sensor frame

Transformations →

22 of 101

Transformations and Frames

The frame of reference in which the measurement is taken

Distance measurements returned by LIDAR

𝛥𝑧

23 of 101

Transformations and Frames

𝛥𝑧

The scan values from the LIDAR will not tell us how far the obstacles are. We must take care of the offsets

𝛥𝑥

24 of 101

Transformations and Frames

The frame of reference in which the measurement is taken

Distance measurements returned by LIDAR

𝛥𝑧

25 of 101

A world without frames and transformations

26 of 101

The actual motion of the car

27 of 101

Transformations and Frames

To convert measurements from one frame to another we need to perform transformations

Map frame

Car frame

laser frame

Between frames we need transformations that convert measurements from one frame to another

28 of 101

Transformations and Frames

Map frame

Car frame

laser frame

Transform from map to car

Transform from car to laser

To convert measurements from one frame to another we need to perform transformations

Static Transformations Dynamic Transformations

29 of 101

Rigid Body Transforms: An aside

What’s with it being Rigid?

Play-Doh: Not a rigid body

The distance between any two given points of a rigid body remains constant in time regardless of external forces exerted on it.

Rigid Body Transforms

Deformations experienced by a rigid body are small relative to its motion.
Translation, rectilinear and curvilinear: The motion of the body is completely specified by the motion of any point in the body. All points of the body have the same velocity and same acceleration.
Rotation about a fixed axis: All particles move in circular paths about the axis of rotation. The motion of the body is completely determined by the angular velocity of the rotation.

Baymax: Obviously, not a rigid body

30 of 101

Summary: why do we need transforms between frames?

Data is usually provided in the most convenient frame to the data source
We want to localize ourselves on a map
If an obstacle is detected in the laser frame, maybe we want to know where it is in the world, i.e. in the map frame
We need to know the transforms between frames of components on the car for more accurate actuation
If we had two disconnected maps (e.g. submaps), we might want to know their location w.r.t. Each other within a global world frame

31 of 101

Reference Frames on F1TENTH

32 of 101

Multiple Reference Frames on F1TENTH

We need to know where the robot is in the world.

We need to know the relationship between the measurement frame and the actuation frame.

map

laser

Why?

33 of 101

Multiple Reference Frames

Measurement Frame

Measurement in laser frame: can see some information about the environment, and some information on the positional relationship between the robot and the world.

34 of 101

Multiple Reference Frames

World Frame

Measurement in map frame: the range measurement we see fully makes sense. The observation matches up with the environment. We know where the robot is in the environment.

35 of 101

Multiple Reference Frames

The world makes more sense if we put the laser scans in the global map frame instead of the laser frame.

More information available when planning in a global frame instead of extracting information in the observation frame.

Laser frame

Map frame

36 of 101

Frames we will use

map
base_link
laser

map

base_link

laser

P.S. In ROS, RGB → XYZ

37 of 101

Frames we will use: Environment

map
Represents the environment around the robot

Map frame origin can be set arbitrarily when we make the map

map

base_link

laser

38 of 101

Frames we will use: Actuation

2. base_link

Defined as the center of the car’s rear axle

Moves with the car relative to the map frame

map

base_link

laser

39 of 101

Frames we will use: Measurement

3. laser

The frame in which the laser scan measurement is taken

Also moves with the car relative to the map frame

map

base_link

laser

40 of 101

Odometry

odom

Not exactly a frame

Describe the relationship between two frames

map

base_link

laser

41 of 101

Poses & Transformations

42 of 101

An Example

In the safety lab, when you define only one Time to Collision (TTC) threshold for emergency braking, the distance between the wall and the car after the car is stopped is different depending on the traveling direction.

Caused by range measurements taken in the laser frame, which is toward the front of the vehicle.

43 of 101

An Example

Could be remedied by adding a frame in the center of the chassis and calculated the TTC from there.

44 of 101

An Example

How do we transform the measurements in the original frame into the new frame that we just added?

45 of 101

Rigid Body Transforms

X_A

Y_A

Z_A

46 of 101

Rigid Body Transforms

X_B

Y_B

Z_B

X_A

Y_A

Z_A

47 of 101

Rigid Body Transforms

X_B

Y_B

Z_B

X_A

Y_A

Z_A

48 of 101

Rotating a point in 2D

49 of 101

Rotating a point in 2D

50 of 101

Rotating a point in 2D

51 of 101

Rotating a point in 2D

52 of 101

Rigid Body Transforms

Special type of matrices called Rotation matrices

X_A

Y_A

Z_A

X_B

Y_B

Z_B

53 of 101

Rigid Body Transforms

Special type of matrices called Rotation matrices

X_A

Y_A

Z_A

X_B

Y_B

Z_B

Takes points in frame B and represents their orientation in frame A

54 of 101

Rigid Body Transforms: Rotation Matrices

X_A

Y_A

Z_A

X_B

Y_B

Z_B

55 of 101

Rigid Body Transforms: Rotation Matrices

X_B

Y_B

Z_B

Cosine component

(0,5,0)

56 of 101

Rigid Body Transforms: Rotation Matrices

X_B

Y_B

Z_B

Sine component

(0,5,0)

57 of 101

Rigid Body Transforms: Rotation Matrices

X_B

Y_B

Z_B

(0,5,0)

58 of 101

Rigid Body Transforms: Rotation Matrices

59 of 101

We have the Rotation Matrix, so now what?

We now have the point P as referenced in frame A

Known

For example

= (-2.5,4.3,0)

X_A

Y_A

Z_A

(-2.5,4.3,0)

X_A

Y_A

Z_A

60 of 101

Important point to remember

The rotation matrix will take care of perspectives of orientation, what about displacement?

X_A

Y_A

Z_A

X_B

Y_B

Z_B

X_A

Y_A

Z_A

X_B

Y_B

Z_B

Origins of both the frames are at the same location

61 of 101

Rigid Body Transforms

X_B

Y_B

Z_B

X_A

Y_A

Z_A

62 of 101

Rigid Body Transforms

X_B

Y_B

Z_B

X_A

Y_A

Z_A

63 of 101

Rigid Body Transforms

X_B

Y_B

Z_B

X_A

Y_A

Z_A

64 of 101

Rigid Body Transforms

What we need is Point p with respect to Frame A, given its pose in Frame B

X_B

Y_B

Z_B

X_A

Y_A

Z_A

65 of 101

Rigid Body Transforms:

And we are back to the Future

X_B

Y_B

Z_B

X_A

Y_A

Z_A

X_B

Y_B

Z_B

X_B

Y_B

Z_B

66 of 101

Rigid Body Transforms:

And we are back to the Future

What we need is Point p with respect to Frame A, given its pose in Frame B

X_A

Y_A

Z_A

X_B

Y_B

Z_B

= Homogeneous transformations that transforms

measurements in Frame B to those in Frame A

67 of 101

Recap:

Rotation & Translation in a slightly different example

O₂

p₂

O₁

p’

First we apply the rotation

Frame 2

Frame 1

68 of 101

Putting everything together (with translation)

O₂

p₂

O₁

p’

p₁

Then we apply the translation

Frame 2

Frame 1

69 of 101

Putting everything together

O₂

p₂

O₁

What if we want a more compact representation?

70 of 101

Homogeneous Transformations

A Homogeneous Transformation is a matrix representation of a rigid body transformation.

Here R is a 2x2 (2D) / 3x3 (3D) rotation matrix and d is a 2x1 (2D) / 3x1 (3D) displacement vector.
The Homogeneous Representation of a vector is:

Here p is a 2x1 (2D) / 3x1 (3D) vector

71 of 101

Summary: Poses & Transformations

72 of 101

Points & Positions

How to express orientation of a coordinate frame with respect to another?

e.g., What is the orientation of the robot frame in the world frame?

73 of 101

Poses & Transformations

Pose = position & rotation

Poses are used to:

express relations between coordinate frames
transform points from one coordinate frame to another

74 of 101

Rigid-body Transformations

How to transform points from one coordinate frame to another?

Given point “c” in coordinate frame “r” what is the position of the point in

frame “w”?

75 of 101

Remark about notation

Positions Rotations

Poses

potential for confusion
choice of notation depends on balance of simplicity / informativeness
using consistent notation helps doing math right

76 of 101

Composition of Transforms

77 of 101

Orientation & Rotations: 2D Case

wait! is angle positive or negative?

2D rotation matrix

How to express orientation of a coordinate frame with respect to another?

e.g., What is the orientation of the robot frame in the world frame?

let’s start from the 2D case

78 of 101

Orientation & Rotations: 3D Case

3D case

x y z

3D rotation matrix

columns: unit norm, orthogonal
R is an orthonormal matrix

How to express orientation of a coordinate frame with respect to another?

e.g., What is the orientation of the robot frame in the world frame?

79 of 101

3D Rotation Matrix

80 of 101

3D Rotation Matrix

81 of 101

Elementary Rotations

cosφ

⎡

⎣

R_x(φ) = _⎢0

⎢ ₀

⎤

⎥

_⎢1 0

cosφ −sinφ _⎥

sinφ

⎥

⎦

cosθ 0 sinθ

0 1 0

−sinθ 0 cosθ

⎣

⎡

⎢

R_y(θ ) = _⎢

⎢

⎦

⎤

⎥

⎢

R (ψ) = sinψ

⎢

⎣

₀⎤

⎦

^⎡cosψ −sinψ

⎥

cosψ 0

⎥

₁⎥

Rotation around the x axis

Rotation around the y axis

Rotation around the z axis

82 of 101

Inverse of Rotations

Given

compute

Rotation Matrices

Euler Angles

Quaternions

Inverse

83 of 101

Composition of Rotations

Given

and

compute

Rotation Matrices

Euler Angles

Quaternions

Composition

(quaternion product)

84 of 101

Composition and Inverse of Transformations

Composition Inverse

85 of 101

Other representations of rotations

Quaternions: (ROS standard)

Supplementary materials in LaValle’s Virtual Reality lectures.

Euler angles

Axis angle

86 of 101

Euler Angles

Euler’s rotation theorem (1.0): Any rotation can be written as the product of no more than 3 elementary rotations (no two consecutive rotations along the same axis).

http://petercorke.com/wordpress/toolboxes/robotics-toolbox

roll

pitch

yaw

•

We only need 3 numbers to represent a rotation!

•

why not x-y-z, y-x-z, x-x-z, …?

are generally called Euler angles

87 of 101

Where will you use this?

Labs 2, 3: Automatic Emergency Braking,

Reactive Methods for navigation & planning in local frame

Later labs: Map based methods, need transforms to perform planning in global frame.

88 of 101

Next: How do we do this in ROS?

Map frame

Car frame

laser frame

Transform from map to car

Transform from car to laser

To convert measurements from one frame to another we need to perform transformations

Static Transformations Dynamic Transformations

89 of 101

Pose Representation and Coordinate Transformation

Key Takeaways

Coordinates are numerical representations of geometric concepts, like points, directions, frames of reference, and movement.

Points, directions, and displacements in 𝑛 -dimensional space are represented by 𝑛 -dimensional vectors, while rotations and scalings are represented by 𝑛×𝑛 matrices.

Rigid transformations consist of a rotation followed by a translation. They represent both rigid body movement and changes of coordinate frame.

Homogeneous coordinates represent rigid transforms using matrix multiplication in an 𝑛+1 dimensional space where the last coordinate is either 0 or 1.

When working with coordinates it is easy to make mistakes. Having clear assumptions, clear notation and/or using coordinate management software can reduce the risk of error.

90 of 101

Questions?

91 of 101

Extra slides

92 of 101

Rotation Matrix

x₁

y₁

x₂

y₂

93 of 101

Rotation Matrix - from the racecar’s perspective

x₁

x₂

y₁

y₂

94 of 101

Rotation Matrix

x₁

x₂

y₁

y₂

P_{1, 2}

Recall we represented the basis vectors of one frame as linear combinations of the other frame’s basis vector. We can do the same here.

95 of 101

Rotation Matrix

How do we relate the two?

x₁

y₁

x₂

y₂

p’

x₂

y₂

p’

96 of 101

Rotation Matrix

x₁

y₁

x₂

y₂

p’

Instead of using two frames, we can represent these two vectors in the same basis vector now.

97 of 101

Homogeneous Transformations

How to transform points from one coordinate frame to another?

Homogeneous coordinates:

More elegant, matrix formulation:

Pose & rigid body transformation

98 of 101

Rotation Matrix

x₁

y₁

x₂

y₂

p’

???

99 of 101

Example: Rotations in 3D

Rotation matrices in 3D around each axis.

See something similar?

100 of 101

Rigid Body Pose

Once chosen a frame, a rigid body is described in space by its position and orientation with respect to that reference frame.

100

map

base_link

laser

odom