1 of 38

Computer Vision

in FRC

By Max DeVos

2 of 38

What is Computer Vision?

  • Taking an image (often using a camera)
  • Manipulating it into usable data (pitch, yaw, height/distance)

(Diagram: Target → Camera → Vision Processor → RoboRIO)

3 of 38

What isn’t Computer Vision?

  • Taking an image
  • Displaying it to a human for interpretation

(Diagram: Target → Camera → RoboRIO → Driver)

4 of 38

How Vision Processing Works

Acquiring, Transferring, and Using Data

5 of 38

The 3 Parts of Computer Vision Tracking

Acquiring Vision Data

Using a processor to collect and manipulate camera input to get meaningful data and information

Transferring Vision Data

Moving data from the processor to a place where it can be used by the central robot controller to adjust actuators

Using Vision Data

Using the vision data to move actuators and solve engineering problems.

(Diagram: Acquiring: Camera → Vision Processor; Transferring: Vision Processor → RoboRIO; Using: RoboRIO → Motor/Piston)

6 of 38

Getting Vision Data

The Process of Turning Camera Input into Usable Data

7 of 38

Cameras - Acquiring Vision Data

  • The sensor for vision tracking
  • Things to look for:
    • USB real-time transfer
    • Aperture priority
    • High frame rate
    • High field of view (FOV)
    • Mountability (can I mount this on my robot?)
    • Affordability (you should have at least 2)
    • Durability


8 of 38

Choosing a Camera Type - Acquiring Vision Data

Standard RGB

Pros:

  • Easier to find
  • More variety
  • Often higher FOV
  • Higher resolution

Cons:

  • Doesn’t work as well with retro-reflective tape
  • Can interfere with other teams’ wavelengths
  • Harder to filter and use for vision tracking
  • May have to be turned off for light rules

Example: Microsoft LifeCam HD-3000

Infrared (IR)

Pros:

  • Works better for Vision tracking
  • Doesn’t interfere with other teams’ light
  • Doesn’t have to be turned off for light rules
  • Drastically reduces mis-detections

Cons:

  • Not as many of them (hard to find)
  • Sometimes have smaller FOV
  • Often aren’t USB

Example: Infrared Camera Module (NoIR) V2


9 of 38

The Vision Processor - Acquiring Vision Data

  • The part of the system that runs the program that turns camera images into usable data
  • Runs the program on startup
  • Program written in C++, Python, or Java


10 of 38

Vision Processor Options - Acquiring Vision Data

  • Many off-board processors are available

  • Recommended
    • NVIDIA Jetson TK1
    • NVIDIA Jetson TX1
    • NVIDIA Jetson TX2
  • Other options
    • Raspberry Pi
    • BeagleBoard/BeagleBone
    • Intel Galileo
    • A10 Linux dev board

11 of 38

The Vision Processing Program

The Software Itself

12 of 38

The Program Pipeline - Acquiring Vision Data

The program runs a script on a loop to handle new images.

1. Input

Image from the camera is loaded into the program as a 2D array of colored pixel data called a Mat (matrix)

2. Image Manipulation

Image is blurred and converted into a format that can be easily filtered

3. Filtering

Program filters the image using both color and geometry to produce a set of pixel coordinates for the points of the target

4. Calculations

Pixel points of the target from the filtering step are used to run calculations and compared against the last image to verify sensibility

13 of 38

Step 1 - “Input”

  • Latest still frame is taken from the camera and input into the program
  • Handled by the VideoCapture object


14 of 38

Step 2 - “Image Manipulation”


Image is blurred and converted into easily filterable format (HSV).

Blur

Color Conversion

15 of 38

Blur - Step 2 “Image Manipulation”

  • Blurring may seem counter-intuitive
  • It eliminates imperfections and oddities from the image
  • Result: more accurate filtering and detection


16 of 38

Color Conversion - Step 2 “Image Manipulation”

  • Convert type of the image to allow for a more intuitive filtering process
  • Important to understand types of image color storage


17 of 38

Color Storage Formats - Step 2 “Image Manipulation”

BGR (RGB)

Classic images - each pixel contains a different amount of blue, green, and red to create any color. Terrible for filtering because a single hue is spread across 3 channel values.

Note: OpenCV stores images in BGR channel order by default. Your IDE may suggest an RGB enum, but do not use it.

HSV

A color format optimized for filtering.

Like BGR, it has 3 data points per pixel, but instead of amounts of base colors it stores one value (H) for the hue, (S) for saturation (distance from white), and (V) for value (distance from black).

Mask Image (Boolean Image)

Pixels are either active or inactive.

Used for contour detection and math. Often referred to as a single-channel binary image or mask.


18 of 38

Color Conversion - Step 2 “Image Manipulation”

  • Convert your image from unfriendly BGR format to filter-friendly HSV format
  • Use OpenCV function cvtColor.


19 of 38

Step 3 - “Filtering”

Program filters the image using both color and geometry to determine a set of pixel coordinates for the target

Color Filtering → Contour Detection → Geometric Filtering

20 of 38

Color Filtering - Step 3 “Filtering”

  • Convert your image from HSV to a Boolean mask using upper and lower color bounds that are tuned to the field
  • Use the OpenCV function inRange.

Python

C++


21 of 38

Contour Detection - Step 3 “Filtering”

  • Convert white “active” pixels into blobs of coordinate data
  • OpenCV has a function to do this called findContours.


22 of 38

Geometric Filtering - Step 3 “Filtering”

  • The final filtering step is geometric filtering
  • It removes mis-detections based on shape or size

Types of geometric filters:

  • Area
  • Perimeter
  • Aspect Ratio
  • Location on screen
  • Location relative to other shapes
  • etc.


23 of 38

Geometric Filtering (cont.) - Step 3 “Filtering”

Example: peg rectangles in Steamworks


24 of 38

Geometric Filtering (cont.) - Step 3 “Filtering”

Process:

  • For loops, filtering out contours that are too small or oddly shaped
  • Sorting functions to remove small, rectangular mis-detections
  • A final, large-scale sort to determine the final target(s) out of the remaining contours

Ex. Implementation:

All Contours Array
  ↓ For: larger than 100 px²
Large Contours Array
  ↓ For: aspect ratio ≈ 2/5
Correct Shape
  ↓ Sort: largest two
Final Contour(s)

25 of 38

Step 4 - Calculations


Pixel points are used to run calculations to compare with previous data and discard invalid points.

Geometric Math

Comparison

26 of 38

Relative Tuning - Geometric Calculations

Absolute Tuning:

Using complex math and real-world measurements to find the exact pose of the target, then tuning robot mechanisms to that data.

Relative Tuning:

Using simple math and geometric properties to find the relative position of the target, then tuning robot mechanisms to that data.

  • Coordinate point data is used to interpret information about your target
  • Using relative tuning can keep the math simple.


27 of 38

Solving Math Problems - Geometric Calculations

  • Finding the actual center of a target: the X system
      • Draw lines between opposing corners and find their point of intersection
      • Solves the perspective problem by cancelling out the corners' skew


Note: to determine which corner is which:

  • Sort points by slope from the origin
  • Take the most extreme two and least extreme two, then sort those by their X or Y value to ensure correct corners
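The X system reduces to a standard line-line intersection; a plain-Python sketch (corners are assumed already sorted as described in the note):

```python
def target_center(tl, tr, br, bl):
    """Intersect the two diagonals (tl->br and tr->bl) of the target's
    corners; their crossing point is the perspective-corrected center."""
    x1, y1 = tl; x2, y2 = br      # first diagonal
    x3, y3 = tr; x4, y4 = bl      # second diagonal
    denom = (x1 - x2) * (y3 - y4) - (y1 - y2) * (x3 - x4)
    a = x1 * y2 - y1 * x2
    b = x3 * y4 - y3 * x4
    px = (a * (x3 - x4) - (x1 - x2) * b) / denom
    py = (a * (y3 - y4) - (y1 - y2) * b) / denom
    return px, py
```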

28 of 38

Important Math Functions - Geometric Calculations

  • Finding Yaw/Pitch Offset to Target
    • Used for alignment to the target around an axis

Yaw:

TargetX = the X value of the center of your target

TotalX = total width of the screen

Returns a value from -1 to 1 for how much the robot needs to move/rotate: -1 is left, 1 is right

Pitch:

TargetY = the Y value of the center of your target

TotalY = total height of the screen

Returns a value from -1 to 1 for how much the robot needs to move: 1 is up, -1 is down
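A reconstruction of these functions consistent with the described -1..1 behavior (the exact scaling is an assumption):

```python
def yaw_offset(target_x, total_x):
    """Horizontal offset of the target from screen center, scaled to -1..1:
    -1 = left edge, 0 = centered, 1 = right edge."""
    return (target_x - total_x / 2) / (total_x / 2)

def pitch_offset(target_y, total_y):
    """Vertical offset scaled to -1..1. Image y grows downward, so the sign
    is flipped: 1 = top of frame (up), -1 = bottom (down)."""
    return -(target_y - total_y / 2) / (total_y / 2)
```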

29 of 38

Comparison- Step 4 “Calculations”

Due to mis-detections, it’s important to verify that data is sensible. This can be done by comparing the current point to previous data.

Timestamp

Use your language’s time system to record the time of the frame for latency comparison

Compare

Compare your current point to your most recent data to verify that it is plausible. If not, disregard that point and wait for the next one.

Store

If the data point is good, store it in place of the previous data for future comparisons.
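One way to sketch the timestamp/compare/store cycle in Python (the 0.3 jump threshold is an arbitrary assumption):

```python
import time

MAX_JUMP = 0.3   # assumed largest plausible yaw change between two frames

last = {'yaw': None, 'stamp': None}

def accept(yaw):
    """Timestamp the new point, compare it to the previous one, and store
    it only if the change is plausible. Returns True if accepted."""
    now = time.monotonic()                       # timestamp for latency checks
    if last['yaw'] is not None and abs(yaw - last['yaw']) > MAX_JUMP:
        return False                             # implausible jump: discard
    last['yaw'], last['stamp'] = yaw, now        # store good data
    return True
```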

30 of 38

Transferring Vision Data

Moving Vision Data from the Processor To Your Robot Controller

31 of 38

Methods of Communication - Moving Vision Data

Serial Communication

  • Pros
    • Faster
  • Cons
    • Unstructured / no built-in support for multiple values
    • Potentially hard to debug

Network Communication

  • Pros
    • Easier
    • Built-in support for multiple values
  • Cons
    • Slower
    • Requires a radio port


32 of 38

Network Communication - Moving Vision Data

  • NetworkTables Library
    • Built into WPILib
    • Can be compiled independently
    • PyNetworkTables for Python
    • A system for sending multiple values over the radio
  • Network Configuration
    • Use the RoboRIO as the server
    • Use the coprocessor as the client
    • Send data over the radio

(Diagram: Vision Processor → Radio → RoboRIO)

33 of 38

Using Vision Data

Using Vision Data to Move Actuators and Solve Engineering Problems

34 of 38

Using Vision Data to Drive Actuators

  • Vision data will be used to move the robot
  • The values you have available:
    • Yaw Offset (-1 to 1)
    • Pitch Offset (-1 to 1)
    • Average height of contour for distance


35 of 38

Relative Tuning - Using Vision Data

Relative Tuning:

  • Using simple math and geometric properties to find relative position of target
  • Tuning robot mechanisms to that data.

Use relative tuning to find a function that tells the robot specifically how to behave based on sensor data (in this case, Vision data)

Example:

Sensor Data: Height of contour for distance

Feedback: RPM of flywheel

Goal: Adjust RPM of flywheel to match distance

(Diagram: Vision Processor → RoboRIO → PID Loop → Motor/Piston)

36 of 38

Relative Tuning - How To

  1. Create a data table
  2. Use a dynamic variable as your X axis (e.g., height of the target contour)
  3. Use your robot adjustment as your Y axis (e.g., flywheel RPM needed to score)
  4. Repeat many times at different X values
  5. Plot the data and create a line of best fit
  6. Use that equation in your program to interpret vision data.
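Steps 5-6 with NumPy; the data points below are made-up placeholders for illustration:

```python
import numpy as np

# Hypothetical data table gathered by repeated testing: contour height in
# pixels (X) vs. flywheel RPM that scored from that distance (Y).
heights = np.array([40.0, 55.0, 70.0, 85.0, 100.0])
rpms = np.array([4800.0, 4300.0, 3800.0, 3300.0, 2800.0])

# Line of best fit: rpm = m * height + b
m, b = np.polyfit(heights, rpms, 1)

def rpm_for(height):
    """Interpret vision data: map a target height to a flywheel setpoint."""
    return m * height + b
```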


37 of 38

What is Computer Vision?

  • Taking an image (often using a camera)
  • Manipulating it into usable data (pitch, yaw, height/distance)

(Diagram: Target → Camera → Vision Processor → RoboRIO)

38 of 38

Thanks!

Any questions?

You can email us at programming@kingtec2169.com

Or me personally at devosmaxwell@gmail.com

Or see me after the presentation :)

Presentation can be found on https://kingtec2169.com/resources/documents/