2 of 60

Basics

Edge pixels are those at which image intensity changes abruptly.

Edge segments or edges are sets of connected edge pixels.

3 of 60

Digital derivatives: first versus second

Along a ramp (i.e. intensity change with constant slope), first derivative is a non-zero constant.

Second derivative is zero along the ramp, except at the start and end!

Second derivative is preferable for image sharpening! Many edges in images are ramp-like, in which case first derivative will give thick edges (undesirable), whereas second derivative gives thin edges two pixels wide (desirable).

Second derivative changes sign midway at a step edge (zero crossing property).

4 of 60

At roof edges, second derivative gives stronger response than first derivative (i.e. has larger magnitude) – except if both parts of the roof were at 45 degrees, where both will have equal response.

At isolated points, second derivative gives stronger response than first derivative

A rotationally invariant second-derivative operator is used for edge detection: the Laplacian, in fact zero-crossings of a Laplacian.

Digital derivatives: first versus second

5 of 60

Laplacian of an image

Laplacian operators: second operator is obtained by adding second derivatives along both the diagonals, to the first operator

Rotationally symmetric operator (in the continuous domain)

6 of 60

From book by Gonzalez and Woods

Intensity discontinuity, also called step edge

7 of 60

Edge detection under noise

First and second derivatives are very sensitive to noise!

Left to right in each row: original image, first derivative, second derivative

In any column: increase in noise level

8 of 60

Steps in robust edge detection

Image smoothing to reduce noise: because first and second derivatives are noisy

Actual detection of edge points (may produce thick or disconnected edges)

Pruning or linking of the edge points

9 of 60

Edge detection: image gradient

An image gradient is defined as follows:

10 of 60

Robust edge operators

These operators: Prewitt and Sobel perform some smoothing prior to difference computation, and hence are more robust than the simple finite differences from the previous slide.

11 of 60

Prewitt and Sobel

Apply the differencing mask [-1 0 1]^T and then apply the smoothing mask [1 1 1] on the result. This is equivalent to applying the first Prewitt mask (i.e. for f_y).
For the Prewitt operator for f_x, apply [-1 0 1] and then [1 1 1]^T.

Apply the differencing mask [-1 0 1]^T and then apply the smoothing mask [1 2 1] on the result. This is equivalent to applying the first Sobel mask (i.e. for f_y).
For the Sobel operator for f_x, apply [-1 0 1] and then [1 2 1]^T.

12 of 60

Original image

Derivatives in y direction: f_y

Derivatives in x direction: f_x

| f_x| + |f_y|

Edges with Sobel Mask

13 of 60

Original image

Derivatives in y direction: f_y

Derivatives in x direction: f_x

|f_x| + |f_y|

Edges with 5 x 5 averaging followed by Sobel Mask

Notice: finer details removed

14 of 60

Edge map after Thresholding the earlier map.

15 of 60

Marr-Hildreth edge detector

Principled approach to combined edge detection and image smoothing

An edge operator should satisfy two criteria:
Notion of scale in edge detection (operators of different sizes)
Differential operator to compute first or second derivatives of the intensity

Uses the Laplacian of Gaussian operator on an image satisfies both criteria

16 of 60

Laplacian of Gaussian (LoG or Mexican hat)

18 of 60

Why LoG for edge detection?

Remember: we are interesting in zero-crossings of the second derivative!
We want the second derivative operator to be rotationally invariant. The Laplacian is the simplest (though not the only) such operator.
The Gaussian is a low-pass filter, which blurs structures with intensity difference smaller than σ.
Gaussian is smooth in spatial and frequency domains.

19 of 60

Using LoG for edge detection

(1) Convolve the image with the LoG operator, i.e.:

(2) Find the zero-crossings in the resultant image.

20 of 60

Implementing the convolution

Use an n x n mask, created by sampling the formula for LoG on a discrete Cartesian crid of size n x n.

How to choose n? Depends on σ: n should be >= 6σ, because more than 99% of the volume underneath a 2D Gaussian lies between μ-3σ and μ+3σ.

21 of 60

Finding zero-crossings

Look at a 3 x 3 window around a pixel.

There exists a zero-crossing at that pixel if the signs of at least one pair of opposing neighbors (top & bottom, left & right, top-left & right-bottom, top-right & left-bottom) differ and the difference in their values must exceed a threshold p.

(x,y)

22 of 60

σ = 4, n = 25

Convolut-ion with LoG

Zero-crossings

Zero-crossings + thresholding

23 of 60

Some properties of the Marr-Hildreth Edge Detector

Detected edges are 1-pixel in thickness

Can be used for different sigma values, keeping only those edge points that are common to multiple scales

24 of 60

Canny Edge Detector: using 1^st derivatives

Three aims/objectives:
Low error (no spurious responses due to noise)
Well-localized edge points
Single pixel wide edges

Canny’s detector was formulated by analyzing step edges in 1D under zero-mean i.i.d. Gaussian noise, and expressing the preceding three criteria mathematically and deriving an (approximate) operator that obeys all criteria.

25 of 60

Canny’s Edge Detector

In 1D, the approximate operator was the first derivative of the Gaussian:

In 2D, the approximate operator was the first derivative of the Gaussian in the direction of the local image gradient (i.e. perpendicular to the edge).

But the true gradient is not known, so we first smooth the image with a 2D Gaussian and then use its gradient magnitude and direction.

26 of 60

Canny’s Edge Detector

Gradient magnitude and direction:

M(x,y) contains wide ridges along the true edges of the image.
Though the maximum values are along the edge, those values don’t immediately fall off to 0 on either side of the edge.
Recall ramp edges.

27 of 60

Canny edge detector

This requires us to perform a thinning operation – called as non-maximal suppression (i.e. we set to zero those values of M(x,y) which are likely to NOT be local maxima).

28 of 60

Canny edge detector

Consider the four quantized directions shown on the previous slide. Pick the direction d_k, which is closest to ϴ(x,y).
If M(x,y) is less than either of its two neighbors along the direction d_k, then set it to 0 (i.e. suppress the non-maximum), otherwise we leave it as is.
After this processing, M(x,y) is the non-maximal suppressed image.

29 of 60

Canny Edge Detector

The (thinned) edge map so far may still contain some false positives. This is taken care of (to some extent) by hysteresis thresholding.
This uses a low-threshold TL and a high threshold TH.
We keep only those pixels for which

-M(x,y) > TH, OR

-M(x,y) > TL AND (x,y) is a neighbor of at least one pixel with M(x,y) > TH.

(x,y)

The 8 neighbors of pixel (x,y)

30 of 60

Summary: Canny Edge detector

Convolve the image with a Gaussian Filter. Compute the gradient magnitudes and orientations.

(2) Perform non-maximal suppression.

(3) Perform hysteresis thresholding.

33 of 60

Gaussian filtered (sigma = 1.4, 5 x 5 mask)

Intensity gradients superimposed

Non-maximum suppression

Double thresholding (TL = 0.1, TH = 0.3)

After edge-linking (hysteresis)

https://en.wikipedia.org/wiki/Canny_edge_detector

34 of 60

Corner Detection

35 of 60

Importance of corners

Corners are considered salient feature points.

Useful in many tasks in computer vision and image processing

Example: corresponding control points for image alignment

37 of 60

Importance of corners

In some applications, we need to find matching patches.
But some patch matches are easier to find than others.
See next slide for examples.

38 of 60

Good patch for matching

Bad patch for matching

http://www.cse.psu.edu/~rtc12/CSE486/lecture06.pdf

39 of 60

Importance of corners

Good patches should contain corners – large intensity variations in all directions.
Shifting the window in any direction yields large appearance change.

http://www.cse.psu.edu/~rtc12/CSE486/lecture06.pdf

40 of 60

Ambiguity in point matching in flat intensity regions or along edges

41 of 60

Principle behind Harris corner detector

Consider two patches – one each centered at (x,y) and (x+u,y+v) in two images (patch domain: Ω, typically rectangular, like say 7 x7 or 5 x 5) with u,v being small.

The image have same intensity at physically corresponding locations.

The sum of squared difference (SSD) between their intensities is given as:

42 of 60

Structure tensor matrix (size 2 x 2)

43 of 60

Principle behind Harris corner detector

The local A matrix on the previous slide is called the structure tensor.

A carries information about local image geometry.

It is always positive semi-definite, i.e. its two eigenvalues are always non-negative.

In locally flat regions, A will be close to a zero matrix and hence both its eigenvalues will be close to 0.

44 of 60

Proof that the structure tensor matrix is positive semi-definite: Method 1

Any matrix that can be written in the form A = ZZ^T is always positive semi-definite (quoting a result from linear algebra)

45 of 60

Proof that the structure tensor matrix is positive semi-definite: Method 2

46 of 60

Principle behind Harris corner detector

At a point lying on an edge, only one of the eigenvalues is large (corresponding to the eigenvector that points across the edge) and the other is close to 0 (corresponding to the eigenvector that points along the edge)

At a corner point, both eigenvectors will be large.

We would like the SSD to be large for all non-zero shifts (u,v) so that we can allow for maximum discriminability and easier point matching.

47 of 60

http://www.cse.psu.edu/~rtc12/CSE486/lecture06.pdf

48 of 60

http://www.cse.psu.edu/~rtc12/CSE486/lecture06.pdf

49 of 60

http://www.cse.psu.edu/~rtc12/CSE486/lecture06.pdf

50 of 60

“Corner”-ness measure

We would like both eigenvalues of this matrix to be large.

Corner response or “corner”ness measure:

R_H = det(A) – k (trace(A))², k between 0.04 to 0.06

Does not require explicit eigenvalue/eigenvector computation:

51 of 60

http://www.ipol.im/pub/art/2018/229/article_lr.pdf

53 of 60

Computing corners

Smooth image I to compute various derivatives

Choose a window size (say 7 x 7) and compute the structure tensor at each pixel, given this window size

Compute the corner-ness measure for each pixel

Perform thresholding or non-maximal suppression to create a corners map.

59 of 60

Invariance

The detected corners are invariant under affine intensity change, i.e. image J replaced by aJ + b for scalars a,b.

The corners are also invariant to rotation and translation.

60 of 60

Other measures of corner-ness

Based on just the minimum eigenvalue – declared a corner if the minimum eigenvalue exceeds a threshold

Harmonic mean of eigenvalues

1 of 60

2 of 60

3 of 60

4 of 60

5 of 60

6 of 60

7 of 60

8 of 60

9 of 60

10 of 60

11 of 60

12 of 60

13 of 60

14 of 60

15 of 60

16 of 60

17 of 60

18 of 60

19 of 60

20 of 60

21 of 60

22 of 60

23 of 60

24 of 60

25 of 60

26 of 60

27 of 60

28 of 60

29 of 60

30 of 60

31 of 60

32 of 60

33 of 60

34 of 60

35 of 60

36 of 60

37 of 60

38 of 60

39 of 60

40 of 60

41 of 60

42 of 60

43 of 60

44 of 60

45 of 60

46 of 60

47 of 60

48 of 60

49 of 60

50 of 60

51 of 60

52 of 60

53 of 60

54 of 60

55 of 60

56 of 60

57 of 60

58 of 60

59 of 60

60 of 60