1 of 38

Announcements

Proj3: It’s out. Time to get started.

  • Last semester’s students requested a walkthrough/getting-started video, so I’ll be making one of those tomorrow.

Midterm 2 grades will be out late Tuesday/early Wednesday.

  • Grading is effectively done, but some makeups tomorrow.

2 of 38

CS61B

Lecture 32: Sorting I

  • The Sorting Problem
  • Selection Sort, Heapsort
  • Mergesort
  • Insertion Sort
  • Shell’s Sort (Extra)

3 of 38

Sorting - Definitions

A sort is a permutation (re-arrangement) of a sequence of elements that brings them into order according to some total order. A total order ≼ is:

  • Total: x ≼ y or y ≼ x for all x, y
  • Reflexive: x ≼ x
  • Antisymmetric: x ≼ y and y ≼ x iff x = y (x and y are equivalent).
  • Transitive: x ≼ y and y ≼ z implies x ≼ z.

Unequal items may be ‘equivalent’ under these definitions:

  • Example: Given strings sorted by length, either of two equal-length strings may be put first.
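A total order like the one above can be expressed as a Java Comparator. A minimal sketch for illustration (the class name LengthOrder is ours, not from the slides):

```java
import java.util.Arrays;
import java.util.Comparator;

public class LengthOrder {
    public static void main(String[] args) {
        // A total order on strings: compare by length only.
        Comparator<String> byLength = Comparator.comparingInt(String::length);

        // "cat" and "dog" are unequal but equivalent under this order:
        // compare() returns 0 for them.
        System.out.println(byLength.compare("cat", "dog"));  // 0

        // Arrays.sort with a comparator is stable, so equivalent items keep
        // their input order; an unstable sort could validly put "dog" first.
        String[] a = {"bird", "cat", "dog"};
        Arrays.sort(a, byLength);
        System.out.println(Arrays.toString(a));  // [cat, dog, bird]
    }
}
```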

4 of 38

Sorting: An Alternate Viewpoint

An inversion is a pair of elements that are out of order.

Goal of sorting:

  • Given a sequence of elements with Z inversions.
  • Perform a sequence of operations that reduces inversions to 0.

Example: 0 1 1 2 3 4 8 6 9 5 7

Inversions: 8-6, 8-5, 8-7, 6-5, 9-5, 9-7 (6 inversions out of 55 max)
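This count can be checked with a brute-force inversion counter (a sketch; the class name Inversions is ours):

```java
public class Inversions {
    /** Count pairs (i, j) with i < j but a[i] > a[j]. Θ(N²) brute force. */
    public static int count(int[] a) {
        int inversions = 0;
        for (int i = 0; i < a.length; i += 1) {
            for (int j = i + 1; j < a.length; j += 1) {
                if (a[i] > a[j]) {
                    inversions += 1;
                }
            }
        }
        return inversions;
    }

    public static void main(String[] args) {
        int[] a = {0, 1, 1, 2, 3, 4, 8, 6, 9, 5, 7};
        // The pairs 8-6, 8-5, 8-7, 6-5, 9-5, 9-7 are out of order.
        System.out.println(count(a));  // 6
    }
}
```

A sorted sequence is exactly one with zero inversions, which is what makes this an alternate definition of the sorting goal.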

5 of 38

Performance Definitions

Characterizations of the runtime efficiency are sometimes called the time complexity of an algorithm. Examples:

  • DFS has time complexity Θ(V+E)

Characterizations of the “extra” memory usage of an algorithm are sometimes called the space complexity of an algorithm.

  • DFS has space complexity Θ(V)
    • Note that the graph takes up space Θ(V+E), but we don’t count this as part of the space complexity of DFS, since we’re only accounting for the extra space that DFS uses.

6 of 38

Selection Sort and Heapsort

7 of 38

Selection Sort

We’ve seen this already.

  • Find smallest item.
  • Swap this item to the front and ‘fix’ it.
  • Repeat for unfixed items until all items are fixed.
  • Demo: https://goo.gl/g14Cit

Sort Properties:

  • Θ(N²) time if we use an array (or similar data structure).

Seems inefficient: We look through the entire remaining array every time to find the minimum.
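The steps above can be sketched in Java (an illustration, not the course staff's implementation):

```java
import java.util.Arrays;

public class SelectionSort {
    /** Repeatedly swap the smallest unfixed item to the front and fix it. */
    public static void sort(int[] a) {
        for (int fixed = 0; fixed < a.length; fixed += 1) {
            // Scan the entire unfixed region for the minimum: Θ(N) per pass,
            // which is what makes the total Θ(N²).
            int min = fixed;
            for (int i = fixed + 1; i < a.length; i += 1) {
                if (a[i] < a[min]) {
                    min = i;
                }
            }
            int tmp = a[fixed];
            a[fixed] = a[min];
            a[min] = tmp;
        }
    }

    public static void main(String[] args) {
        int[] a = {32, 15, 2, 17, 19, 26, 41, 17, 17};
        sort(a);
        System.out.println(Arrays.toString(a));  // [2, 15, 17, 17, 17, 19, 26, 32, 41]
    }
}
```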

8 of 38

Naive Heapsort: Leveraging a Max-Oriented Heap

Idea: Instead of rescanning entire array looking for minimum, maintain a heap so that getting the minimum is fast!

For reasons that will become clear soon, we’ll use a max-oriented heap.

Naive heapsorting N items:

  • Insert all items into a max heap, and discard input array. Create output array.
  • Repeat N times:
    • Delete largest item from the max heap.
    • Put largest item at the end of the unused part of the output array.

Naive Heapsort Demo: https://goo.gl/EZWwSJ

A min heap would work as well, but wouldn’t be able to take advantage of the fancy trick in a few slides.
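The naive algorithm can be sketched with java.util.PriorityQueue standing in for the max heap (illustrative; the class name NaiveHeapsort is ours):

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.PriorityQueue;

public class NaiveHeapsort {
    /** Θ(N) extra space: both the heap and the output array are new copies. */
    public static int[] sort(int[] input) {
        // Insert all items into a max-oriented heap. java.util.PriorityQueue
        // is min-oriented by default, so flip the comparator.
        PriorityQueue<Integer> maxHeap = new PriorityQueue<>(Collections.reverseOrder());
        for (int x : input) {
            maxHeap.add(x);
        }
        // Repeat N times: delete the largest item from the max heap and put
        // it at the end of the unused part of the output array.
        int[] output = new int[input.length];
        for (int i = input.length - 1; i >= 0; i -= 1) {
            output[i] = maxHeap.poll();
        }
        return output;
    }

    public static void main(String[] args) {
        int[] sorted = sort(new int[]{32, 15, 2, 17, 19, 26, 41, 17, 17});
        System.out.println(Arrays.toString(sorted));  // [2, 15, 17, 17, 17, 19, 26, 32, 41]
    }
}
```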

9 of 38

Naive Heapsort Runtime: http://shoutkey.com/any

Naive heapsorting N items:

  • Insert all items into a max heap, and discard input array. Create output array.
  • Repeat N times:
    • Delete largest item from the max heap.
    • Put largest item at the end of the unused part of the output array.

What is the TOTAL runtime of naive heapsort?

  1. Θ(N)
  2. Θ(N log N)
  3. Θ(N²), but faster than selection sort.

10 of 38

Heapsort Runtime Analysis

Use the magic of the heap to sort our data.

  • Getting items into the heap: Θ(N log N) time.
  • Selecting largest item: Θ(1) time.
  • Removing largest item: O(log N) for each removal.

Informally, overall runtime is Θ(N log N) + Θ(N) + O(N log N) = Θ(N log N)

  • Far better than selection sort!

Memory usage is Θ(N) to build the additional copy of all of our data.

  • Worse than selection sort, but probably no big deal (??).
  • Can eliminate this extra memory cost with the same fancy trickery.

If every swim fails, it’d be Θ(N) in that case, so maybe it’s best to say that this is O(N log N).

A thing to ponder; I’ll put it on the study guide with answers.

11 of 38

In-place Heapsort

Alternate approach, treat input array as a heap!

  • Rather than inserting into a new array of length N + 1, use a process known as “bottom-up heapification” to convert the array into a heap.
    • To bottom-up heapify, just sink nodes in reverse level order.
  • Avoids need for extra copy of all data.
  • Once heapified, algorithm is almost the same as naive heap sort.

In-place heap sort: Demo

Before heapification: 32 15 2 17 19 26 41 17 17
After heapification:  41 19 32 17 15 26 2 17 17

For this algorithm we don’t leave spot 0 blank.

12 of 38

In-place Heapsort Runtime: http://shoutkey.com/salad

Use the magic of the heap to sort our data.

  • Bottom-up Heapification: O(???) time.
  • Selecting largest item: Θ(1) time.
  • Removing largest item: O(log N) for each removal.

Give the time complexity of in-place heapsort in big O notation.

  1. O(N)
  2. O(N log N)
  3. O(N²)

13 of 38

In-place Heapsort Runtime

Use the magic of the heap to sort our data.

  • Heapification: O(N log N) time.
  • Selecting largest item: Θ(1) time.
  • Removing largest item: O(log N) for each removal.

Give the time complexity of in-place heapsort in big O notation.

  • O(N log N)

Bottom-up heapification is N sink operations, each taking no more than O(log N) time, so overall runtime for heapification is O(N log N).

  • Extra for experts, show that bottom-up heapification is Θ(N) in the worst case.
  • More extra for experts, show heapsort is Θ(N log N) in the worst case.
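The in-place algorithm can be sketched as follows (illustrative Java with an iterative sink; the names are ours):

```java
import java.util.Arrays;

public class InPlaceHeapsort {
    /** Treat a itself as a max heap rooted at index 0 (spot 0 not left blank). */
    public static void sort(int[] a) {
        int n = a.length;
        // Bottom-up heapification: sink nodes in reverse level order.
        for (int i = n / 2 - 1; i >= 0; i -= 1) {
            sink(a, i, n);
        }
        // Repeatedly swap the max (root) to the end of the unsorted region,
        // then restore heap order among the remaining items.
        for (int end = n - 1; end > 0; end -= 1) {
            swap(a, 0, end);
            sink(a, 0, end);
        }
    }

    private static void sink(int[] a, int i, int n) {
        while (2 * i + 1 < n) {
            int child = 2 * i + 1;                    // left child
            if (child + 1 < n && a[child + 1] > a[child]) {
                child += 1;                           // right child is bigger
            }
            if (a[i] >= a[child]) {
                return;                               // heap order holds
            }
            swap(a, i, child);
            i = child;
        }
    }

    private static void swap(int[] a, int i, int j) {
        int tmp = a[i];
        a[i] = a[j];
        a[j] = tmp;
    }

    public static void main(String[] args) {
        int[] a = {32, 15, 2, 17, 19, 26, 41, 17, 17};
        sort(a);
        System.out.println(Arrays.toString(a));  // [2, 15, 17, 17, 17, 19, 26, 32, 41]
    }
}
```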

14 of 38

In-place Heapsort: http://shoutkey.com/scream

What is the memory complexity of Heapsort?

  1. Θ(1)
  2. Θ(log N)
  3. Θ(N)
  4. Θ(N log N)
  5. Θ(N²)

15 of 38

In-place Heapsort

What is the memory complexity of Heapsort?

  • Θ(1)
  • Θ(log N)
  • Θ(N)
  • Θ(N log N)
  • Θ(N²)

The only extra memory we need is a constant number of instance variables, e.g. size.

  • Unimportant caveat: If we employ recursion to implement various heap operations, space complexity is Θ(log N) due to need to track recursive calls. The difference between Θ(log N) and Θ(1) space is effectively nothing.

16 of 38

Sorts So Far

Sort                  Best Case    Worst Case    Space   Notes
Selection Sort        Θ(N²)        Θ(N²)         Θ(1)
Heapsort (in place)   Θ(N)*        Θ(N log N)    Θ(1)    Bad cache (61C) performance.

*: An array of all duplicates yields linear runtime for heapsort.

17 of 38

Mergesort

18 of 38

Mergesort

We’ve seen this one before as well.

Mergesort: Demo

  • Split items into 2 roughly even pieces.
  • Mergesort each half (steps not shown, this is a recursive algorithm!)
  • Merge the two sorted halves to form the final result.

Time complexity, analysis from previous lecture: Θ(N log N) runtime.

  • Space complexity with aux array: Costs Θ(N) memory.

Also possible to do an in-place mergesort, but the algorithm is very complicated, and runtime performance suffers by a significant constant factor.
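The split-sort-merge recursion can be sketched with an auxiliary-array merge (illustrative, not the course staff's implementation):

```java
import java.util.Arrays;

public class Mergesort {
    /** Split into two roughly even pieces, mergesort each, merge the halves. */
    public static void sort(int[] a) {
        if (a.length <= 1) {
            return;
        }
        // The two copies are the Θ(N) auxiliary memory cost.
        int[] left = Arrays.copyOfRange(a, 0, a.length / 2);
        int[] right = Arrays.copyOfRange(a, a.length / 2, a.length);
        sort(left);   // recursive: this is the "steps not shown" part
        sort(right);
        // Merge the two sorted halves to form the final result.
        int i = 0;
        int j = 0;
        for (int k = 0; k < a.length; k += 1) {
            if (j >= right.length || (i < left.length && left[i] <= right[j])) {
                a[k] = left[i];
                i += 1;
            } else {
                a[k] = right[j];
                j += 1;
            }
        }
    }

    public static void main(String[] args) {
        int[] a = {32, 15, 2, 17, 19, 26, 41, 17, 17};
        sort(a);
        System.out.println(Arrays.toString(a));  // [2, 15, 17, 17, 17, 19, 26, 32, 41]
    }
}
```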

19 of 38

Sorts So Far

Sort                  Best Case    Worst Case    Space   Notes
Selection Sort        Θ(N²)        Θ(N²)         Θ(1)
Heapsort (in place)   Θ(N)*        Θ(N log N)    Θ(1)    Bad cache (61C) performance.
Mergesort             Θ(N log N)   Θ(N log N)    Θ(N)    Faster than heap sort.

*: An array of all duplicates yields linear runtime for heapsort.

20 of 38

Insertion Sort

21 of 38

Insertion Sort

General strategy:

  • Start with an empty output sequence.
  • Add each item from input, inserting into output at the right point.

Naive approach, build entirely new output: Demo (Link)

Input:  32 15 2 17 19 26 41 17 17
Output: (empty so far)

22 of 38

Insertion Sort

General strategy:

  • Start with an empty output sequence.
  • Add each item from input, inserting into output at the right point.

Naive approach, build entirely new output: Demo (Link)

Input:  32 15 2 17 19 26 41 17 17
Output: 2 15 17 17 17 19 26 32 41 (final sorted result)

23 of 38

Insertion Sort

General strategy:

  • Start with an empty output sequence.
  • Add each item from input, inserting into output at the right point.

For naive approach, if output sequence contains k items, worst cost to insert a single item is k.

  • Might need to move everything over.

More efficient method:

  • Do everything in place using swapping: Demo (Link)
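The in-place, swap-based version can be sketched as (illustrative Java):

```java
import java.util.Arrays;

public class InsertionSort {
    /** In-place insertion sort: swap each item leftward to its right point. */
    public static void sort(int[] a) {
        for (int i = 1; i < a.length; i += 1) {
            // Swap a[j] left while it is smaller than its left neighbor.
            // Each swap fixes exactly one inversion.
            for (int j = i; j > 0 && a[j] < a[j - 1]; j -= 1) {
                int tmp = a[j];
                a[j] = a[j - 1];
                a[j - 1] = tmp;
            }
        }
    }

    public static void main(String[] args) {
        int[] a = {32, 15, 2, 17, 19, 26, 41, 17, 17};
        sort(a);
        System.out.println(Arrays.toString(a));  // [2, 15, 17, 17, 17, 19, 26, 32, 41]
    }
}
```

Because every swap removes exactly one inversion, the total number of swaps equals the number of inversions in the input.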

24 of 38

In-place Insertion Sort

Two more examples.

7 swaps (P O T A T O):

P O T A T O (0 swaps)
O P T A T O (1 swap )
O P T A T O (0 swaps)
A O P T T O (3 swaps)
A O P T T O (0 swaps)
A O O P T T (3 swaps)

36 swaps (S O R T E X A M P L E):

S O R T E X A M P L E (0 swaps)
O S R T E X A M P L E (1 swap )
O R S T E X A M P L E (1 swap )
O R S T E X A M P L E (0 swaps)
E O R S T X A M P L E (4 swaps)
E O R S T X A M P L E (0 swaps)
A E O R S T X M P L E (6 swaps)
A E M O R S T X P L E (5 swaps)
A E M O P R S T X L E (4 swaps)
A E L M O P R S T X E (7 swaps)
A E E L M O P R S T X (8 swaps)

Purple: Element that we’re moving left (with swaps).

Black: Elements that got swapped with purple.

Grey: Not considered this iteration.

25 of 38

Insertion Sort Runtime: http://shoutkey.com/drink

What is the runtime of insertion sort?

  1. Ω(1), O(N)
  2. Ω(N), O(N)
  3. Ω(1), O(N²)
  4. Ω(N), O(N²)
  5. Ω(N²), O(N²)


26 of 38

Picking the Best Sort: http://shoutkey.com/kiwi

Suppose we do the following:

  • Read 1,000,000 integers from a file into an array of length 1,000,000.
  • Mergesort these integers.
  • Select one integer randomly and change it.
  • Sort using algorithm X of your choice.

Which sorting algorithm would be the fastest choice for X?

  1. Selection Sort
  2. Heapsort
  3. Mergesort
  4. Insertion Sort

27 of 38

(Expected stop here unless we have extra time)

28 of 38

To Be Continued

(slides that follow this point are for next time)

29 of 38

Observation: Insertion Sort on Almost Sorted Arrays

For arrays that are almost sorted, insertion sort does very little work.

  • Left array: 5 inversions, so only 5 swaps.
  • Right array: 3 inversions, so only 3 swaps.

30 of 38

Picking the Best Sort (Poll Everywhere)

Suppose we do the following:

  • Read 1,000,000 integers from a file into an array of length 1,000,000.
  • Mergesort these integers.
  • Select one integer randomly and change it.
  • Sort using algorithm X of your choice.
  • In the worst case, we have 999,999 inversions: Θ(N) inversions.

Which sorting algorithm would be the fastest choice for X? Worst case run-times:

  • Selection Sort: Θ(N²)
  • Heapsort: Θ(N log N)
  • Mergesort: Θ(N log N)
  • Insertion Sort: Θ(N)

31 of 38

Insertion Sort Sweet Spots

On arrays with a small number of inversions, insertion sort is extremely fast.

  • One exchange per inversion (and number of comparisons is similar). Runtime is Θ(N + K) where K is number of inversions.
  • Define an almost sorted array as one in which number of inversions ≤ cN for some c. Insertion sort is excellent on these arrays.

Less obvious: For small arrays (N < 15 or so), insertion sort is fastest.

  • More of an empirical fact than a theoretical one.
  • Theoretical analysis beyond scope of the course.
  • Rough idea: Divide and conquer algorithms like heapsort and mergesort spend too much time dividing, whereas insertion sort goes straight to the conquest.

32 of 38

Sorts So Far

Sort                       Best Case    Worst Case    Space   Notes
Selection Sort             Θ(N²)        Θ(N²)         Θ(1)
Heapsort (in place)        Θ(N)*        Θ(N log N)    Θ(1)    Bad cache (61C) performance.
Mergesort                  Θ(N log N)   Θ(N log N)    Θ(N)    Fastest of these.
Insertion Sort (in-place)  Θ(N)         Θ(N²)         Θ(1)    Best for small N or almost sorted.

*: An earlier version of this slide assumed that Heapsort was always given an array with no duplicate elements. If we omit this assumption, heapsort’s best case is Θ(N).

33 of 38

Shell’s Sort (Extra)

(Not on Exam)

34 of 38

Optimizing Insertion Sort: Shell’s Sort

Big idea: Fix multiple inversions at once.

  • Instead of comparing adjacent items, compare items that are one stride length h apart.
  • Start with large stride, and decrease towards 1.
  • Example: h = 7, 3, 1.

35 of 38

Optimizing Insertion Sort: Shell’s Sort

7-sorting (3 swaps):

S O R T E X A M P L E
M O R T E X A S P L E (1 swap, > 1 inversion)
M O R T E X A S P L E (0 swaps)
M O L T E X A S P R E (1 swap, > 1 inversion)
M O L E E X A S P R T (1 swap, > 1 inversion)

3-sorting (5 swaps):

M O L E E X A S P R T
E O L M E X A S P R T (1 swap, > 1 inversion)
E E L M O X A S P R T (1 swap, > 1 inversion)
E E L M O X A S P R T (0 swaps)
A E L E O X M S P R T (2 swaps, > 1 inversion)
A E L E O X M S P R T (0 swaps)
A E L E O P M S X R T (1 swap, > 1 inversion)
A E L E O P M S X R T (0 swaps)
A E L E O P M S X R T (0 swaps)

1-sorting (6 swaps):

A E L E O P M S X R T
A E E L O P M S X R T (1 swap)
A E E L M O P S X R T (2 swaps)
A E E L M O P R S X T (2 swaps)
A E E L M O P R S T X (1 swap)

Total swaps: 14 (vs. 36 for plain insertion sort)

h=1 is just insertion sort.

36 of 38

Shell’s Sort: Generalization and Performance

h=1 is just normal insertion sort.

  • Using large strides first fixes most of the inversions.

We used 7, 3, 1. Can generalize to strides of the form 2^k − 1, from some k down to 1.

  • Requires Θ(N^1.5) time in the worst case (see CS170).
  • Other stride patterns can be faster.
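The 2^k − 1 stride pattern can be sketched as follows (illustrative; note that (2^k − 1) / 2 = 2^(k−1) − 1 under integer division, which is what lets plain halving walk down the pattern):

```java
public class ShellSort {
    /** Shell's sort with strides ..., 15, 7, 3, 1 (the 2^k - 1 pattern). */
    public static void sort(int[] a) {
        // Largest stride of the form 2^k - 1 that is less than a.length.
        int h = 1;
        while (2 * h + 1 < a.length) {
            h = 2 * h + 1;
        }
        // Integer division steps h down through 2^k - 1, ..., 7, 3, 1;
        // the final pass with h == 1 is plain insertion sort.
        for (; h >= 1; h /= 2) {
            for (int i = h; i < a.length; i += 1) {
                // Compare items one stride length h apart, not adjacent ones.
                for (int j = i; j >= h && a[j] < a[j - h]; j -= h) {
                    int tmp = a[j];
                    a[j] = a[j - h];
                    a[j - h] = tmp;
                }
            }
        }
    }

    public static void main(String[] args) {
        int[] a = {'S', 'O', 'R', 'T', 'E', 'X', 'A', 'M', 'P', 'L', 'E'};
        sort(a);
        StringBuilder out = new StringBuilder();
        for (int c : a) {
            out.append((char) c);
        }
        System.out.println(out);  // AEELMOPRSTX
    }
}
```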

37 of 38

Sorts So Far

Sort                       Best Case    Worst Case          Space   Notes
Selection Sort             Θ(N²)        Θ(N²)               Θ(1)
Heapsort (in place)        Θ(N)*        Θ(N log N)          Θ(1)    Bad cache (61C) performance.
Mergesort                  Θ(N log N)   Θ(N log N)          Θ(N)    Fastest of these.
Insertion Sort (in-place)  Θ(N)         Θ(N²)               Θ(1)    Best for small N or almost sorted.
Shell’s Sort               Θ(N)         Ω(N log N), O(?)    Θ(1)    Rich theory!

*: An earlier version of this slide assumed that Heapsort was always given an array with no duplicate elements. If we omit this assumption, heapsort’s best case is Θ(N).

38 of 38

Citations

Lego man: http://thetechnicgear.com/wp-content/uploads/2014/02/sorting-lego.jpg

Keynote Animations of Algorithms courtesy of Kevin Wayne