1 of 120

Divide and Conquer II

Summer 2018 • Lecture 07/03


  • Homework 0 solutions

  • Homework 1

    • hw1.zip is live!

    • It’s due next Tuesday 7/10, but start early!

    • You’ve learned most of the required material.

  • Tutorial 2

    • Friday, 7/6 3:30-4:50 p.m. in STLC 115.

    • RSVP, so I can print enough copies for everyone: https://goo.gl/forms/MSGUGEVBnXaS21kR2 (requires Stanford email).

Announcements


  • Algorithmic Analysis

  • Divide and Conquer

  • Randomized Algorithms

  • Tree Algorithms

  • Graph Algorithms

  • Dynamic Programming

  • Greedy Algorithms

  • Advanced Algorithms

Course Overview


  • Divide and Conquer II

    • Substitution method

    • Linear-time selection

      • Proving correctness

      • Proving runtime with recurrence relations

    • Problems: selection

    • Algorithms: Select

    • Reading: CLRS 9

Today’s Outline

So far...

  • Iterative algorithms (e.g. insertion sort)

    • Proving correctness: induction on the iteration, leveraging a loop invariant.

    • Proving runtime: intuition.

  • Recursive algorithms (e.g. mergesort)

    • Proving correctness: induction on the input size.

    • Proving runtime: defining and solving recurrence relations.



    • We analyzed divide-and-conquer algorithms by defining and solving recurrence relations.

    • After deriving the recurrence relation, we learned several methods to find the closed-form runtime expression: the recursion-tree method, the iteration method, and the Master method.

    • Today, I owe you another method: the substitution method!

So far...


Substitution Method

  1. Guess what the answer is.

  2. Formally prove that’s what the answer is.

  • Let’s try it out with an example recurrence from last time:

    • T(1) ≤ 1

    • T(n) ≤ 2T(n/2) + n

  • Guess what the answer is.

    • Try unwinding it...

      T(n) = 2T(n/2) + n
           = 2(2T(n/4) + n/2) + n
           = 4T(n/4) + 2n
           = 4(2T(n/8) + n/4) + 2n
           = 8T(n/8) + 3n
           ...

    • Following the pattern, after log(n) levels:

      T(n) = nT(1) + n log(n) ≤ n(log(n) + 1)

  • Formally prove that’s what the answer is.

    • Inductive hypothesis T(k) ≤ k(log(k) + 1) for all 1 ≤ k < n.

    • Base case T(1) ≤ 1 = 1(log(1) + 1).

    • Inductive step Substitute n/2 into the inductive hypothesis:

      T(n) ≤ 2T(n/2) + n
           ≤ 2((n/2)(log(n/2) + 1)) + n
           = 2((n/2)(log(n) - 1 + 1)) + n
           = 2((n/2) log(n)) + n
           = n(log(n) + 1)

    • Conclusion By induction, T(n) ≤ n(log(n) + 1) for all n ≥ 1, i.e. T(n) = O(n log n).

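As a quick sanity check on the guess (an addition of mine, not from the lecture), we can evaluate the recurrence numerically, restricting to powers of two so n/2 stays integral, and compare against n(log(n) + 1):

```python
import math

def T(n):
    # The recurrence above, taken with equality: T(1) = 1, T(n) = 2T(n/2) + n.
    if n == 1:
        return 1
    return 2 * T(n // 2) + n

# For powers of two the guess is exact: T(n) = n(log2(n) + 1).
for n in [1, 2, 4, 8, 1024]:
    assert T(n) == n * (int(math.log2(n)) + 1)
```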

  • Let’s try it out with a new recurrence:

    • T(n) = 10n when 1 ≤ n ≤ 10

    • T(n) = 3n + T(n/5) + T(n/2) otherwise


  • Guess what the answer is.

    • Try unwinding it... [Whiteboard] It gets ugly fast.

    • Try plotting it... The plot looks linear, so guess T(n) = O(n).


  • Formally prove that’s what the answer is.

  • Inductive hypothesis T(k) ≤ Ck for all 1 ≤ k < n. (C is some constant we’ll have to fill in later!)

  • Base case T(k) ≤ Ck for all k ≤ 10. (C must be ≥ 10, since the recurrence states T(k) = 10k when 1 ≤ k ≤ 10.)

  • Inductive step

    T(n) = 3n + T(n/5) + T(n/2)
         ≤ 3n + C(n/5) + C(n/2)
         = 3n + (C/5)n + (C/2)n
         = (3 + 7C/10)n
         ≤ Cn

    Solve for C to satisfy the last inequality: we need 3 ≤ (3/10)C, i.e. C ≥ 10, so C = 10 works.

  • Conclusion There exists some C (e.g. C = 10) such that for all n ≥ 1, T(n) ≤ Cn. Therefore, T(n) = O(n).


  • Formally prove that’s what the answer is, pretending we knew C = 10 all along.

  • Inductive hypothesis T(k) ≤ 10k for all 1 ≤ k < n.

  • Base case T(k) ≤ 10k for all k ≤ 10.

  • Inductive step

    T(n) = 3n + T(n/5) + T(n/2)
         ≤ 3n + 10(n/5) + 10(n/2)
         = 3n + 2n + 5n
         = 10n

  • Conclusion For all n ≥ 1, T(n) ≤ 10n. Therefore, T(n) = O(n).
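As with the first recurrence, we can spot-check the bound numerically (an addition of mine; floor division stands in for n/5 and n/2 so the argument stays integral):

```python
def T(n):
    # Recurrence from the slides: T(n) = 10n for 1 <= n <= 10,
    # T(n) = 3n + T(n/5) + T(n/2) otherwise.
    if n <= 10:
        return 10 * n
    return 3 * n + T(n // 5) + T(n // 2)

# Check the proven bound T(n) <= 10n on a range of inputs.
for n in range(1, 2000):
    assert T(n) <= 10 * n
```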


  • Guess what the answer is.

  • Formally prove that’s what the answer is.

    • Might need to leave some constants unspecified until the end and see what they need to be for the proof to work.


  • Divide and Conquer II

    • Substitution method Done!

    • Linear-time selection

      • Proving correctness

      • Proving runtime with recurrence relations

    • Problems: selection

    • Algorithms: Select

    • Reading: CLRS 9

Today’s Outline


Linear-Time Selection


  • Task Find the kth smallest element in an unsorted list in O(n)-time.

    • Such an algorithm could find the min in O(n)-time if k=0 or the max if k=n-1.

    • Such an algorithm could find the median in O(n)-time if k=⌈n/2⌉-1 (this definition makes the median of an even-length list an element of the list, rather than the average of two elements).

  [1, 64, 9, 49, 16, 4, 0, 25, 36, 81]


  • Finding the min and max Iterate through the list and keep track of the smallest and largest elements. Runtime O(n).

  • Finding the kth smallest element (naive) Sort the list and return the element at index k of the sorted list.

  Unsorted: [1, 64, 9, 49, 16, 4, 0, 25, 36, 81]

  Sorted:   [0, 1, 4, 9, 16, 25, 36, 49, 64, 81]   (k=3 → 9)


Not Quite Linear-Time Selection

def naive_select(A, k):
    A = mergesort(A)
    return A[k]

Worst-case runtime Θ(n log(n))


  • Key Insight Select a pivot, partition around it, and recurse.

    • Suppose we want to find element k=3 in [1, 64, 9, 49, 16, 4, 0, 25, 36, 81].

    • Select a pivot at random (for now): say, 36. Partition around the pivot, such that all elements to the left are less than it and all elements to the right are greater than it: [1, 9, 16, 4, 0, 25] 36 [64, 49, 81]. (Notice that the halves remain unsorted.)

    • Find element k=3 in the left half [1, 9, 16, 4, 0, 25], since 36 occupies index 6 and k=3 < 6.

    • Select another pivot at random (for now): say, 1. Partition around the pivot: [0] 1 [9, 16, 4, 25]. Find element k=3-(1+1)=1 in the right half [9, 16, 4, 25], since 1 occupies index 1 and k=3 > 1.

    • Select another pivot at random (for now): say, 9. Partition around the pivot: [4] 9 [16, 25]. Exactly k=1 element lies to its left, so the pivot 9 is the answer. We found the element!

def select(A, k, c=100):
    if len(A) <= c:
        return naive_select(A, k)
    pivot = random.choice(A)
    left, right = partition_about_pivot(A, pivot)
    if len(left) == k:
        # The pivot is the kth smallest element!
        return pivot
    elif len(left) > k:
        # The kth smallest element is left of the pivot
        return select(left, k, c)
    else:
        # The kth smallest element is right of the pivot
        return select(right, k - len(left) - 1, c)
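Putting the pieces together, here is a runnable version of the deck’s select, checked on the walkthrough example. (sorted() stands in for the mergesort-based naive_select, and c is set low to exercise the recursion; both are my choices, not the slides’.)

```python
import random

def naive_select(A, k):
    # Stand-in for the deck's mergesort-based naive_select.
    return sorted(A)[k]

def partition_about_pivot(A, pivot):
    # Elements strictly less than the pivot go left, strictly greater go right.
    left, right = [], []
    for x in A:
        if x == pivot:
            continue
        elif x < pivot:
            left.append(x)
        else:
            right.append(x)
    return left, right

def select(A, k, c=100):
    if len(A) <= c:
        return naive_select(A, k)
    pivot = random.choice(A)
    left, right = partition_about_pivot(A, pivot)
    if len(left) == k:
        return pivot                              # pivot is the kth smallest
    elif len(left) > k:
        return select(left, k, c)                 # answer is left of the pivot
    else:
        return select(right, k - len(left) - 1, c)  # answer is right of the pivot

A = [1, 64, 9, 49, 16, 4, 0, 25, 36, 81]
assert select(A, 3, c=1) == 9    # the walkthrough's answer
assert select(A, 0, c=1) == 0    # the min
assert select(A, 9, c=1) == 81   # the max
```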

“Worst-case” runtime Θ(n²)

We’ll discuss this runtime later... Note: this is different from the “worst-case” we saw for insertion sort (we’ll revisit during Randomized Algs).

def partition_about_pivot(A, pivot):
    left, right = [], []
    for i in range(len(A)):
        if A[i] == pivot:
            continue
        elif A[i] < pivot:
            left.append(A[i])
        else:
            right.append(A[i])
    return left, right
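A quick check of the partition step on the walkthrough’s array (the expected halves match the slides’ example, with 36 as the pivot):

```python
def partition_about_pivot(A, pivot):
    # Split A into elements strictly less than and strictly greater than
    # the pivot; copies of the pivot itself are dropped.
    left, right = [], []
    for x in A:
        if x == pivot:
            continue
        elif x < pivot:
            left.append(x)
        else:
            right.append(x)
    return left, right

left, right = partition_about_pivot([1, 64, 9, 49, 16, 4, 0, 25, 36, 81], 36)
assert left == [1, 9, 16, 4, 0, 25]
assert right == [64, 49, 81]
```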

Worst-case runtime Θ(n)


  • Intuition Partition the list about a pivot selected at random; either return the pivot itself, or recurse on the left or right sublist (but not both).

  • You might have two questions at this point…

  1. Does this actually work?

  2. Is it fast?


  1. Does this actually work? We’ve already seen an example!

    • Formally, similar to last time, we proceed by induction, inducting on the length of the input list.


Proving Correctness

def proof_of_correctness_helper(algorithm):
    if algorithm.type == "iterative":
        # 1) Find the loop invariant
        # 2) Define the inductive hypothesis
        #    (internal state at iteration i)
        # 3) Prove the base case (i=0)
        # 4) Prove the inductive step (i => i+1)
        # 5) Prove the conclusion (i=n => correct)
        pass  # TODO
    elif algorithm.type == "recursive":
        # 1) Define the inductive hypothesis
        #    (correct for inputs of sizes 1 to i)
        # 2) Prove the base case (i < small constant)
        # 3) Prove the inductive step (i => i+1 OR {1,2,...,i} => i+1)
        # 4) Prove the conclusion (i=n => correct)
        pass  # TODO


  • Recall, there are four components in a proof by induction.

    • Inductive Hypothesis The algorithm works on input lists of length 1 to i.

    • Base case The algorithm works on input lists of length 1.

    • Inductive step If the algorithm works on input lists of length 1 to i, then it works on input lists of length i+1.

    • Conclusion The inductive hypothesis holds for all i; in particular, taking i = n shows the algorithm works on the entire input list.


  • Formally, for select...

    • Inductive Hypothesis select(A,k) correctly finds the kth-smallest element for inputs of length 1 to i.

    • Base case select(A,k) correctly finds the smallest element for inputs of length 1; it returns the element itself which is trivially the smallest.

    • Inductive step Suppose the algorithm works on input lists of length 1 to i. Calling select(A,k) on an input list of length i+1 selects a pivot, partitions around it, and compares the length of the left list to k. There are three cases:

      • len(left) == k: Exactly k items are less than the pivot, so return the pivot.

      • len(left) > k: More than k items are less than the pivot, so return the kth-smallest element of the left half of the list.

      • len(left) < k: Fewer than k items are less than or equal to the pivot, so return the (k - len(left) - 1)st-smallest element of the right half of the list.

    • Conclusion The inductive hypothesis holds for all i. In particular, given an input list of any length n, select(A,k) correctly finds the kth-smallest element!


  • Divide and Conquer II

    • Substitution method Done!

    • Linear-time selection

      • Proving correctness Done!

      • Proving runtime with recurrence relations

    • Problems: selection

    • Algorithms: Select

    • Reading: CLRS 9

Today’s Outline

Analyzing Runtime

  • Writing a recurrence relation for select gives:

    T(n) = { O(n)                  if len(left) == k
           { T(len(left)) + O(n)   if len(left) > k
           { T(len(right)) + O(n)  if len(left) < k

    • In each case, the O(n) term is the runtime to partition about the chosen pivot, and the T(·) term is the runtime for the recursive call to select.

    • len(left) and len(right) depend on how we pick the pivot!

Analyzing Runtime in an Ideal World

  • len(left) and len(right) determine the runtime of the recursive calls to select.

    • In an ideal world, we split the input exactly in half, such that: len(left) = len(right) = (n-1)/2.

    • Then we could use the Master Theorem! Suppose T(n) = a⋅T(n/b) + O(n^d). The Master method states:

      T(n) = { O(n^d log n)     if a = b^d
             { O(n^d)           if a < b^d
             { O(n^(log_b a))   if a > b^d

      • What’s the recurrence? T(n) ≤ T(n/2) + O(n)

      • Then, a = 1, b = 2, d = 1 (Case 2: a < b^d)

      • T(n) ≤ O(n^d) = O(n)

  • len(left) and len(right) determine the runtime of the recursive calls to select.

    • If we get super unlucky, we split the input such that len(left) = n - 1 and len(right) = 0, or vice versa.

    • Then it would be a lot slower:

      • T(n) ≤ T(n-1) + O(n)

      • That gives O(n) levels of recursion with O(n) work per level.

      • T(n) ≤ O(n²)
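The arithmetic behind that bound (an addition of mine): the unlucky recurrence unwinds to the sum 1 + 2 + ... + n, which is n(n+1)/2 = Θ(n²).

```python
def T(n):
    # Unlucky-split recurrence: T(n) <= T(n-1) + n, with T(1) = 1,
    # written iteratively to avoid Python's recursion limit.
    total = 0
    for m in range(1, n + 1):
        total += m
    return total

# The sum 1 + 2 + ... + n equals n(n+1)/2, which grows quadratically.
assert T(1000) == 1000 * 1001 // 2
```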



  • Recall pivot = random.choice(A), i.e. we chose the pivot at random.

    • It’s possible to get unlucky, thus leading to a runtime of Θ(n²).

    • We’ll formalize this unluckiness when we study Randomized Algs.

  • How might we pick a better pivot?

    • After all, it’s called linear-time selection, which implies Θ(n)-time.


  • Recall in an ideal world, we split the input exactly in half, such that: len(left) = len(right) = (n-1)/2.

  • Key Insight The ideal world requires us to pick the pivot that divides the input list in half, a.k.a. the median, a.k.a. select(A, k=⌈n/2⌉-1).

  • To approximate the ideal world, the linear-time select algorithm picks the pivot that divides the input list approximately in half aka close to the median.


Analyzing Runtime in a Reasonable World

  • len(left) and len(right) determine the runtime of the recursive calls to select.

    • In a reasonable world, we split the input roughly in half, such that: 3n/10 < len(left), len(right) < 7n/10.

    • Once again, we could use the Master Theorem!

      • What’s the recurrence? T(n) ≤ T(7n/10) + O(n)

      • Then, a = 1, b = 10/7, d = 1 (Case 2: a < b^d)

      • T(n) ≤ O(n^d) = O(n)
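We can sanity-check the reasonable-world bound numerically (an addition of mine; the constant on the O(n) term is taken to be 1, and 7n/10 is rounded down):

```python
def T(n):
    # Upper-bound recurrence from the slides: T(n) <= T(7n/10) + n.
    if n <= 1:
        return 1
    return T(7 * n // 10) + n

# Geometric-series intuition: n(1 + 7/10 + (7/10)^2 + ...) = (10/3)n,
# so T(n) should stay within a small constant multiple of n.
for n in [10**3, 10**4, 10**6]:
    assert T(n) <= (10 / 3) * n + 10  # small slack for the base case
```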

  • The goal is to pick a pivot in the example list [1, 9, 16, 4, 0, 25, 36, 64, 49, 81] such that 3n/10 < len(left) < 7n/10 and 3n/10 < len(right) < 7n/10.


  • We can’t solve select(A,n/2) (yet).

  • But we can solve select(B,m/2) for len(B) = m < n.

  • How does having an algorithm that can find the median of smaller lists help us?

Another Divide and Conquer Algorithm

84 of 120

  • We can’t solve select(A,n/2) (yet).

  • But we can solve select(B,m/2) for len(B) = m < n.

  • How does having an algorithm that can find the median of smaller lists help us?

Another Divide and Conquer Algorithm

Pro tip: making the inductive hypothesis i.e. assuming correctness of the algorithm on smaller inputs is a helpful technique for designing divide and conquer algorithms.

85 of 120

  • We can’t solve select(A,n/2) (yet).

  • But we can solve select(B,m/2) for len(B) = m < n.

  • How does having an algorithm that can find the median of smaller lists help us? It can help us pick a pivot that’s close to the median.

Another Divide and Conquer Algorithm

Pro tip: making the inductive hypothesis, i.e., assuming the algorithm is correct on smaller inputs, is a helpful technique for designing divide-and-conquer algorithms.

86 of 120


  • Goal: Use an algorithm that can find the median of smaller lists to help pick a pivot that’s close to the median of the original list.

    • Divide the original list into small groups.

    • Find the sub-median of each small group.

    • Find the median of all of the sub-medians.

Another Divide and Conquer Algorithm

[figure: the list split into twelve bracketed groups, the sub-median of each group marked; the median of the sub-medians sits close to the median of the whole list]

91 of 120


Another Divide and Conquer Algorithm

Divide A into g = ⌈n/5⌉ groups of at most 5 elements.

[figure: the array 2, 11, 9, 3, 13, 5, 16, 4, 6, 12, 17, 23, 10, 7, 21, 8, 18, 15, 1, 20, 22, 19, 14 split into g = ⌈n/5⌉ groups of at most 5 elements each]

Find the sub-median of each of the groups (yellow) and recursively call select to find the median of these sub-medians (pink).
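The grouping step can be sketched in Python. A sketch under the slide's naming: the real algorithm finds the median of the sub-medians with a recursive call to select; here that final step is done by sorting so the snippet is self-contained.

```python
def median_of_medians(A):
    # Divide A into g = ceil(n/5) groups of at most 5 elements.
    groups = [A[i:i + 5] for i in range(0, len(A), 5)]
    # Sub-median of each group: sorting <= 5 elements is constant work.
    sub_medians = [sorted(g)[len(g) // 2] for g in groups]
    # The real algorithm recursively calls select here; sorting the much
    # shorter list of sub-medians keeps this sketch self-contained.
    return sorted(sub_medians)[len(sub_medians) // 2]

# The 23-element example from the slides: the sub-medians are
# 9, 6, 17, 15, 19, and their median is 15.
A = [2, 11, 9, 3, 13, 5, 16, 4, 6, 12, 17, 23,
     10, 7, 21, 8, 18, 15, 1, 20, 22, 19, 14]
assert median_of_medians(A) == 15
```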

94 of 120

Linear-Time Selection

def select(A, k, c=100):
    if len(A) <= c:
        return naive_select(A, k)
    pivot = random.choice(A)
    left, right = partition_about_pivot(A, pivot)
    if len(left) == k:
        # The pivot is the kth smallest element!
        return pivot
    elif len(left) > k:
        # The kth smallest element is left of the pivot
        return select(left, k, c)
    else:
        # The kth smallest element is right of the pivot
        return select(right, k - len(left) - 1, c)

“Worst-case” runtime Θ(n²)
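select leans on two helpers the slides leave implicit. Plausible sketches (these exact implementations are an assumption; the slides only name the functions, and the list is assumed to hold distinct elements):

```python
def naive_select(A, k):
    # On small inputs, just sort and index: the kth smallest, 0-indexed.
    return sorted(A)[k]

def partition_about_pivot(A, pivot):
    # Elements strictly smaller than the pivot go left, strictly larger
    # go right; the pivot itself is dropped (distinct elements assumed).
    left = [x for x in A if x < pivot]
    right = [x for x in A if x > pivot]
    return left, right

assert naive_select([3, 1, 2], 1) == 2
assert partition_about_pivot([3, 1, 4, 2], 2) == ([1], [3, 4])
```

With these definitions, len(left) counts the elements smaller than the pivot, which is exactly the pivot's 0-indexed rank; that is why len(left) == k means the pivot is the answer.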

95 of 120

Linear-Time Selection

def select(A, k, c=100):
    if len(A) <= c:
        return naive_select(A, k)
    pivot = median_of_medians(A)   # previously: random.choice(A)
    left, right = partition_about_pivot(A, pivot)
    if len(left) == k:
        # The pivot is the kth smallest element!
        return pivot
    elif len(left) > k:
        # The kth smallest element is left of the pivot
        return select(left, k, c)
    else:
        # The kth smallest element is right of the pivot
        return select(right, k - len(left) - 1, c)

96 of 120

  • Clearly, the median of medians (15) is not necessarily the actual median (12), but we claim that it’s guaranteed to be pretty close.

Analyzing Runtime

[figure: the 23-element example array; its median of medians is 15, while its actual median is 12]

97 of 120


  • To see why, partition elements within each of the groups around the group’s median, and partition the groups around the group with the median of medians.

    • At least how many elements are guaranteed to be smaller than the median of medians? At least these (1, 2, 3, 4, 5, 6, 8, 9). There might be more (7, 11, 12, 13, 14), but we are guaranteed that at least these will be smaller.

Analyzing Runtime

[figure: each group partitioned around its own median, and the groups arranged around the group containing the median of medians (15); the elements 1, 2, 3, 4, 5, 6, 8, 9 are guaranteed smaller]

3 elements from each small group with a median smaller than the median of medians.

2 elements from the small group containing the median of medians.

100 of 120


  • As a function of n (the size of the original list), how many elements are guaranteed to be smaller than the median of medians?

    • Let g = ⌈n/5⌉ represent the number of groups.

    • At least 3⋅(⌈g/2⌉ - 1 - 1) + 2 elements.

Analyzing Runtime

[figure: the partitioned groups again, with the guaranteed-smaller elements highlighted]

3 elements from each small group with a median smaller than the median of medians.

2 elements from the small group containing the median of medians.

To exclude the list with the median of medians.

To exclude the list with the leftovers.

102 of 120


  • If at least 3⋅(⌈g/2⌉ - 2) + 2 elements are guaranteed to be smaller than the median of medians, at most how many elements are larger than the median of medians?

    • At most n - 1 - (3⋅(⌈g/2⌉ - 2) + 2) ≤ 7n/10 + 3 elements.

Analyzing Runtime

[figure: the partitioned groups; everything except the pivot and the guaranteed-smaller elements may be larger]

At most everything besides these elements

n-1 is for all of the elements except for the median of medians.
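A quick numeric check of both counting bounds (taking g = ⌈n/5⌉ as above):

```python
import math

# For every n, at least 3*(ceil(g/2) - 2) + 2 elements are guaranteed
# smaller than the median of medians, and that guarantee implies both
# the 3n/10 - 4 lower bound and the 7n/10 + 3 upper bound.
for n in range(1, 2001):
    g = math.ceil(n / 5)
    guaranteed_smaller = 3 * (math.ceil(g / 2) - 2) + 2
    assert guaranteed_smaller >= 3 * n / 10 - 4
    # Everything except the pivot and the guaranteed-smaller elements:
    assert n - 1 - guaranteed_smaller <= 7 * n / 10 + 3
```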

104 of 120


  • We just showed that …

3n/10 - 4 ≤ len(left) ≤ 7n/10 + 3

3n/10 - 4 ≤ len(right) ≤ 7n/10 + 3

  • The symmetric argument, counting the elements guaranteed to be larger, gives the matching bounds.

Analyzing Runtime

median_of_medians will choose a pivot greater than at least

3⋅(⌈g/2⌉ - 2) + 2 ≥ 3n/10 - 4 elements.

median_of_medians will choose a pivot less than at most 7n/10 + 3 elements.
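These bounds can be sanity-checked empirically. The snippet below redefines the grouping helper locally so it runs on its own (a sketch: it finds the median of the sub-medians by sorting rather than a recursive select call, and assumes distinct elements):

```python
import random

def median_of_medians(A):
    groups = [A[i:i + 5] for i in range(0, len(A), 5)]
    sub_medians = [sorted(g)[len(g) // 2] for g in groups]
    return sorted(sub_medians)[len(sub_medians) // 2]

# For random distinct inputs, both sides of the partition stay within
# [3n/10 - 4, 7n/10 + 3], as the counting argument guarantees.
for trial in range(200):
    n = random.randrange(10, 500)
    A = random.sample(range(10 * n), n)
    pivot = median_of_medians(A)
    smaller = sum(1 for x in A if x < pivot)
    larger = n - 1 - smaller
    for side in (smaller, larger):
        assert 3 * n / 10 - 4 <= side <= 7 * n / 10 + 3
```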

106 of 120



  • What’s the recurrence relation?

    • T(n) = n log(n) when n ≤ 100
    • T(n) ≤ T(n/5) + T(7n/10) + O(n)

    • We can’t use Master Theorem!

    • We use substitution method!

Analyzing Runtime

def select(A, k, c=100):
    if len(A) <= c:
        return naive_select(A, k)
    pivot = median_of_medians(A)
    left, right = partition_about_pivot(A, pivot)
    if len(left) == k:   return pivot
    elif len(left) > k:  return select(left, k, c)
    else:                return select(right, k - len(left) - 1, c)

114 of 120

  • Guess what the answer is.

  • Linear-time select
  • Comparing to the mergesort recurrence, less than n log(n)

Substitution Method

T(n) = n log(n) when n ≤ 100

T(n) ≤ T(n/5) + T(7n/10) + O(n)

Guess O(n)
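One way to build confidence in the guess before proving it: evaluate the recurrence numerically and check that T(n) stays within a constant factor of n (a sketch; it takes the O(n) term to be exactly n, i.e. d = 1, and uses n·⌈log₂ n⌉ as a stand-in for the n log(n) base case):

```python
from functools import lru_cache

@lru_cache(maxsize=None)
def T(n):
    if n <= 100:
        return n * n.bit_length()  # stand-in for n log(n) on small inputs
    # T(n) <= T(n/5) + T(7n/10) + n, with the O(n) term taken as exactly n.
    return T(n // 5) + T(7 * n // 10) + n

# T(n)/n stays bounded by a constant, consistent with T(n) = O(n);
# with d = 1 the proof's constant is C = max{7, 10d} = 10.
for n in (1_000, 10_000, 100_000):
    assert T(n) <= 10 * n
```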

115 of 120

  • Formally prove that’s what the answer is.

  • Inductive hypothesis T(k) ≤ Ck for all 1 ≤ k < n.

  • Base case T(k) ≤ Ck for all k ≤ 100.

  • Inductive step

    • T(n) ≤ T(n/5) + T(7n/10) + dn
        ≤ C(n/5) + C(7n/10) + dn
        = (C/5)n + (7C/10)n + dn = (9C/10 + d)n
        ≤ Cn

  • Conclusion Taking C = max{7, 10d}, we have T(n) ≤ Cn for all n ≥ 1. Therefore, T(n) = O(n).

Substitution Method

C is some constant we’ll have to fill in later!

C must be ≥ log(n) for n ≤ 100, so C ≥ 7.

Solve for C to satisfy the inequality. C ≥ 10d works.

T(n) = n log(n) when n ≤ 100

T(n) ≤ T(n/5) + T(7n/10) + O(n)

116 of 120

  • Formally prove that’s what the answer is.

  • Inductive hypothesis T(k) ≤ max{7, 10d}k for all 1 ≤ k < n.

  • Base case T(k) ≤ max{7, 10d}k for all k ≤ 100.

  • Inductive step

    • T(n) ≤ T(n/5) + T(7n/10) + dn
        ≤ max{7, 10d}(n/5) + max{7, 10d}(7n/10) + dn
        = (9/10)max{7, 10d}n + dn
        ≤ max{7, 10d}n

  • Conclusion With C = max{7, 10d}, T(n) ≤ max{7, 10d}n for all n ≥ 1. Therefore, T(n) = O(n).

Substitution Method

T(n) = n log(n) when n ≤ 100

T(n) ≤ T(n/5) + T(7n/10) + O(n)

117 of 120

  • Guess what the answer is.

  • Formally prove that’s what the answer is.

    • Might need to leave some constants unspecified until the end and see what they need to be for the proof to work.

Substitution Method

118 of 120

  • Divide and Conquer II

    • Substitution method Done!

    • Linear-time selection

      • Proving correctness Done!

      • Proving runtime with recurrence relations Done!

    • Problems: selection

    • Algorithms: Select

    • Reading: CLRS 9

Today’s Outline

119 of 120

Linear-Time Selection

def select(A, k, c=100):
    if len(A) <= c:
        return naive_select(A, k)
    pivot = median_of_medians(A)   # previously: random.choice(A)
    left, right = partition_about_pivot(A, pivot)
    if len(left) == k:
        # The pivot is the kth smallest element!
        return pivot
    elif len(left) > k:
        # The kth smallest element is left of the pivot
        return select(left, k, c)
    else:
        # The kth smallest element is right of the pivot
        return select(right, k - len(left) - 1, c)

Worst-case runtime Θ(n)

120 of 120


Note: back to talking about the same worst-case we saw for insertion sort (we’ll revisit during Randomized Algs).
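Putting the pieces together, a self-contained sketch of the full algorithm. The helper implementations are assumptions based on the slides' descriptions (median_of_medians finds the median of the sub-medians via a recursive call to select, and elements are assumed distinct):

```python
import random

def naive_select(A, k):
    # Small inputs: sort and take the kth smallest (0-indexed).
    return sorted(A)[k]

def partition_about_pivot(A, pivot):
    # Distinct elements assumed: smaller go left, larger go right.
    return [x for x in A if x < pivot], [x for x in A if x > pivot]

def median_of_medians(A):
    groups = [A[i:i + 5] for i in range(0, len(A), 5)]
    sub_medians = [sorted(g)[len(g) // 2] for g in groups]
    # Recursively call select to find the median of the sub-medians.
    return select(sub_medians, len(sub_medians) // 2)

def select(A, k, c=100):
    if len(A) <= c:
        return naive_select(A, k)
    pivot = median_of_medians(A)
    left, right = partition_about_pivot(A, pivot)
    if len(left) == k:
        return pivot
    elif len(left) > k:
        return select(left, k, c)
    else:
        return select(right, k - len(left) - 1, c)

A = random.sample(range(100_000), 5_000)
assert all(select(A, k) == sorted(A)[k] for k in (0, 2_499, 4_999))
```

Each recursive call shrinks the input by a constant factor on both branches, which is exactly what the T(n/5) + T(7n/10) + O(n) recurrence captures.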