2 of 85

Anonymous functions

Often, you want to use a function without naming it; that is, create a λ-expression. In Haskell, use \ and -> like so.�

ghci> (\x -> x + 6) 10

ghci> (\x y -> y + x^3) 9 1000

1729

3 of 85

Multi-parameter lambdas

In the λ-calculus, one typically allows the shorthand:

λx. (λy. term) <==> λx y. term

Haskell also honors this in anonymous functions.

\x -> (\y -> <expr>) <==> \x y -> <expr>

\x -> (\y -> (\z -> <expr>)) <==> \x y z -> <expr>

4 of 85

Let’s talk about types.

You’ve realized by now that sometimes, the type signature is more general than what you expected.

ghci> :t 17

17 :: Num a => a

ghci> :t (\x -> x + 17)

(\x -> x + 17) :: Num a => a -> a

5 of 85

Let’s talk about types (2)

You’ve realized by now that sometimes, the type signature is more general than what you expected.

This is most often encountered at ghci. In source files, you may have already specified types.

-- Foo.hs

add17 :: Int -> Int

add17 = \x -> x + 17

6 of 85

17 :: (Num a) => a

What does 17 :: (Num a) => a mean?

Remember that types live at the expression (not value) level. The literal 17 might refer to an Int, a Double, or a Word8 (byte).

The compiler can figure out that it lives in some Num type, but not which one without more context.

7 of 85

:t (\x -> x)

Let’s look at the type for the identity function, id, defined as id x = x.

ghci> :t id

id :: a -> a

How do we know that id (5 :: Int) is well-typed? How about id id? Is that well-typed?

8 of 85

id (5 :: Int) is well typed.

The type variable a unifies with any concrete type. In this case, a -> a is shorthand for:

forall a. a -> a

meaning that id can be specialized to

id :: Int -> Int by letting a = Int.

9 of 85

Exercise: is id id well-typed?

id id :: ??

10 of 85

id id is well-typed.

Treat the two instances of id as separate:

id₂ id₁ :: ??

id₂ :: b -> b id₁ :: a -> a

The renaming is necessary because they’re different type variables. With b = a -> a, we get a well-typed expression. (id₂ :: (a -> a) -> (a -> a) so id₂ id₁ :: a -> a).

11 of 85

A simple matter of typing...

Type checking is solving a system of equations based on context clues.

:t [Left 'z', Right False]

12 of 85

A simple matter of typing...

Type checking is solving a system of equations based on context clues.

:t [Left 'z', Right False]

[a] for some a (list literal)

a = Either Char x for some x (Left 'z')

a = Either y Bool for some y (Right False)

Solution exists: x = Bool, y = Char.

13 of 85

A simple matter of typing...

Type checking is solving a system of equations based on context clues.

:t [Left 'z', Right False]

[a] for some a (list literal)

a = Either Char x for some x (Left 'z')

a = Either y Bool for some y (Right False)

Solution exists: x = Bool, y = Char.

[Either Char Bool] is the answer.

14 of 85

A simple matter of typing… (2)

:t [Left 'z', Left False]

????

15 of 85

A simple matter of typing… (2)

:t [Left 'z', Left False]

[a] for some a (list literal)

a = Either Char x for some x (Left 'z')

a = Either Bool y for some y (Left False)

No solution exists! Char ≠ Bool, so it’s ill-typed.

16 of 85

Function type quiz (1)

λ :t \x -> if x then Left "eye" else Right "ear"

????

λ :t \x -> (x, x)

????

λ :t \x -> "fail"

????

17 of 85

Function type quiz (1)

λ :t \x -> if x then Left "eye" else Right "ear"

Bool -> Either String String

λ :t \x -> (x, x)

????

λ :t \x -> "fail"

????

18 of 85

Function type quiz (1)

λ :t \x -> if x then Left "eye" else Right "ear"

Bool -> Either String String

λ :t \x -> (x, x)

a -> (a, a)

λ :t \x -> "fail"

????

19 of 85

Function type quiz (1)

λ :t \x -> if x then Left "eye" else Right "ear"

Bool -> Either String String

λ :t \x -> (x, x)

a -> (a, a)

λ :t \x -> "fail"

a -> String

20 of 85

Function type quiz (2)

λ :t \x -> x + 1

????

λ :t \x -> case x of {[] -> True; _ -> False}

????

λ :t \x -> length x + 1

????

21 of 85

Function type quiz (2)

λ :t \x -> x + 1

(Num a) => a -> a

λ :t \x -> case x of {[] -> True; _ -> False}

????

λ :t \x -> length x + 1

????

22 of 85

Function type quiz (2)

λ :t \x -> x + 1

(Num a) => a -> a

λ :t \x -> case x of {[] -> True; _ -> False}

[a] -> Bool

λ :t \x -> length x + 1

????

23 of 85

Function type quiz (2)

λ :t \x -> x + 1

(Num a) => a -> a

λ :t \x -> case x of {[] -> True; _ -> False}

[a] -> Bool

λ :t \x -> length x + 1

[a] -> Int

24 of 85

Function type quiz (3)

-- Add17.hs

-- With which type signatures will the code complile?

add17 = \x -> x + 17

(A) add17 :: Int -> Int

(B) add17 :: Double -> Double

(D) add17 :: (Num a) => a -> a

25 of 85

Function type quiz (3)

-- Add17.hs

-- With which type signatures will the code complile?

add17 = \x -> x + 17

(A) add17 :: Int -> Int

(B) add17 :: Double -> Double

(D) add17 :: (Num a) => a -> a

26 of 85

Function type quiz (3)

(A) add17 :: Int -> Int

(B) add17 :: Double -> Double

(D) add17 :: (Num a) => a -> a

(Num a) => a -> a is the general type signature. It allows the term \x -> x + 17 to specialize to a single Num type a, but not to convert from one to another. So (C) is the only illegal type.

27 of 85

(->) “function” is just a parameterized type

Recall the parameterized types we’ve encountered:

Maybe a = Nothing | Just a
[] a = [] | (:) a ([] a)

more familiarly: [a] = [] | a:[a]

(,) a b and (,,) a b c and (,,,) a b c d and…

more familiarly: (a, b) and (a, b, c) and…

a -> b is syntactic sugar for (->) a b.

28 of 85

(->) as a parameterized type:

The type-level “operator” (->) is right-associative.

a -> b <==> (->) a b

a -> b -> c <==> (->) a ((->) b c)

a -> b -> c -> d <==>

(->) a ((->) b ((->) c d))

29 of 85

Some useful functions

Arithmetic: (+), (-), (*), (/), quot, mod, negate
Ordering: (==), (/=), (<), (>), (<=), (>=)
Logical: not, (&&), (||)

shiftL and shiftR live in the Data.Bits module.

30 of 85

Don’t fear the type class

(+) :: Num a => a -> a -> a

entails the following (they’re all Num classes):

(+) :: Int -> Int -> Int

(+) :: Double -> Double -> Double

(+) :: Integer -> Integer -> Integer

(+) :: Word8 -> Word8 -> Word8

31 of 85

Code reuse

Type classes (which we’ll cover in much more detail, later) enable code reuse. You don’t have to write this sum function for each type. It works for all Num types.

sum [] = 0

sum (x:xs) = x + sum xs

λ :t sum

sum :: Num a => [a] -> a

λ sum [(1 :: Integer), 2, 3]

λ sum [1.3, 7.9, 8.1]

17.3

λ sum [(39 :: Word8), 255]

32 of 85

Conversions (Int -> Double)

λ :t fromIntegral

fromIntegral :: (Integral a, Num b) => a -> b

All the integer types (Int, Integer, Word, WordN, IntN for N = 8, 16, 32, 64) are Integral.

Since Int is Integral and Double is Num, Int -> Double is a subcase of the above. Thus, you can use fromIntegral as Int -> Double for the conversion.

33 of 85

Conversions (Double -> Integer)

λ :t truncate

(RealFrac a, Integral b) => a -> b

What’s RealFrac? Use :i or :info to find instances of a typeclass.

λ :i RealFrac

[...]

instance RealFrac Float -- Defined in ‘GHC.Float’

instance RealFrac Double -- Defined in ‘GHC.Float’

34 of 85

Conversions (Double -> Int?)

λ :t truncate

(RealFrac a, Integral b) => a -> b

You can use truncate as Double -> Integer, or as Double -> Int, but in general, you should convert to Integer. Unless you know for sure that you’re working with a value that will fit in an Int, prefer Integer.

35 of 85

Integer vs. Int

Most Haskell programmers consider the Int type to be “legacy”; its size is system-dependent (30 to 64 bits) and, because it’s boxed, it’s not much faster than Integer. (Unboxed types are used in high-performance computing, but won’t be used now).

Many functions use Int, but (arbitrary-precision) Integer is preferable if there is any risk of integer overflow. It’s only slightly slower than Int.

36 of 85

Conversions (a -> String)

For any type that is in the Show type class, you have the function:

show :: (Show a) => a -> String�

Almost all of the base types are Show. Function types are not Show.

37 of 85

Int -> Int is not Show

ghci> \x -> (x :: Int)

<interactive>:22:1:

No instance for (Show (Int -> Int)) arising from a use of ‘print’

In a stmt of an interactive GHCi command: print it

There’s nothing wrong with the function value. This fails because ghci (a “read-eval-print loop”) cannot call print :: (Show a) => a -> IO () on and Int -> Int, which has no Show instance.

38 of 85

Conversions (String -> a)

The inverse of Show/show is Read/read.

ghci> :t read

read :: Read a => String -> a

ghci> read "15" :: Int

ghci> read "meow" :: Int

*** Exception: Prelude.read: no parse

39 of 85

read is out of fashion.

General Haskell consensus is that non-IO code shouldn’t throw exceptions, which read will do if it can’t parse the String.

In general, prefer the Safe module’s readMay.

ghci> import Safe

ghci> :t readMay

readMay :: Read a => String -> Maybe a

ghci> readMay "12.5" :: Maybe Int

Nothing

ghci> readMay "12.5" :: Maybe Double

Just 12.5

40 of 85

Recursion vs. looping

Or: “Dude, where’s my FOR-loop?”

Usually, Haskell prefers recursion over looping.

41 of 85

Recursion vs. looping

int factorial(int n) {

int out = 1;

for (int i = n; i > 0; i--)

out = out * n;

return out;

}

factorial n =

if n == 0 then 1

else n * factorial (n - 1)

factorial n = loop 1 n

where loop acc n =

if (n == 0) then acc

else loop (acc * n) (n - 1)

factorial n = product [1..n]

42 of 85

sequence, mapM and cousins

For effectful computations, looping is sometimes a useful idiom:

sequence :: [IO a] -> IO [a]

sequence [a1, …, aN] = do r1 <- a1

...

rN <- rN

return [r1, …, rN]

43 of 85

sequence example

import System.Posix.Files

getFileSize :: String -> IO Int

getFileSize fname = do

stat <- getFileStatus fname

return $ fromIntegral $

fileSize stat

λ getFileSize "Lab1.hs"

7038

-- return is IO Int,

-- ghci prints the Int

λ getFileSize "Lab2.hs"

21845

λ sequence [

getFileSize "Lab1.hs",

getFileSize "Lab2.hs",

getFileSize "README.md"]

[7038,21845,1051]

44 of 85

sequence, mapM and cousins

For effectful computations, looping is sometimes a useful idiom:

sequence :: [IO a] -> IO [a]

sequence [a1, …, aN] = do r1 <- a1

...

rN <- rN

return [r1, …, rN]

Exercise (answer given in Lecture #4): write sequence.

45 of 85

sequence, mapM and cousins

sequence :: [IO a] -> IO [a]

mapM :: (a -> IO b) -> [a] -> IO [b]

mapM f xs = sequence $ map f xs

mapM_ :: (a -> IO b) -> [a] -> IO ()

mapM_ f xs = mapM f xs >> return ()

forM = flip mapM, forM_ flip mapM_

46 of 85

mapM and forM examples

λ sequence [getFileSize "Lab1.hs", getFileSize "Lab2.hs", getFileSize "README.md"]

[7038,21845,1051]

λ mapM getFileSize ["Lab1.hs", "Lab2.hs", "README.md"]

[7038,21845,1051]

λ forM ["Lab1.hs", "Lab2.hs", "README.md"] getFileSize

[7038,21845,1051]

47 of 85

looping with forM_

λ import Control.Monad

λ forM_ [1..5] $ \i ->

print i

-- returns ()

λ sequence $

map (\x -> (print $ x^2)

>> (return $ x^2))

[1..4]

[1,4,9,16]

48 of 85

Change of topic: laziness

49 of 85

Yes, Haskell is “lazy”.

You’ve heard that Haskell’s “laziness” makes it hard to reason about code. The concern is mostly overblown. It is true that:

Errors can happen at unexpected times.
Laziness can make it hard to reason about performance.

50 of 85

Yes, Haskell is “lazy”.

You’ve heard that Haskell’s “laziness” makes it hard to reason about code. The concern is mostly overblown. It is true that:

Errors can happen at unexpected times…

… this can be mitigated by avoiding exceptions in non-IO code (throwing an exn is a side effect).

Laziness can make it hard to reason about performance.

Laziness is opt-out, not mandatory. You can write strict Haskell and high-performance Haskell often is strict.

51 of 85

Advantages of laziness

Avoidance of unnecessary work.
Infinite collections.
No distinction between “control flow” and regular functions.

52 of 85

Infinite collections

-- This is clearly NOT the most efficient

-- way to do this computation, but it is

-- correct.

taxicab :: Integer -> Bool

taxicab n =

let m = last $

takeWhile (\x -> x^3 <= n) [1..]

solns = [(x, y) | x <- [1..m],

y <- [1..m],

x <= y,

x^3 + y^3 == n]

in length solns >= 2

ghci> take 5 $

filter taxicab [1..]

[1729,4104,13832,20683,32832]

[1..] is infinite, so evaluating

filter taxicab [1..]

in its entirety would take (literally) forever, but we can take its first 5 elements in finite time.

53 of 85

Thunks

A “lazy” thunk is a piece of code that hasn’t been evaluated yet.

In a strict-by-default language like Scheme or OCaml, you might define a lazy X as a 0-argument function returning X.

54 of 85

Thunks in Clojure and OCaml

(defn force [th]

(th))

(defn thunk-7 []

(fn [] (+ 3 4)))

type ‘a thunk = unit -> ‘a

force th = th ()

let thunk_7 =

(fun _ -> 3 + 4)

55 of 85

Thunks in Haskell

In Haskell, an a is, by default, a thunk producing an a when evaluated.

thunk7 :: Int

thunk7 = 3 + 4

Lazy-by-default makes the infinite data structure “magic” possible.

56 of 85

take 2 [1..] ({} as thunk)

take 0 _ = []

take n [] = []

take n (x:xs) = x:(take (n-1) xs)

take 2 {[1..]} = take 2 (1:{[2..]})

= 1:(take 1 {[2..]})

= 1:(take 1 2:{[3..]})

= 1:(2:(take 0 {[3..]}))

= 1:(2:[]) = [1, 2]

57 of 85

take 2 [1..] ({} as thunk)

take 0 _ = []

take n [] = []

take n (x:xs) = x:(take (n-1) xs)

take 2 {[1..]} = take 2 (1:{[2..]})

= 1:(take 1 {[2..]})

= 1:(take 1 2:{[3..]})

= 1:(2:(take 0 {[3..]}))

= 1:(2:[]) = [1, 2]

never evaluated!

58 of 85

Short-circuiting (&&) for free

In many languages, you can’t define short-circuiting AND as a function (hence macros).

clojure> (defn my-and [x y] (if x y false))

#'user/my-and

clojure> (my-and false

(println "DROP TABLE Students"))

DROP TABLE Students

false

59 of 85

Short-circuiting (&&) for free

In many languages, you can’t define short-circuiting AND as a function. With Haskell’s thunks, you can. If it weren’t provided, you could write this:

(&&) :: Bool -> Bool -> Bool

(&&) True b = b

(&&) False _ = False

60 of 85

“Short-circuiting” function example

(&&) True b = b

(&&) False _ = False

{False && error "barf"} = (&&) {False} {error "barf"}

= (&&) False {error "barf"}

= False

61 of 85

Non-strict (or lazy) evaluation

λ 1 `quot` 0

*** Exception: divide by zero

λ length [7, 1 `quot` 0]

???

62 of 85

Non-strict (or lazy) evaluation

λ 1 `quot` 0

*** Exception: divide by zero

λ length [7, 1 `quot` 0]

63 of 85

{} means “deferred” (thunk) here

length {[7, 1 `quot` 0]}

= length ( (:) {7} {[1 `quot` 0]})

= 1 + length {[1 `quot` 0]}

= 1 + length ( (:) {1 `quot` 0} {[]})

= 1 + 1 + length {[]}

= 1 + 1 + length []

= 1 + 1 + 0

= 2

64 of 85

A danger at scale: thunk leaks!

65 of 85

foldl :: (a -> b -> a) -> a -> [b] -> a

Haskell’s fold (foldl, foldr, etc.) family of functions represent “reductions” or accumulations.

foldl :: (a -> b -> a) -> a -> [b] -> a

foldl f z [] = z

foldl f z (x:xs) = (foldl f (z `f` x) xs)

foldl f x0 [x1, … , xN] = ((x0 `f` x1) `f` … xN_1) `f` xN)

66 of 85

foldl and foldr in action

foldl (+) 867 [5, 3, 0, 9] =

(((867 + 5) + 3) + 0) + 9 = 884

foldr (++) "!"

["A ", "Red Dragon ", "draws near"] =

"A " ++ ("Red Dragon " ++ ("draws near" ++ "!")) =

"A Red Dragon draws near!"

67 of 85

foldl and foldr

λ foldl (+) 0 [1..5]

????

λ foldl (*) 1 [1..4]

????

λ foldr (++) "Zebra" ["Apple", "Bear"]

????

λ foldr (\x y -> x) 0 [1, 2, 3]

????

λ foldr (\x y -> x) 0 []

????

68 of 85

foldl and foldr

λ foldl (+) 0 [1..5]

λ foldl (*) 1 [1..4]

????

λ foldr (++) "Zebra" ["Apple", "Bear"]

????

λ foldr (\x y -> x) 0 [1, 2, 3]

????

λ foldr (\x y -> x) 0 []

????

69 of 85

foldl and foldr

λ foldl (+) 0 [1..5]

λ foldl (*) 1 [1..4]

λ foldr (++) "Zebra" ["Apple", "Bear"]

????

λ foldr (\x y -> x) 0 [1, 2, 3]

????

λ foldr (\x y -> x) 0 []

????

70 of 85

foldl and foldr

λ foldl (+) 0 [1..5]

λ foldl (*) 1 [1..4]

λ foldr (++) "Zebra" ["Apple", "Bear"]

"AppleBearZebra"

λ foldr (\x y -> x) 0 [1, 2, 3]

????

λ foldr (\x y -> x) 0 []

????

71 of 85

foldl and foldr

λ foldl (+) 0 [1..5]

λ foldl (*) 1 [1..4]

λ foldr (++) "Zebra" ["Apple", "Bear"]

"AppleBearZebra"

λ foldr (\x y -> x) 0 [1, 2, 3]

λ foldr (\x y -> x) 0 []

????

72 of 85

foldl and foldr

λ foldl (+) 0 [1..5]

λ foldl (*) 1 [1..4]

λ foldr (++) "Zebra" ["Apple", "Bear"]

"AppleBearZebra"

λ foldr (\x y -> x) 0 [1, 2, 3]

λ foldr (\x y -> x) 0 []

73 of 85

The bad news: “thunk leaks”

foldl :: (a -> b -> a) -> a -> [b] -> a

foldl f z [] = z

foldl f z (x:xs) = (foldl f (z `f` x) xs)

This can* leak memory: foldl (+) 0 [1..10^9]

foldl (+) 0 [1..10^9] = foldl (+) (0 + 1) [2..10^9]

= foldl (+) ((0 + 1) + 2) [3..10^9]

= ...

*Technically, it’s undefined behavior. The compiler may optimize the leak away.

74 of 85

Building up a nested thunk

foldl (+) 0 [1, 2, 3] =

foldl (+) (0 + 1) [2, 3] =

foldl (+) ((0 + 1) + 2) [3] =

foldl (+) (((0 + 1) + 2) + 3) [] =

(((0 + 1) + 2) + 3) =

((1 + 2) + 3) = 3 + 3 = 6

It builds up a nested thunk. This is a non-issue for [1..3], but (unless the compiler optimizes the thunks away) a show-stopper for [1..10^9], which should be summable in constant space, but instead can blow out memory.

75 of 85

foldl' and seq

foldl' is a non-leaky version of foldl. To understand it, we need to understand seq:

seq :: a -> b -> b

seq x y forces x and returns y.

foldl' f z [] = z

foldl' f z (x:xs) =

let z' = f z x in z' `seq` foldl' f z' xs

76 of 85

Comparison: foldl vs. foldl'

foldl f z [] = z

foldl f z (x:xs) =

let z' = f z x in

foldl f z' xs

foldl (+) 0 [1, 2, 3] =

foldl (+) (0 + 1) [2, 3] =

foldl (+) ((0 + 1) + 2) [3] =

foldl (+) (((0 + 1) + 2) + 3) [] =

(((0 + 1) + 2) + 3) =

((1 + 2) + 3) = 3 + 3 = 6

foldl' f z [] = z

foldl' f z (x:xs) =

let z' = f z x in

z' `seq` foldl' f z' xs

foldl' (+) 0 [1, 2, 3] =

let z' = 0 + 1 in

z' `seq` foldl' (+) z' [2, 3] =

foldl' (+) 1 [2, 3] =

foldl' (+) 3 [3] =

foldl' (+) 6 [] = 6

77 of 85

Technical note: lazy vs. nonstrict

While lazy (vs. eager) and nonstrict (vs. strict) are often used interchangeably, they aren’t always the same thing.

Laziness is a matter of when work is done. Nonstrictness is a formal semantic property (covered in the next slide).

You could have nonstrictness without laziness.

78 of 85

Defining strictness

Recall ⊥, the “value” assigned to failure (inf. loop or exn). We use “= ⊥” to mean “computation fails”.

1 `quot` 0 = ⊥ (exception)

head [] = ⊥ (pattern match failure)

take 1 $ filter (\_ -> False) [1..] = ⊥ (non-term.)

length [1 `quot` 0] = 1

True && ⊥ = ⊥

False && ⊥ = False

79 of 85

Defining strictness

We use “= ⊥” to mean “computation fails”.

An N-ary function f is strict in argument x* ∈ {x1, … xN} if:

x* = ⊥ ⇒ f x1 … xN = ⊥

(+) x1 x2 is strict in both arguments.

(&&) b1 b2 is nonstrict in b2; False && ⊥ = False

80 of 85

What Haskell actually does

Thunks (e.g. Int) are evaluated, actions (e.g. IO Int) are executed (and may have side effects).

When a program runs, the action called main, with type signature IO (), is run. Actions are executed, and thunks are evaluated, as needed for main to do its job.

81 of 85

Which bits of code will run?

a :: Int

a = 5 + 6

b :: Int

b = 4 * 7

c :: Int

c = 5 + 3

d :: Int

d = a - c

action1 :: IO ()

action1 = print $ b + c

action2 :: IO Int

action2 = return $ a * a

action3 :: IO ()

action3 = print d

action4 :: IO Int

action4 = error "barf"

main :: IO ()

main = do

n <- action2

print n

action3

82 of 85

Which bits of code will run?

a :: Int

a = 5 + 6

b :: Int

b = 4 * 7

c :: Int

c = 5 + 3

d :: Int

d = a - c

action1 :: IO ()

action1 = print $ b + c

action2 :: IO Int

action2 = return $ a * a

action3 :: IO ()

action3 = print d

action4 :: IO Int

action4 = error "barf"

main :: IO ()

main = do

n <- action2

print n

action3

stdout:

121

a = 11

83 of 85

Which bits of code will run?

a :: Int

a = 5 + 6

b :: Int

b = 4 * 7

c :: Int

c = 5 + 3

d :: Int

d = a - c

action1 :: IO ()

action1 = print $ b + c

action2 :: IO Int

action2 = return $ a * a

action3 :: IO ()

action3 = print d

action4 :: IO Int

action4 = error "barf"

main :: IO ()

main = do

n <- action2

print n

action3

stdout:

121

a = 11

c = 8

d = 3

84 of 85

Which bits of code will run?

a :: Int

a = 5 + 6

b :: Int

b = 4 * 7

c :: Int

c = 5 + 3

d :: Int

d = a - c

action1 :: IO ()

action1 = print $ b + c

action2 :: IO Int

action2 = return $ a * a

action3 :: IO ()

action3 = print d

action4 :: IO Int

action4 = error "barf"

main :: IO ()

main = do

n <- action2

print n

action3

stdout:

121

a = 11

c = 8

d = 3