1 of 224

“Data, Data, Data… Watson, I need Data!”

CS280: Computer Vision

  1. Efros, UC Berkeley, Spring 2026

2 of 224

.

3 of 224

(much of) The Magic is in the Data

4 of 224

Importance of Past Experience

Claude Monet

Gare St.Lazare

Paris, 1877

5 of 224

There is almost nothing inside!

6 of 224

Seeing more than meets the eye

Video by Antonio Torralba (starring Rob Fergus)

7 of 224

But actually…

Video by Antonio Torralba (starring Rob Fergus)

8 of 224

“Our perception relies

on memory as much as it does on incoming information, which blurs the border between perception and cognition.”

-- Moshe Bar

“Mind" is largely an emergent property of "data."

-- Lance Williams

9 of 224

“Our perception relies

on memory as much as it does on incoming information, which blurs the border between perception and cognition.”

-- Moshe Bar

“Mind" is largely an emergent property of "data."

-- Lance Williams

10 of 224

In old days, data got little respect

Algorithm

Features

Data

11 of 224

Face Detection: Early Success Story (late 1990s)

    • Rowley, Baluja, and Kanade, 1998
      • algorithm: neural network
    • Schniderman & Kanade, 1999
      • algorithm: naïve Bayes
    • Viola & Jones, 2001
      • algorithm: boosted cascade

12 of 224

Our Scientific Narcissism

All things being equal, we prefer to credit our own cleverness

13 of 224

“Unreasonable Effectiveness of Data”

  • Parts of our world can be explained by elegant mathematics:
    • physics, chemistry, astronomy, etc.
  • But some cannot:
    • psychology, genetics, economics, AI, etc.

  • Enter: The Magic of Data

[Halevy, Norvig, Pereira 2009]

Decade before “The Bitter Lesson”

14 of 224

Evolution is messy

Navier-Stokes Equation

+ weather

+ location

+ …

15 of 224

Let’s Define a Tree

Brown trunk moving upward and branching with leaves

16 of 224

Are these trees?

  • Hard to give a good definition of tree
  • But any 3-year-old can tell a tree from a non-tree
  • The 3-year-old has learned from data!

Slide from Makdon Ismail

17 of 224

18 of 224

With enough data, brain-dead lookup (aka Nearest Neighbor classifier) works surprisingly well

19 of 224

How many images does a 5-year-old see?

  • “80 million tiny images: a large dataset for non-parametric object and scene recognition”, Antonio Torralba, Rob Fergus and William T. Freeman. PAMI 2008.

Where CIFAR-10/100 came from!

20 of 224

Tiny Image pack a punch!

4x4

8x8

16x16

32x32

64x64

21 of 224

Human Scene Recognition

22 of 224

32x32 turns out to be enough!

23 of 224

80,000,000 images

75.000 non-abstract nouns from WordNet

7 Online image search engines

Google: 80 million images

And after 1 year downloading images

A. Torralba, R. Fergus, W.T. Freeman. PAMI 2008

2 years before ImageNet!

24 of 224

Powers of 10

Number of images on my hard drive: 104

Number of images seen during my first 10 years: 108

(3 images/second * 60 * 60 * 16 * 365 * 10 = 630720000)

Number of images seen by all humanity: 1020

106,456,367,669 humans1 * 60 years * 3 images/second * 60 * 60 * 16 * 365 =

1 from http://www.prb.org/Articles/2002/HowManyPeopleHaveEverLivedonEarth.aspx

Number of photons in the universe: 1088

Number of all 32x32 images: 107373

256 32*32*3 ~ 107373

A. Torralba, R. Fergus, W.T.Freeman. PAMI 2008

25 of 224

Some images are unique

26 of 224

But not all image are so original

27 of 224

But not all image are so original

28 of 224

Lots

Of

Images

A. Torralba, R. Fergus, W.T.Freeman. PAMI 2008

29 of 224

Lots

Of

Images

A. Torralba, R. Fergus, W.T.Freeman. PAMI 2008

30 of 224

Lots

Of

Images

31 of 224

32 of 224

Automatic Colorization

Grayscale input High resolution

Colorization of input using average

A. Torralba, R. Fergus, W.T.Freeman. 2008

33 of 224

First Scaling Law!

34 of 224

[Hays & Efros, SIGGRAPH’07]

35 of 224

36 of 224

2 Million Flickr Images

37 of 224

38 of 224

39 of 224

40 of 224

41 of 224

42 of 224

43 of 224

44 of 224

45 of 224

46 of 224

Why does it work?

47 of 224

48 of 224

Nearest neighbors from a�collection of 20 thousand images

49 of 224

Nearest neighbors from a�collection of 2 million images

50 of 224

51 of 224

52 of 224

53 of 224

… 200 scene matches

54 of 224

55 of 224

56 of 224

im2GPS�(using 6 million GPS-tagged Flickr images)

Im2gps [Hays & Efros, CVPR’08]

57 of 224

6 Million Flickr Images

58 of 224

im2GPS�(using 6 million GPS-tagged Flickr images)

Im2gps [Hays & Efros, CVPR’08]

59 of 224

15 years later…

60 of 224

Algorithm vs. Data

PlaNet, 2016

im2gps, 2008

  • Deep Net
  • 91 million images
  • Nearest Neighbors
  • 6 million images

61 of 224

Algorithm vs. Data

62 of 224

The Good News

Really stupid algorithms + Lots of Data

= “Unreasonable Effectiveness”

[Halevy, Norvig, Pereira 2009]

63 of 224

But can humans ever remember so much?

[Halevy, Norvig, Pereira 2009]

64 of 224

What’s the Capacity of Visual Long Term Memory?

http://olivalab.mit.edu/MM/

Aude Oliva, MIT

65 of 224

What’s the Capacity of Visual Long Term Memory?

“Basically, my recollection is that we just separated the pictures into distinct thematic categories: e.g. cars, animals, single-person, 2-people, plants, etc.) Only a few slides were selected which fell into each category, and they were visually distinct.”

According to Standing

Standing (1973)

10,000 images

83% Recognition

What was known…

What was not known…

Sparse Details

Dogs

Playing Cards

“Gist” Only

Highly Detailed

… people can remember thousands of images

… what people are remembering for each item?

Slide by Aude Oliva

66 of 224

Completely

different objects...

Different exemplars

of the same kind of object...

Different states of

the same object...

Massive Memory Experiment I

A stream of objects will be presented on the screen for

~ 3 second each.

Your primary task:

Remember them ALL!

afterwards you will be tested with…

67 of 224

Your other task:

Detect exact repeats anywhere in the stream

Massive Memory Experiment I

68 of 224

Ready?

(Seriously, get ready to clap. The images go by fast…)

69 of 224

70 of 224

71 of 224

72 of 224

<clap!>

73 of 224

74 of 224

75 of 224

76 of 224

77 of 224

78 of 224

79 of 224

80 of 224

81 of 224

82 of 224

83 of 224

84 of 224

<clap!>

85 of 224

86 of 224

87 of 224

88 of 224

89 of 224

90 of 224

10 Minutes Later...

91 of 224

92 of 224

93 of 224

94 of 224

95 of 224

96 of 224

97 of 224

98 of 224

<clap!>

99 of 224

100 of 224

101 of 224

102 of 224

<clap!>

103 of 224

104 of 224

105 of 224

106 of 224

30 Minutes Later...

107 of 224

108 of 224

109 of 224

110 of 224

111 of 224

112 of 224

113 of 224

114 of 224

115 of 224

116 of 224

117 of 224

118 of 224

1 Hour Later...

119 of 224

120 of 224

121 of 224

122 of 224

123 of 224

124 of 224

125 of 224

126 of 224

127 of 224

128 of 224

<clap!>

129 of 224

130 of 224

131 of 224

132 of 224

2 Hours Later...

133 of 224

134 of 224

135 of 224

136 of 224

<clap!>

137 of 224

138 of 224

139 of 224

140 of 224

141 of 224

142 of 224

143 of 224

144 of 224

145 of 224

146 of 224

4 Hours Later...

147 of 224

148 of 224

149 of 224

150 of 224

151 of 224

152 of 224

153 of 224

154 of 224

155 of 224

156 of 224

<clap!>

157 of 224

158 of 224

159 of 224

160 of 224

5:30 Hours Later...

161 of 224

162 of 224

Which one did you see?

(go ahead and shout out your answer)

163 of 224

-A-

-B-

164 of 224

-A-

-B-

165 of 224

-A-

-B-

166 of 224

Recognition Memory Results

Visual Cognition

Expert Predictions

92%

Replication of Standing (1973)

167 of 224

Recognition Memory Results

92%

88%

87%

Brady, et al. (2008), PNAS

168 of 224

So, do humans have �“photographic memory”?

169 of 224

“No meaning” textures matching 1st & 2nd order statistics of real scenes

(by Portila & Simmonceli)

Isola & Oliva,

unpublished

170 of 224

Ready?

Clap your hands when

you see an image repeat

171 of 224

172 of 224

173 of 224

174 of 224

<clap!>

175 of 224

176 of 224

177 of 224

178 of 224

179 of 224

180 of 224

181 of 224

182 of 224

183 of 224

184 of 224

185 of 224

186 of 224

<clap!>

187 of 224

188 of 224

189 of 224

190 of 224

191 of 224

192 of 224

<clap!>

193 of 224

194 of 224

195 of 224

196 of 224

197 of 224

198 of 224

199 of 224

200 of 224

<clap!>

201 of 224

202 of 224

203 of 224

204 of 224

205 of 224

206 of 224

207 of 224

208 of 224

209 of 224

210 of 224

<clap!>

211 of 224

212 of 224

213 of 224

214 of 224

215 of 224

216 of 224

<clap!>

217 of 224

218 of 224

219 of 224

220 of 224

<clap!>

221 of 224

222 of 224

223 of 224

d

Scene/object

texture

4

3

2

1

0

Isola & Oliva,

unpublished

chance

Humans do more than just memorize

224 of 224

Memorable

Hit rate: 67/70

False alarm rate: 4/80

Average

Hit rate: 59/81

False alarm rate: 7/92

Forgettable

Hit rate: 21/68

False alarm rate: 3/82

Memorability (Isola et al)