Published using Google Docs
CS294-192: Visual Scene Understanding (Spring 2022)
Updated automatically every 5 minutes

CS294-192: Visual Scene Understanding

Spring 2022

Instructor: Alexei Efros

Course Coordinator: Allan Jabri

Class time: MW 11am-12:30pm

Location: 1215 BWW

Registration #: 32761 (with code)

Prerequisites: CS280 or equivalent

(no exceptions!)

Piazza signup: piazza.com/berkeley/spring2022/cs294192

https://fredericforest.com/

“Бог создал мир из ничего.

Учись, художник, у него”  

– К. Бальмонт

             "The aim of computer vision is to overfit to our visual world"

-- Antonio Torralba (after his third beer)

Overview:

In this small, advanced computer vision class, we will explore the thesis that knowledge of, and appreciation for, the vision problem specifically (as opposed to treating it as a generic learning machine) should be helpful in the development of a truly general visual AI system, as well as provide insights about human vision and cognition.   Consequently, we will be reading an eclectic mix of papers, from the hot-off-the-arxiv to the classic works in computer vision, human vision, psychophysics, and cognitive science.   We will revisit the ideas of Gibson, Koenderink, et al. and see if they apply to the modern age.   The class aims to give students a broader, more historically-informed view of scene understanding, and will hopefully spark their inspiration and creativity in their own research.  Requirements: summaries of weekly readings, 1-2 presentations, active participation in class discussions, and a final project.  

Prerequisites: CS280 or equivalent (no exceptions!)

Course Requirements:

Schedule

Date

Topic

Assigned Papers

presenter(s)

slides

Jan 24

Introduction

Alyosha

slides

Jan 26

Theories of Vision

Alyosha

slides

Jan 31

Philosophies of Vision

Alyosha

slides

Feb 2

Philosophies of Vision, cont.

Alyosha

slides (cont)

Feb 7

Texture / pre-attentive processing

Alyosha +

Antonio

Alyosha slides

+

Antonio slides

Feb 9

Memory / Memorability

Alyosha

slides

Feb  14

Memory / Memorability II

Brent + Vongani

Brent slides 

+

Vongani slides

Feb 16

Images as Attractors

Alyosha + Allan

Alyosha Slides

+

Allan slides

Feb 21

NO CLASS

(President’s day)

Feb 23

Images as Attractors II

Raven + Kehan

Raven slides 

+

Kehan slides

Feb 28

Grouping / Segmentation

Alyosha

slides

March 2

Grouping and Transformers

Xudong + Norman

Xudong slides 

+

Norman slides

March 9 (instead of March 7)

Scenes and Qualitative 3D

Alyosha

slides

March 11 (Friday)

Scenes and Qualitative 3D

Georgios + Dave

Georgios slides

+

Dave slides

March 14

Self-Supervision

Alyosha

slides

March 16

Self-Supervision (cont.)

Alyosha

slides

March 28

Self-Supervision (cont.)

Norman + Xudong

Norman slides

+

Xudong slides

March 30

Self-Supervision + Disentanglement

Justin

Justin slides

April 4

Disentanglement + Emergence

Alyosha + Ethan

Alyosha slides

+

Ethan slides

April 6

Disentanglement + Emergence

Rudy + Brent

Brent slides

April 11

Categories

Alyosha + Jashushan + Neerja

Alyosha slides 

+

Jashushan slides

April 13

Affordances + Bias

Ethan + Alyosha

Ethan slides 

+

Alyosha slides

April 18

Dataset Bias

Yossi + Justin + Neerja

Yossi slides

+

Justin slides

+

Neerja slides

April 20

Continual / Lifelong Learning

Alyosha + Yu

Alyosha slides 

+

Yu slides

April 25

Continual / Lifelong Learning

Toru + Yossi + Jathushan

Toru slides

+

Yossi slides

+

Jathushan slides

April 27

Grand Finale!

 Allan + Alyosha

Allan slides

+

Alyosha slides

May 4th

FINAL PROJECTS

everyone

Very Tentative Paper List

Week 1:  Theories of Vision: Bottom-Up vs. Top-Down:

Week 2: Texture / pre-attentive processing:

ImageNet-trained CNNs are biased towards texture; increasing shape bias improves accuracy and robustness, ICLR’19

Week 3: Memory / Memorability:

Week 4: Images as attractors:

Week 5: Grouping / Segmentation / Self-attention

Diffusion

Week 6: Scenes and Qualitative 3D

for View Synthesis from a Single Image, 2021

Week 7: Self-Supervision by space / time / geometry / physics

Week 8: Emergence of Structure and Disentanglement

Week 9: Categorization and Hierarchies

   

SKIPPING Week 10: Correspondence (explicit vs. implicit), Context and Analogies

Week 11: Datasets, bias, overfitting

Week 12: Continuous / lLifelong  Learning / Test-time Training:

Week 13:  Active Vision, Robotics, Evolution

Other misc topics:

Similar Courses: