Date |
Topic |
Presenter |
Slides/videos |
Papers/code |
Week 1 |
Introduction |
|
|
|
W
Sept. 3 |
Class goals |
Antonio |
lecture1.ppt
blur.avi
highres.avi |
|
Week 2 |
Single class object detection,
Objects without scenes |
|
|
|
M
Sept. 8 |
Overview on object recognition and
one practical example |
Antonio |
lecture2.ppt |
Biederman. Recognition-by-Components:
A Theory of Human Image Understanding. Psychological Review,
1987.
Fischler and Elschlager. The
representation and matching of pictorial images. IEEE Transactions
on Computers, Volume 22, 1973.
Code:
A simple object detector with boosting, from the Short course
on recognizing and learning object categories, by Fei-Fei, Fergus,
and Torralba. 2005.
|
W
Sept. 10 |
Template matching and
gradient histograms |
Presenter:
Nicolas Pinto
Evaluator:
Jenny Yuen |
Nicolas
presentation.pdf
Evaluation:
Connecting
labelme and the DT detector by Jenny |
Lowe. Object recognition from local
scale-invariant features, ICCV 1999. (code)
Dalal and Triggs. Histograms
of Oriented Gradients for Human Detection, CVPR 2005. (code)
Felzenszwalb, McAllester and Ramanan. A
Discriminatively Trained, Multiscale, Deformable Part Model.
CVPR 2008. (code)
|
Week 3 |
Thousands of categories |
|
|
|
M
Sept. 15 |
Levels of categorization
and multiclass object recognition |
Antonio |
lecture3.ppt |
E. Rosch. Principles of Categorization.
1978.
S. E. Palmer. "Vision Science", chapter 9. (In fact,
just read the entire book).
B. Russell, A. Torralba, K. Murphy, W. T. Freeman. LabelMe:
a database and web-based tool for image annotation. IJCV 2008.
(website)
|
W
Sept. 17 |
Sharing parts for intraclass transfer learning |
Presenter:
Sharat Chikkerur
Evaluator:
Hueihan Jhuang |
Sharat's presentation.pdf
Evaluation:
Shared
parts for actions by Hueihan |
Fergus, Perona, and Zisserman. Object
class recognition by unsupervised scale invariant learning.
CVPR 2003. (code)
Fei-Fei, Fergus and Perona. One-Shot
learning of object categories. PAMI, 2006.
Torralba, Murphy and Freeman. Sharing
visual features for multiclass and multiview object detection.
PAMI 2007.
|
M
Sept. 22 |
Student Holiday - No class
|
|
|
|
Week 4 |
3D object models |
|
|
|
W
Sept. 24 |
Explicit and implicit 3D object models
|
Antonio |
lecture4.ppt
Class drawings will be posted soon. |
S. E. Palmer. "Vision Science", chapter 9.
Joseph L. Mundy. Object Recognition
in the Geometric Era: a Retrospective. 2006.
|
M
Sept. 29 |
Recognition of 3D objects |
Presenter:
Alec Rivers |
Alec's presentation.ppt |
J. Winn and J. Shotton. The
Layout Consistent Random Field for Recognizing and Segmenting Partially
Occluded Objects. CVPR 2006.
D. Hoiem, C. Rother, and J. Winn. 3D
LayoutCRF for Multi-View Object Class Recognition and Segmentation.
CVPR 2007.
S. Savarese and L. Fei-Fei. 3D
generic object categorization, localization and pose estimation.
ICCV 2007.
|
Week 5 |
Scenes without objects |
|
|
|
W
Oct. 1 |
Global scene representations |
Presenter:
Tilke Judd
Evaluator:
Nicolas Pinto |
Tilke's presentation.pdf
Evaluation:
Nicolas'
gist implementation |
A. Oliva, A. Torralba. Modeling
the shape of the scene: a holistic representation of the spatial
envelope. IJCV 2001. (gist
code)
L. Fei-Fei and P. Perona. A
Bayesian Hierarchical Model for Learning Natural Scene Categories.
CVPR. 2005.
S. Lazebnik, C. Schmid, and J. Ponce. Beyond
Bags of Features: Spatial Pyramid Matching for Recognizing Natural
Scene Categories. CVPR 2006.
|
M
Oct. 6 |
Scene recognition |
Aude Oliva |
lecture5.pdf |
A. Oliva. Gist
of the scene. Chapter, Neurobiology of attention |
Week 6 |
Objects in context |
|
|
|
W
Oct. 8 |
Scenes and objects |
Antonio |
lecture6.ppt |
I. Biederman, R.J. Mezzanotte, and J.C. Rabinowitz. Scene
perception: Detecting and judging objects undergoing relational
violations. Cognitive Psychology, 1982.
A. Oliva, A. Torralba. The
role of context in object recognition. Trends in Cognitive Sciences,
2007.
|
M.
Oct. 13 |
Student Holiday - No class
|
|
|
|
W.
Oct. 15 |
ECCV - No class |
|
|
|
Week 7 |
Internet vision and the power of lots of data |
|
|
|
M.
Oct. 20 |
Powers of 10 |
Antonio |
lecture7.ppt |
|
W.
Oct. 22 |
|
Presenter:
Vladimir Bychkovsky
Evaluator:
Krista Ehinger |
Evaluation: Scene
completion demo by Krista |
N. Snavely, S. M. Seitz, R. Szeliski. Photo tourism: Exploring photo
collections in 3D, Siggraph 2006 (website)
(code)
J. Hays, A. A. Efros. Scene
Completion Using Millions of Photographs. SIGGRAPH 2007, (website
and code)
A. Torralba, R. Fergus, W. T. Freeman, 80
million tiny images: a large dataset for non-parametric object and
scene recognition. PAMI 2008. (website)
|
Week 8 |
Low and Mid-level vision |
|
|
|
M.
Oct. 27 |
Low-level vision:
shading, reflectance, and texture. |
Bill Freeman |
lecture8.ppt shadingReflSurvey.pdf
|
M. Tappen, W. Freeman and E. Adelson. Recovering
intrinsic images from a single image. PAMI 2005.
A. Efros and W. Freeman. Image
Quilting for Texture Synthesis and transfer. SIGGRAPH 2001
|
W.
Oct. 29 |
Edges, regions and textures |
Presenter:
Tom Ouyang
Evaluator:
Gokberk Cinbis
|
Tom's presentation.ppt
Evaluation:
Gokberk
Cinbis |
H.G. Barrow, J.M. Tenenbaum. Recovering
Intrinsic Scene Characteristics from Images. Artificial Intelligence
1977
J. Shi and J. Malik. Normalized Cuts
and Image Segmentation.
J. Portilla and E. Simoncelli. A
Parametric Texture Model Based on Joint Statistics of Complex Wavelet
Coefficients. IJCV 2004. (code)
|
Week 9 |
Grammars for
objects and scenes |
|
|
|
M.
Nov. 3 |
Grammars and topic models |
Antonio & Meg Aycinena |
lecture9.ppt |
S.C. Zhu and D. Mumford. A
Stochastic Grammar of Images. Foundations and Trends in Computer
Graphics and Vision, 2006. |
W.
Nov. 5 |
Grammars for low, mid and high level vision |
Presenters:
Tom Kollar &
Hueihan Jhuang |
Kollar's presentation.ppt
Hueihan's presentation.ppt |
Zhuowen Tu; Song-Chun Zhu. Image
segmentation by data-driven Markov chain Monte Carlo
Zhuowen Tu, Xiangrong Chen, Alan L. Yuille, Song-Chun Zhu. Image
Parsing: Unifying Segmentation, Detection, and Recognition
Z.J. Xu, H. Chen, S.C. Zhu, and J. Luo. A
Composite Template for Human Face Modeling and Sketch
F. Han and S.C. Zhu. Bottom-up/Top-down
Image Parsing with Attribute Graph Grammar
|
Week 10 |
3D scene models |
|
|
|
M.
Nov. 10 |
Student Holiday - No class
|
|
|
|
W.
Nov. 12 |
3D scenes |
Antonio |
lecture10.ppt |
A. Criminisi, I. Reid, and A. Zisserman. "Single View Metrology".
Proceedings of the 7th International Conference on Computer Vision,
Kerkyra, Greece, 1999. (website)
LabelMe 3D (website)
|
M.
Nov. 17 |
|
Presenter:
Krista Ehinger
Evaluator:
Tom Kollar |
Krista's presentation.ppt
Evaluation:
A comparison of
5 techniques by Tom |
Y. Horry, K.I. Anjyo and K. Arai. "Tour Into the Picture:
Using a spidery mesh user interface to make animation from a single
image". ACM SIGGRAPH 1997 (website)
D. Hoiem, A.A. Efros, and M. Hebert, "Automatic Photo Pop-up",
ACM SIGGRAPH 2005. (website)
A. Saxena, M. Sun, A. Y. Ng. "Learning 3-D Scene Structure
from a Single Still Image". In ICCV workshop on 3D Representation
for Recognition (3dRR-07), 2007 (website) |
Week 6 |
Objects in context, part 2 |
|
|
|
W.
Nov. 19 |
|
Presenter:
Gokberk Cinbis
Evaluator:
Sharat Chikkerur |
Gokberk's presentation.ppt
Evaluation:
Recipies for computing
'gist' features by Sharat |
A. Torralba. Contextual priming for object detection. IJCV 2003.
D Hoiem, A. Efros, and M Hebert. Geometric context from a single
image. ICCV 2005.
A. Rabinovich, A. Vedaldi, C. Galleguillos, E. Wiewiora and S. Belongie.
Objects in Context. ICCV 2007
|
Week 11 |
Hierarchies |
|
|
|
M.
Nov. 24 |
Biological inspired computer vision |
Antonio
Presenter:
Nat Twarog |
lecture11.ppt
Twarog's presentation.ppt |
T. Serre, L. Wolf and T. Poggio. Object
recognition with features inspired by visual cortex. CVPR 2005
B. Epshtein and S. Ullman. Feature
Hierarchies for Object Classification. ICCV 05
D. Geman. Coarse-to-Fine
Classification and Scene Labeling.
|
|
What happens if we solve object recognition? |
|
|
|
W.
Nov. 26 |
|
Antonio
Presenter:
Jenny Yuen |
lecture12.ppt and a few notes
on how to prepare a talk. |
P. Cavanagh, Vision
is getting easier every day, Perception 1996
Where are
the flying cars?
Y. Jin, S. Baluja, H. Rowley. Canonical
Image Selection from the Web, CIVR 2007 |
Week 12 |
Class project presentations |
|
|
|
M.
Dec. 1 |
20 minutes for each presentation |
2:30 pm - 4:30pm
|
|
2:35 Jenny Yuen
3:00 Gokberk Cinbis
3:25 Alec Rivers
3:50 Tom Ouyang |
W.
Dec. 3 |
|
1:30pm - 4:00pm |
|
1:35 Hueihan Jhuang and Sharat Chikkerur
2:00 Nathaniel R Twarog
2:25 Tom Kollar
2:50 Tilke Judd and Vladimir Bychkovsky
3:15 Nicolas Pinto
3:40 Vote count!
|