Lecture Schedule

From cs331b Special Topics in 3dRR

(Difference between revisions)
Jump to: navigation, search
 
(32 intermediate revisions not shown)
Line 11: Line 11:
| Introductions  
| Introductions  
|  
|  
-
| [http://www.stanford.edu/class/archive/cs/cs331b/cs331b.1142/wikiupload/4/4d/Admin_CS331.pdf slides]
+
| [http://www.stanford.edu/class/archive/cs/cs331b/cs331b.1142/wikiupload/5/5a/Lecture1.pdf lecture slides]
|-
|-
| 2  
| 2  
Line 19: Line 19:
|  
|  
3D object recognition by aspect graph-based models:  
3D object recognition by aspect graph-based models:  
 +
 +
[http://www.stanford.edu/class/archive/cs/cs331b/cs331b.1142/wikiupload/8/83/Lecture2.pdf lecture slides]
*Y.Xiang and S.Savarese. "Estimating the aspect layout of object categories". CVPR2012 [http://cvgl.stanford.edu/papers/xiang_cvpr12.pdf link]<br>
*Y.Xiang and S.Savarese. "Estimating the aspect layout of object categories". CVPR2012 [http://cvgl.stanford.edu/papers/xiang_cvpr12.pdf link]<br>
Line 26: Line 28:
| 9 /30  
| 9 /30  
| Representing and Recognizing Objects  
| Representing and Recognizing Objects  
-
| &nbsp;  
+
| Olga Russakovsky&nbsp;and Caleb Jordan
-
| 3D detection from D-RGB data:  
+
|  
 +
3D detection from D-RGB data:  
 +
 
 +
[http://www.stanford.edu/class/archive/cs/cs331b/cs331b.1142/wikiupload/c/c0/Lecture3.pdf slides]
 +
 
*L. Bo, K. Lai, Xiaofeng Ren, and D. Fox "Object&nbsp;Recognition with Hierarchical Kernel Descriptors" [https://homes.cs.washington.edu/~xren/publication/bo_cvpr11_hkdes.pdf link]  
*L. Bo, K. Lai, Xiaofeng Ren, and D. Fox "Object&nbsp;Recognition with Hierarchical Kernel Descriptors" [https://homes.cs.washington.edu/~xren/publication/bo_cvpr11_hkdes.pdf link]  
*L. Bo, X. Ren, and D. Fox. "Kernel Descriptors for Visual&nbsp;Recognition". NIPS, December 2010 [http://www.cs.washington.edu/ai/Mobile_Robotics/postscripts/kdes-nips-10.pdf link]  
*L. Bo, X. Ren, and D. Fox. "Kernel Descriptors for Visual&nbsp;Recognition". NIPS, December 2010 [http://www.cs.washington.edu/ai/Mobile_Robotics/postscripts/kdes-nips-10.pdf link]  
Line 37: Line 43:
| Representing and Recognizing Objects  
| Representing and Recognizing Objects  
| Guest lecturer: Dr. M. Stark  
| Guest lecturer: Dr. M. Stark  
-
| 3D object recognition by deformable part models:  
+
|  
 +
3D object recognition by deformable part models:  
 +
 
 +
[http://www.stanford.edu/class/archive/cs/cs331b/cs331b.1142/wikiupload/e/e5/Lecture4.pdf slides]
 +
 
*B. Pepik, M. Stark, P. Gehler and B. Schiele, "Teaching 3D&nbsp;geometry to deformable part models". CVPR2012 [http://www.d2.mpi-inf.mpg.de/sites/default/files/pepik12cvpr.pdf link]
*B. Pepik, M. Stark, P. Gehler and B. Schiele, "Teaching 3D&nbsp;geometry to deformable part models". CVPR2012 [http://www.d2.mpi-inf.mpg.de/sites/default/files/pepik12cvpr.pdf link]
Line 44: Line 54:
| 10/7  
| 10/7  
| Representing the 3D Space  
| Representing the 3D Space  
-
| &nbsp;  
+
|  
-
| Large scale 3D reconstruction by structure from motion&nbsp;:  
+
&nbsp;Devin LaSalle Guillory and&nbsp;Ziang Xie
 +
 
 +
|  
 +
Large scale 3D reconstruction by structure from motion&nbsp;:  
 +
 
 +
[http://www.stanford.edu/class/archive/cs/cs331b/cs331b.1142/wikiupload/6/64/Lecture5_2.pdf slides 1]
 +
 
 +
[http://www.stanford.edu/class/archive/cs/cs331b/cs331b.1142/wikiupload/d/d8/Lecture5.pdf slides 2]
 +
 
*S. Agarwal, Y. Furukawa, N. Snavely, I. Simon, B. Curless, S. Seitz and R. Szeliski. Communications of the ACM, Vol. 54, No. 10, Pages 105-112,&nbsp;October 2011 [http://grail.cs.washington.edu/projects/rome/rome_paper.pdf link]  
*S. Agarwal, Y. Furukawa, N. Snavely, I. Simon, B. Curless, S. Seitz and R. Szeliski. Communications of the ACM, Vol. 54, No. 10, Pages 105-112,&nbsp;October 2011 [http://grail.cs.washington.edu/projects/rome/rome_paper.pdf link]  
*J. Frahm, P. Georgel, D. Gallup, T. Johnson, R. Raguram, C. Wu, Y. Jen,&nbsp;E. Dunn, B. Clipp, S. Lazebnik, and M. Pollefeys. "Building Rome on a Cloudless Day". ECCV2010 [http://www.cs.illinois.edu/homes/slazebni/publications/eccv10_rome.pdf link]
*J. Frahm, P. Georgel, D. Gallup, T. Johnson, R. Raguram, C. Wu, Y. Jen,&nbsp;E. Dunn, B. Clipp, S. Lazebnik, and M. Pollefeys. "Building Rome on a Cloudless Day". ECCV2010 [http://www.cs.illinois.edu/homes/slazebni/publications/eccv10_rome.pdf link]
Line 53: Line 71:
| 10/9  
| 10/9  
| Representing the 3D Space  
| Representing the 3D Space  
-
| &nbsp;
+
| Matt Swaner Vitelli
| Indoor scene layout reconstruction:  
| Indoor scene layout reconstruction:  
-
*D. Lee, T. Kanadem, and M. Hebert. "3D Scene&nbsp;Analysis". CVPR2009 [http://www.cs.cmu.edu/~dclee/pub/cvpr09lee.pdf link]  
+
*D. Lee, T. Kanade, and M. Hebert. "3D Scene&nbsp;Analysis". CVPR2009 [http://www.cs.cmu.edu/~dclee/pub/cvpr09lee.pdf link]  
-
*A. Schwing and R. Urtasun. "Efficient Exact Inference for 3D&nbsp;Indoor Scene Understanding". ECCV2012 [http://www.alexander-schwing.de/papers/SchwingEtAl_ECCV2012.pdf link]
+
*A. Schwing and R. Urtasun. "Efficient Exact Inference for 3D&nbsp;Indoor Scene Understanding". ECCV2012 [http://www.alexander-schwing.de/papers/SchwingEtAl_ECCV2012.pdf link]
 +
*D. Hoiem, A.A. Efros, and M. Hebert, "Geometric Context from a Single Image", ICCV 2005 [http://www.cs.uiuc.edu/~dhoiem/publications/Hoiem_Geometric.pdf link]
|-
|-
Line 62: Line 81:
| 10/14  
| 10/14  
| Understanding Complex Scenes  
| Understanding Complex Scenes  
-
| &nbsp;  
+
| &nbsp;David Joseph Mandle and&nbsp;Serena Yu-Ching Yeung
| Joint 3D reconstruction and object detection:  
| Joint 3D reconstruction and object detection:  
*D. Hoiem, A. Efros, and M. Hebert, "Putting Objects in&nbsp;Perspective". CVPR 2006 [http://www.cs.uiuc.edu/~dhoiem/publications/hoiem_cvpr06.pdf link]  
*D. Hoiem, A. Efros, and M. Hebert, "Putting Objects in&nbsp;Perspective". CVPR 2006 [http://www.cs.uiuc.edu/~dhoiem/publications/hoiem_cvpr06.pdf link]  
Line 72: Line 91:
| Understanding Complex Scenes  
| Understanding Complex Scenes  
| Guest lecturer: B. Kim  
| Guest lecturer: B. Kim  
-
| 2D/3D CRF models fo understanding:  
+
| 2D/3D CRF models for scene understanding:  
*B. Kim, M. Sun ,P. Kohli, and S. Savarese. "Relating things and stuff&nbsp;by High-Order Poterntial modeling". ECCV2012 [http://cvgl.stanford.edu/papers/kim_hipotws_eccv12.pdf link]
*B. Kim, M. Sun ,P. Kohli, and S. Savarese. "Relating things and stuff&nbsp;by High-Order Poterntial modeling". ECCV2012 [http://cvgl.stanford.edu/papers/kim_hipotws_eccv12.pdf link]
|-
|-
| &nbsp;  
| &nbsp;  
-
| <span style="color: rgb(255, 0, 0);">'''10/17&nbsp;'''
+
| <span style="color: rgb(255, 0, 0);">'''10/17&nbsp;
-
 
+
-
 
+
-
 
+
-
 
+
 +
'''
Line 103: Line 119:
| 10/23  
| 10/23  
| Understanding Complex Scenes  
| Understanding Complex Scenes  
-
| &nbsp;  
+
| &nbsp;Kevin Jared Miller
-
| Tutorial on MCMC:  
+
|  
 +
Joint segmentation and reconstruction from videos:
 +
 
 +
*Tutorial on MCMC [http://www.kev-smith.com/tutorial/rjmcmc.php link]
*C. Wojek, S. Roth, K. Schindler, B. Schiele.&nbsp;"Monocular 3D Scene Modeling and Inference: Understanding&nbsp;Multi-Object Traffic Scenes". [http://domino.mpi-inf.mpg.de/intranet/d2/d2publ.nsf/0/ac2eb7bd29ab4279c12578110057e2d9/$FILE/wojek2010eccv.pdf link]
*C. Wojek, S. Roth, K. Schindler, B. Schiele.&nbsp;"Monocular 3D Scene Modeling and Inference: Understanding&nbsp;Multi-Object Traffic Scenes". [http://domino.mpi-inf.mpg.de/intranet/d2/d2publ.nsf/0/ac2eb7bd29ab4279c12578110057e2d9/$FILE/wojek2010eccv.pdf link]
Line 111: Line 130:
| 10/28  
| 10/28  
| Understanding Complex Scenes  
| Understanding Complex Scenes  
-
| &nbsp;  
+
| &nbsp;David Joseph Mandle and&nbsp;Serena Yu-Ching Yeung
| Joint segmentation and reconstruction from D-RGB data (I):  
| Joint segmentation and reconstruction from D-RGB data (I):  
*N. Silberman, D. Hoiem, P. Kohli, R. Fergus. "Indoor Segmentation and Support Inference from&nbsp;RGBD Images". ECCV2012 [http://cs.nyu.edu/~silberman/papers/indoor_seg_support.pdf link]  
*N. Silberman, D. Hoiem, P. Kohli, R. Fergus. "Indoor Segmentation and Support Inference from&nbsp;RGBD Images". ECCV2012 [http://cs.nyu.edu/~silberman/papers/indoor_seg_support.pdf link]  
Line 120: Line 139:
| 10/30  
| 10/30  
| Understanding Complex Scenes  
| Understanding Complex Scenes  
-
| &nbsp;
+
| Kevin Jared Miller and Caleb Stephen Jordan
| Joint segmentation and reconstruction from D-RGB data (II):  
| Joint segmentation and reconstruction from D-RGB data (II):  
-
*D. Munoz, J. Bagnell, and M. Hebert. "Stacked Hierarchical Labeling". ECCV2010 [http://www.ri.cmu.edu/pub_files/2010/9/munoz_eccv_10.pdf link]
+
*D. Munoz, J. Bagnell, and M. Hebert. "Stacked Hierarchical Labeling". ECCV2010 [http://www.ri.cmu.edu/pub_files/2010/9/munoz_eccv_10.pdf link]  
-
*D. Munoz, J. Bagnell, and M. Hebert.&nbsp;"Co-inference for Multi-modal Scene Analysis". ECCV2012
+
*D. Munoz, J. Bagnell, and M. Hebert.&nbsp;"Co-inference for Multi-modal Scene Analysis". ECCV2012 [http://www.ri.cmu.edu/pub_files/2012/10/munoz_eccv_12.pdf link]
|-
|-
Line 132: Line 151:
| Joint segmentation and reconstruction from D-RGB data (III):  
| Joint segmentation and reconstruction from D-RGB data (III):  
*C. Häne, C. Zach, A. Cohen, R. Angst, and M. Pollefeys. "Joint 3D scene reconstruction and&nbsp;class segmentation".&nbsp;[http://www.inf.ethz.ch/personal/chaene/publications/haene2013joint.pdf link]
*C. Häne, C. Zach, A. Cohen, R. Angst, and M. Pollefeys. "Joint 3D scene reconstruction and&nbsp;class segmentation".&nbsp;[http://www.inf.ethz.ch/personal/chaene/publications/haene2013joint.pdf link]
 +
 +
|-
 +
| 15
 +
|
 +
<span style="color: rgb(0, 0, 255);">'''11/4'''</span>
 +
 +
| Understanding Human Activities
 +
| &nbsp;Sam James Corbett-Davies and&nbsp;Christopher Choy and Kyunghee Kim
 +
| Human pose estimation and activity recognition from D-RGB&nbsp;<span style="line-height: 1.5em;">data:</span>
 +
*Tutorial on Randomized decision trees and forests [http://www.iis.ee.ic.ac.uk/~tkkim/iccv09_tutorial link]
 +
*J. Shotton, A. Fitzgibbon, M. Cook, T. Sharp,&nbsp;M. Finocchio, R. Moore, A. Kipman, and A. Blake. "Real Time human pose recognition in parts from single&nbsp;depth image". CVPR2011 [http://research.microsoft.com/pubs/145347/BodyPartRecognition.pdf link]
 +
*J. Wang, Z. Liu, Y. Wu, and J. Yuan.&nbsp;"Mining Actionlet Ensemble for Action Recognition with&nbsp;Depth Cameras".&nbsp;[http://research.microsoft.com/en-us/um/people/zliu/papers/joint_modeling_final.pdf link]
|-
|-
| &nbsp;  
| &nbsp;  
| <span style="color: rgb(255, 0, 0);">'''11/5'''&nbsp;
| <span style="color: rgb(255, 0, 0);">'''11/5'''&nbsp;
-
 
-
 
-
 
-
 
-
 
-
 
-
 
-
 
-
 
</span>
</span>
| <span style="color: rgb(255, 0, 0);">''' Project Mid-term Report due'''</span>  
| <span style="color: rgb(255, 0, 0);">''' Project Mid-term Report due'''</span>  
Line 151: Line 173:
|-
|-
| 14  
| 14  
-
| 11/6  
+
|  
 +
11/6  
 +
 
| Understanding Human Pose  
| Understanding Human Pose  
-
| &nbsp;  
+
| Vivardhan Kanoria and&nbsp;Devin LaSalle Guillory
-
| Human pose estimation and activity recognition from 2D  
+
| Human pose estimation and activity recognition from 2D&nbsp;<span style="line-height: 1.5em;">images:</span>
-
images:  
+
-
 
+
*L. Bourdev, and J. Malik. "Poselets: Body Part&nbsp;Detectors Trained Using 3D Human Pose Annotations". ICCV2009 [http://www.eecs.berkeley.edu/Research/Projects/CS/vision/human/poselets_iccv09.pdf link]  
*L. Bourdev, and J. Malik. "Poselets: Body Part&nbsp;Detectors Trained Using 3D Human Pose Annotations". ICCV2009 [http://www.eecs.berkeley.edu/Research/Projects/CS/vision/human/poselets_iccv09.pdf link]  
*L. Bourdev, S. Maji, T. Brox, and J. Malik. "Detecting People Using Mutually Consistent Poselet&nbsp;Activations". ECCV2010 [http://www.cs.berkeley.edu/~lbourdev/poselets/poselets-eccv10.pdf link]  
*L. Bourdev, S. Maji, T. Brox, and J. Malik. "Detecting People Using Mutually Consistent Poselet&nbsp;Activations". ECCV2010 [http://www.cs.berkeley.edu/~lbourdev/poselets/poselets-eccv10.pdf link]  
-
*B. Yao and L. Fei-Fei.&nbsp;"Action Recognition with Exemplar Based 2.5D Graph&nbsp;Matching". ECCV2012
+
*B. Yao and L. Fei-Fei.&nbsp;"Action Recognition with Exemplar Based 2.5D Graph&nbsp;Matching". ECCV2012 [http://vision.stanford.edu/documents/YaoFei-Fei_ECCV12.pdf link]
-
 
+
-
|-
+
-
| 15
+
-
| tba
+
-
| Understanding Human Activities
+
-
| &nbsp;
+
-
| Human pose estimation and activity recognition from D-RGB
+
-
data:
+
-
 
+
-
*Tutorial on Randomized decision trees and forests
+
-
*J. Shotton, A. Fitzgibbon, M. Cook, T. Sharp,&nbsp;M. Finocchio, R. Moore, A. Kipman, and A. Blake. "Real Time human pose recognition in parts from single&nbsp;depth image". CVPR2011 [http://research.microsoft.com/pubs/145347/BodyPartRecognition.pdf link]
+
-
*J. Wang, Z. Liu, Y. Wu, and J. Yuan.&nbsp;"Mining Actionlet Ensemble for Action Recognition with&nbsp;Depth Cameras".&nbsp;[http://research.microsoft.com/en-us/um/people/zliu/papers/joint_modeling_final.pdf link]
+
|-
|-
Line 180: Line 190:
| Recognizing collective activities:  
| Recognizing collective activities:  
*W. Choi, K. Shahid, and S. Savarese "What are they doing?: Collective Activity Classification Using Spatio-Temporal&nbsp;Relationship Among People". VSWS2009 and ICCV2009 [http://www.eecs.umich.edu/vision/paper/Wongun_CollectiveActivityRecognition09.pdf link]  
*W. Choi, K. Shahid, and S. Savarese "What are they doing?: Collective Activity Classification Using Spatio-Temporal&nbsp;Relationship Among People". VSWS2009 and ICCV2009 [http://www.eecs.umich.edu/vision/paper/Wongun_CollectiveActivityRecognition09.pdf link]  
-
*W. Choi and S. Savarese "A Unified Framework for MultiTarget Tracking and Collective Activity Recognition". ECCV2012
+
*W. Choi and S. Savarese "A Unified Framework for MultiTarget Tracking and Collective Activity Recognition". ECCV2012 [http://www-personal.umich.edu/~wgchoi/choi_eccv_12.pdf link]
|-
|-
| 17  
| 17  
-
| 11/13
 
-
| Understanding Human Activities
 
-
| &nbsp;
 
|  
|  
-
*J. Gall, A. Fossati, and L. van Gool. (2011). "Functional&nbsp;categorization of objects using real-time markerless motion&nbsp;capture". CVPR2011  
+
'''<span style="color: rgb(0, 0, 255);">11/11 </span>'''
-
*H. Koppula, R. Gupta, and A. Saxena.&nbsp;"Learning Human Activities and Object Affordances from&nbsp;RGB-D Videos". IJRR2013<br>
+
 
 +
'''<span style="color: rgb(0, 0, 255);">7pm </span>'''
 +
 
 +
'''<span style="color: rgb(0, 0, 255);">380-380c</span>'''
 +
 
 +
| Understanding Human Activities
 +
| &nbsp;Sam James Corbett-Davies and&nbsp;Vivardhan Kanoria
 +
| Joint estimation of object/scene affordances and human activities (I):<br>
 +
*J. Gall, A. Fossati, and L. van Gool. (2011). "Functional Categorization of Objects Using Real-Time Markerless Motion Capture". CVPR2011 [http://www.vision.ee.ethz.ch/~gallju/download/jgall_dynamiccat_cvpr11.pdf link]
 +
*H. Koppula, R. Gupta, and A. Saxena.&nbsp;"Learning Human Activities and Object Affordances from&nbsp;RGB-D Videos". IJRR2013 [http://arxiv.org/pdf/1210.1207v2.pdf link]<br>
|-
|-
| 18  
| 18  
-
| tba
+
| 11/13
| Understanding Human Activities  
| Understanding Human Activities  
-
| &nbsp;  
+
| &nbsp;Christopher Choy and&nbsp;Ziang Xie and Kyunghee Kim
-
| Joint estimation of object/scene affordances and human  
+
| Joint estimation of object/scene affordances and human activities (II):<br>
-
pose:  
+
-
 
+
*V. Delatitre, D. Fouhey, I. Laptec, Sivic, A. Gupta, and A. A. Efros. "Scene semantics from long term observation of&nbsp;people". ECCV2012 [http://repository.cmu.edu/cgi/viewcontent.cgi?article=1775&context=robotics link]  
*V. Delatitre, D. Fouhey, I. Laptec, Sivic, A. Gupta, and A. A. Efros. "Scene semantics from long term observation of&nbsp;people". ECCV2012 [http://repository.cmu.edu/cgi/viewcontent.cgi?article=1775&context=robotics link]  
-
*A. Gupta, S. Satkin, A. A. Efros, and M. Hebert. "From 3D scene&nbsp;geometry to Human workspace". CVPR2011 [http://www.cs.cmu.edu/~abhinavg/papers/0586.pdf link]  
+
*A. Gupta, S. Satkin, A. A. Efros, and M. Hebert. "From 3D scene&nbsp;geometry to Human workspace". CVPR2011 [http://www.cs.cmu.edu/~abhinavg/papers/0586.pdf link]<br>
-
*D. Fouhey, V. Delaitre, A. Gupta, A. A. Efros, I. Laptev, and J. Sivic.&nbsp;"People Watching: Human Actions as a Cue for Single View&nbsp;Geometry". &nbsp;ECCV2012
+
*D. Fouhey, V. Delaitre, A. Gupta, A. A. Efros, I. Laptev, and J. Sivic.&nbsp;"People Watching: Human Actions as a Cue for Single View Geometry".&nbsp;ECCV2012 [http://graphics.cs.cmu.edu/projects/peopleWatching/dfouhey_people.pdf link]
|-
|-
-
| '''<span style="color: rgb(255, 0, 0);">19</span><br>'''  
+
| '''<br>'''
-
| '''<span style="color: rgb(255, 0, 0);"> 11/18&nbsp;</span><br>'''  
+
| '''<span style="color: rgb(255, 0, 0);"> </span>'''11/18'''<span style="color: rgb(255, 0, 0);">&nbsp;</span><br>'''  
-
| '''<span style="color: rgb(255, 0, 0);"> Final Project Presentations</span><br>'''  
+
| '''<span style="color: rgb(255, 0, 0);"> </span>'''no class'''<br>'''  
| &nbsp;  
| &nbsp;  
| &nbsp;
| &nbsp;
|-
|-
-
| '''<span style="color: rgb(255, 0, 0);"> 20</span><br>'''  
+
| '''<br>'''  
-
| '''<span style="color: rgb(255, 0, 0);"> 11/20&nbsp;</span><br>'''  
+
| '''<span style="color: rgb(255, 0, 0);"> </span>'''11/20'''<span style="color: rgb(255, 0, 0);">&nbsp;</span><br>'''  
-
| '''<span style="color: rgb(255, 0, 0);"> Final Project Presentation</span><span style="color: rgb(255, 0, 0);">s</span>'''  
+
| no class
| &nbsp;  
| &nbsp;  
| &nbsp;
| &nbsp;
Line 228: Line 242:
| &nbsp;
| &nbsp;
|-
|-
-
| &nbsp;  
+
| <span style="color: rgb(255, 0, 0);">'''&nbsp; 19'''</span>
-
| 12/2  
+
| <span style="color: rgb(255, 0, 0);">''' 12/2
 +
 
 +
'''
 +
 
 +
 
 +
 
 +
 
 +
</span>
|  
|  
-
no class due to ICCV<br>  
+
<span style="color: rgb(255, 0, 0);">'''Final Project Presentations'''
 +
</span><br>  
-
| &nbsp;  
+
| <span style="color: rgb(255, 0, 0);">'''
-
| &nbsp;
+
 
 +
 
 +
'''
 +
 
 +
</span>
 +
| <span style="color: rgb(255, 0, 0);">''' &nbsp;'''
 +
 
 +
 
 +
 
 +
 
 +
 
 +
 
 +
</span>
|-
|-
-
| &nbsp;  
+
| <span style="color: rgb(255, 0, 0);">'''&nbsp; 20'''</span>
-
| 12/4
+
|  
-
| no class due to ICCV
+
<span style="color: rgb(255, 0, 0);">'''12/2'''</span>
-
| &nbsp;  
+
 
 +
<span style="color: rgb(255, 0, 0);">'''7pm @ Gates 219'''</span>
 +
 
 +
|  
 +
<span style="color: rgb(255, 0, 0);">''' Final Project Presentations'''</span>
 +
 
 +
| <span style="color: rgb(255, 0, 0);">''' &nbsp;'''</span>
| &nbsp;&nbsp;
| &nbsp;&nbsp;
|-
|-
| &nbsp;  
| &nbsp;  
-
| '''<span style="color: rgb(255, 0, 0);">12/13&nbsp;</span><br>'''
+
|  
-
| '''<span style="color: rgb(255, 0, 0);">Final Project Report Due</span>'''
+
12/4<br>  
 +
 
 +
| no class due to ICCV
| &nbsp;  
| &nbsp;  
| &nbsp;
| &nbsp;
|-
|-
| &nbsp;  
| &nbsp;  
-
| &nbsp;  
+
| &nbsp; '''<span style="color: rgb(255, 0, 0);">12/13</span>'''
-
| &nbsp;  
+
| &nbsp;<span style="color: rgb(255, 0, 0);">''' Final Project Report Due'''</span>
| &nbsp;  
| &nbsp;  
| &nbsp;
| &nbsp;

Latest revision as of 15:32, 12 November 2013

Lect. Date Topics Presenter Paper/Slides
1 9/23 Introductions lecture slides
2 9/25 Representing and Recognizing Objects
Guest lecturer: Y. Xiang

3D object recognition by aspect graph-based models:

lecture slides

  • Y.Xiang and S.Savarese. "Estimating the aspect layout of object categories". CVPR2012 link
3 9 /30 Representing and Recognizing Objects Olga Russakovsky and Caleb Jordan

3D detection from D-RGB data:

slides

  • L. Bo, K. Lai, Xiaofeng Ren, and D. Fox "Object Recognition with Hierarchical Kernel Descriptors" link
  • L. Bo, X. Ren, and D. Fox. "Kernel Descriptors for Visual Recognition". NIPS, December 2010 link
  • K. Lai, L. Bo, X. Ren, and D. Fox. "A Large-Scale Hierarchical Multi-View RGB-D Object Dataset". IEEE International Conference on on Robotics and Automation, 2011 link
4 10/2 Representing and Recognizing Objects Guest lecturer: Dr. M. Stark

3D object recognition by deformable part models:

slides

  • B. Pepik, M. Stark, P. Gehler and B. Schiele, "Teaching 3D geometry to deformable part models". CVPR2012 link
5 10/7 Representing the 3D Space

 Devin LaSalle Guillory and Ziang Xie

Large scale 3D reconstruction by structure from motion :

slides 1

slides 2

  • S. Agarwal, Y. Furukawa, N. Snavely, I. Simon, B. Curless, S. Seitz and R. Szeliski. Communications of the ACM, Vol. 54, No. 10, Pages 105-112, October 2011 link
  • J. Frahm, P. Georgel, D. Gallup, T. Johnson, R. Raguram, C. Wu, Y. Jen, E. Dunn, B. Clipp, S. Lazebnik, and M. Pollefeys. "Building Rome on a Cloudless Day". ECCV2010 link
6 10/9 Representing the 3D Space Matt Swaner Vitelli Indoor scene layout reconstruction:
  • D. Lee, T. Kanade, and M. Hebert. "3D Scene Analysis". CVPR2009 link
  • A. Schwing and R. Urtasun. "Efficient Exact Inference for 3D Indoor Scene Understanding". ECCV2012 link
  • D. Hoiem, A.A. Efros, and M. Hebert, "Geometric Context from a Single Image", ICCV 2005 link
7 10/14 Understanding Complex Scenes  David Joseph Mandle and Serena Yu-Ching Yeung Joint 3D reconstruction and object detection:
  • D. Hoiem, A. Efros, and M. Hebert, "Putting Objects in Perspective". CVPR 2006 link
  • V. Hedau, D. Hoiem, and D. Forsyth. "Thinking Inside the Box: Using Appearance Models and Context Based on Room Geometry". ECCV 2010 link
8 10/16 Understanding Complex Scenes Guest lecturer: B. Kim 2D/3D CRF models for scene understanding:
  • B. Kim, M. Sun ,P. Kohli, and S. Savarese. "Relating things and stuff by High-Order Poterntial modeling". ECCV2012 link
  10/17 



Project Proposal due    
9 10/21 Understanding Complex Scenes Guest lecturer: Y. Bao Semantic structure from motion:
  • S. Bao and S. Savarese, "Semantic Structure from Motion". CVPR2011 link
10 10/23 Understanding Complex Scenes  Kevin Jared Miller

Joint segmentation and reconstruction from videos:

  • Tutorial on MCMC link
  • C. Wojek, S. Roth, K. Schindler, B. Schiele. "Monocular 3D Scene Modeling and Inference: Understanding Multi-Object Traffic Scenes". link
11 10/28 Understanding Complex Scenes  David Joseph Mandle and Serena Yu-Ching Yeung Joint segmentation and reconstruction from D-RGB data (I):
  • N. Silberman, D. Hoiem, P. Kohli, R. Fergus. "Indoor Segmentation and Support Inference from RGBD Images". ECCV2012 link
  • S. Gupta, P. Arbelaez, and J. Malik. "Perceptual Organization and Recognition of Indoor Scenes from RGBD Images". CVPR2013 (oral). link
12 10/30 Understanding Complex Scenes Kevin Jared Miller and Caleb Stephen Jordan Joint segmentation and reconstruction from D-RGB data (II):
  • D. Munoz, J. Bagnell, and M. Hebert. "Stacked Hierarchical Labeling". ECCV2010 link
  • D. Munoz, J. Bagnell, and M. Hebert. "Co-inference for Multi-modal Scene Analysis". ECCV2012 link
13 11/4 Understanding Human Pose Guest lecturer: Dr. Roland Angst Joint segmentation and reconstruction from D-RGB data (III):
  • C. Häne, C. Zach, A. Cohen, R. Angst, and M. Pollefeys. "Joint 3D scene reconstruction and class segmentation". link
15

11/4

Understanding Human Activities  Sam James Corbett-Davies and Christopher Choy and Kyunghee Kim Human pose estimation and activity recognition from D-RGB data:
  • Tutorial on Randomized decision trees and forests link
  • J. Shotton, A. Fitzgibbon, M. Cook, T. Sharp, M. Finocchio, R. Moore, A. Kipman, and A. Blake. "Real Time human pose recognition in parts from single depth image". CVPR2011 link
  • J. Wang, Z. Liu, Y. Wu, and J. Yuan. "Mining Actionlet Ensemble for Action Recognition with Depth Cameras". link
  11/5 

Project Mid-term Report due     
14

11/6

Understanding Human Pose Vivardhan Kanoria and Devin LaSalle Guillory Human pose estimation and activity recognition from 2D images:
  • L. Bourdev, and J. Malik. "Poselets: Body Part Detectors Trained Using 3D Human Pose Annotations". ICCV2009 link
  • L. Bourdev, S. Maji, T. Brox, and J. Malik. "Detecting People Using Mutually Consistent Poselet Activations". ECCV2010 link
  • B. Yao and L. Fei-Fei. "Action Recognition with Exemplar Based 2.5D Graph Matching". ECCV2012 link
16 11/11 Understanding Human Activities Guest lecturer: Dr. W. Choi  Recognizing collective activities:
  • W. Choi, K. Shahid, and S. Savarese "What are they doing?: Collective Activity Classification Using Spatio-Temporal Relationship Among People". VSWS2009 and ICCV2009 link
  • W. Choi and S. Savarese "A Unified Framework for MultiTarget Tracking and Collective Activity Recognition". ECCV2012 link
17

11/11

7pm

380-380c

Understanding Human Activities  Sam James Corbett-Davies and Vivardhan Kanoria Joint estimation of object/scene affordances and human activities (I):
  • J. Gall, A. Fossati, and L. van Gool. (2011). "Functional Categorization of Objects Using Real-Time Markerless Motion Capture". CVPR2011 link
  • H. Koppula, R. Gupta, and A. Saxena. "Learning Human Activities and Object Affordances from RGB-D Videos". IJRR2013 link
18 11/13 Understanding Human Activities  Christopher Choy and Ziang Xie and Kyunghee Kim Joint estimation of object/scene affordances and human activities (II):
  • V. Delatitre, D. Fouhey, I. Laptec, Sivic, A. Gupta, and A. A. Efros. "Scene semantics from long term observation of people". ECCV2012 link
  • A. Gupta, S. Satkin, A. A. Efros, and M. Hebert. "From 3D scene geometry to Human workspace". CVPR2011 link
  • D. Fouhey, V. Delaitre, A. Gupta, A. A. Efros, I. Laptev, and J. Sivic. "People Watching: Human Actions as a Cue for Single View Geometry". ECCV2012 link

11/18 
no class
   

11/20 
no class    
  11/25 no class - Thanksgiving Break    
  11/27 no class - Thanksgiving Break    
  19 12/2



Final Project Presentations


 




  20

12/2

7pm @ Gates 219

Final Project Presentations

    
 

12/4

no class due to ICCV    
    12/13   Final Project Report Due    
         
         
         



Personal tools