Lecture Schedule

From cs331b Special Topics in 3dRR

Latest revision as of 15:32, 12 November 2013

Lect. Date Topics Presenter Paper/Slides
1 9/23 Introductions [http://www.stanford.edu/class/archive/cs/cs331b/cs331b.1142/wikiupload/5/5a/Lecture1.pdf lecture slides]
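2 9/25 Representing and Recognizing Objects Guest lecturer: Y. Xiang

3D object recognition by aspect graph-based models:

[http://www.stanford.edu/class/archive/cs/cs331b/cs331b.1142/wikiupload/8/83/Lecture2.pdf lecture slides]

  • Y. Xiang and S. Savarese. "Estimating the aspect layout of object categories". CVPR2012 [http://cvgl.stanford.edu/papers/xiang_cvpr12.pdf link]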
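3 9/30 Representing and Recognizing Objects Olga Russakovsky and Caleb Jordan

3D detection from D-RGB data:

[http://www.stanford.edu/class/archive/cs/cs331b/cs331b.1142/wikiupload/c/c0/Lecture3.pdf slides]

  • L. Bo, K. Lai, X. Ren, and D. Fox. "Object Recognition with Hierarchical Kernel Descriptors" [https://homes.cs.washington.edu/~xren/publication/bo_cvpr11_hkdes.pdf link]
  • L. Bo, X. Ren, and D. Fox. "Kernel Descriptors for Visual Recognition". NIPS, December 2010 [http://www.cs.washington.edu/ai/Mobile_Robotics/postscripts/kdes-nips-10.pdf link]
  • K. Lai, L. Bo, X. Ren, and D. Fox. "A Large-Scale Hierarchical Multi-View RGB-D Object Dataset". IEEE International Conference on Robotics and Automation, 2011 [http://www.cs.washington.edu/ai/Mobile_Robotics/postscripts/rgbd-dataset-icra-11.pdf link]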
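4 10/2 Representing and Recognizing Objects Guest lecturer: Dr. M. Stark

3D object recognition by deformable part models:

[http://www.stanford.edu/class/archive/cs/cs331b/cs331b.1142/wikiupload/e/e5/Lecture4.pdf slides]

  • B. Pepik, M. Stark, P. Gehler, and B. Schiele. "Teaching 3D geometry to deformable part models". CVPR2012 [http://www.d2.mpi-inf.mpg.de/sites/default/files/pepik12cvpr.pdf link]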
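5 10/7 Representing the 3D Space Devin LaSalle Guillory and Ziang Xie

Large-scale 3D reconstruction by structure from motion:

[http://www.stanford.edu/class/archive/cs/cs331b/cs331b.1142/wikiupload/6/64/Lecture5_2.pdf slides 1]

[http://www.stanford.edu/class/archive/cs/cs331b/cs331b.1142/wikiupload/d/d8/Lecture5.pdf slides 2]

  • S. Agarwal, Y. Furukawa, N. Snavely, I. Simon, B. Curless, S. Seitz, and R. Szeliski. "Building Rome in a Day". Communications of the ACM, Vol. 54, No. 10, Pages 105-112, October 2011 [http://grail.cs.washington.edu/projects/rome/rome_paper.pdf link]
  • J. Frahm, P. Georgel, D. Gallup, T. Johnson, R. Raguram, C. Wu, Y. Jen, E. Dunn, B. Clipp, S. Lazebnik, and M. Pollefeys. "Building Rome on a Cloudless Day". ECCV2010 [http://www.cs.illinois.edu/homes/slazebni/publications/eccv10_rome.pdf link]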
6 10/9 Representing the 3D Space Matt Swaner Vitelli Indoor scene layout reconstruction:
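  • D. Lee, T. Kanade, and M. Hebert. "3D Scene Analysis". CVPR2009 [http://www.cs.cmu.edu/~dclee/pub/cvpr09lee.pdf link]
  • A. Schwing and R. Urtasun. "Efficient Exact Inference for 3D Indoor Scene Understanding". ECCV2012 [http://www.alexander-schwing.de/papers/SchwingEtAl_ECCV2012.pdf link]
  • D. Hoiem, A. A. Efros, and M. Hebert. "Geometric Context from a Single Image". ICCV2005 [http://www.cs.uiuc.edu/~dhoiem/publications/Hoiem_Geometric.pdf link]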
7 10/14 Understanding Complex Scenes David Joseph Mandle and Serena Yu-Ching Yeung Joint 3D reconstruction and object detection:
  • D. Hoiem, A. Efros, and M. Hebert. "Putting Objects in Perspective". CVPR2006 [http://www.cs.uiuc.edu/~dhoiem/publications/hoiem_cvpr06.pdf link]
  • V. Hedau, D. Hoiem, and D. Forsyth. "Thinking Inside the Box: Using Appearance Models and Context Based on Room Geometry". ECCV2010 [http://www.cs.uiuc.edu/~dhoiem/publications/eccv2010_InsideTheBox_varsha link]
8 10/16 Understanding Complex Scenes Guest lecturer: B. Kim 2D/3D CRF models for scene understanding:
  • B. Kim, M. Sun, P. Kohli, and S. Savarese. "Relating things and stuff by High-Order Potential modeling". ECCV2012 [http://cvgl.stanford.edu/papers/kim_hipotws_eccv12.pdf link]
  10/17 Project Proposal due
9 10/21 Understanding Complex Scenes Guest lecturer: Y. Bao Semantic structure from motion:
  • S. Bao and S. Savarese. "Semantic Structure from Motion". CVPR2011 [http://www.eecs.umich.edu/vision/papers/bao_ssfm_cvpr2011.pdf link]
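10 10/23 Understanding Complex Scenes Kevin Jared Miller

Joint segmentation and reconstruction from videos:

  • Tutorial on MCMC [http://www.kev-smith.com/tutorial/rjmcmc.php link]
  • C. Wojek, S. Roth, K. Schindler, and B. Schiele. "Monocular 3D Scene Modeling and Inference: Understanding Multi-Object Traffic Scenes". [http://domino.mpi-inf.mpg.de/intranet/d2/d2publ.nsf/0/ac2eb7bd29ab4279c12578110057e2d9/$FILE/wojek2010eccv.pdf link]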
11 10/28 Understanding Complex Scenes David Joseph Mandle and Serena Yu-Ching Yeung Joint segmentation and reconstruction from D-RGB data (I):
  • N. Silberman, D. Hoiem, P. Kohli, and R. Fergus. "Indoor Segmentation and Support Inference from RGBD Images". ECCV2012 [http://cs.nyu.edu/~silberman/papers/indoor_seg_support.pdf link]
  • S. Gupta, P. Arbelaez, and J. Malik. "Perceptual Organization and Recognition of Indoor Scenes from RGBD Images". CVPR2013 (oral). [http://www.cs.berkeley.edu/~arbelaez/publications/gam_cvpr2013.pdf link]
12 10/30 Understanding Complex Scenes Kevin Jared Miller and Caleb Stephen Jordan Joint segmentation and reconstruction from D-RGB data (II):
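  • D. Munoz, J. Bagnell, and M. Hebert. "Stacked Hierarchical Labeling". ECCV2010 [http://www.ri.cmu.edu/pub_files/2010/9/munoz_eccv_10.pdf link]
  • D. Munoz, J. Bagnell, and M. Hebert. "Co-inference for Multi-modal Scene Analysis". ECCV2012 [http://www.ri.cmu.edu/pub_files/2012/10/munoz_eccv_12.pdf link]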
13 11/4 Understanding Human Pose Guest lecturer: Dr. Roland Angst Joint segmentation and reconstruction from D-RGB data (III):
  • C. Häne, C. Zach, A. Cohen, R. Angst, and M. Pollefeys. "Joint 3D scene reconstruction and class segmentation". [http://www.inf.ethz.ch/personal/chaene/publications/haene2013joint.pdf link]
15 11/4 Understanding Human Activities Sam James Corbett-Davies, Christopher Choy, and Kyunghee Kim Human pose estimation and activity recognition from D-RGB data:
  • Tutorial on Randomized decision trees and forests [http://www.iis.ee.ic.ac.uk/~tkkim/iccv09_tutorial link]
  • J. Shotton, A. Fitzgibbon, M. Cook, T. Sharp, M. Finocchio, R. Moore, A. Kipman, and A. Blake. "Real Time human pose recognition in parts from single depth image". CVPR2011 [http://research.microsoft.com/pubs/145347/BodyPartRecognition.pdf link]
  • J. Wang, Z. Liu, Y. Wu, and J. Yuan. "Mining Actionlet Ensemble for Action Recognition with Depth Cameras". [http://research.microsoft.com/en-us/um/people/zliu/papers/joint_modeling_final.pdf link]
  11/5 Project Mid-term Report due
14 11/6 Understanding Human Pose Vivardhan Kanoria and Devin LaSalle Guillory Human pose estimation and activity recognition from 2D images:
  • L. Bourdev and J. Malik. "Poselets: Body Part Detectors Trained Using 3D Human Pose Annotations". ICCV2009 [http://www.eecs.berkeley.edu/Research/Projects/CS/vision/human/poselets_iccv09.pdf link]
  • L. Bourdev, S. Maji, T. Brox, and J. Malik. "Detecting People Using Mutually Consistent Poselet Activations". ECCV2010 [http://www.cs.berkeley.edu/~lbourdev/poselets/poselets-eccv10.pdf link]
  • B. Yao and L. Fei-Fei. "Action Recognition with Exemplar Based 2.5D Graph Matching". ECCV2012 [http://vision.stanford.edu/documents/YaoFei-Fei_ECCV12.pdf link]
16 11/11 Understanding Human Activities Guest lecturer: Dr. W. Choi Recognizing collective activities:
  • W. Choi, K. Shahid, and S. Savarese. "What are they doing?: Collective Activity Classification Using Spatio-Temporal Relationship Among People". VSWS2009 and ICCV2009 [http://www.eecs.umich.edu/vision/paper/Wongun_CollectiveActivityRecognition09.pdf link]
  • W. Choi and S. Savarese. "A Unified Framework for MultiTarget Tracking and Collective Activity Recognition". ECCV2012 [http://www-personal.umich.edu/~wgchoi/choi_eccv_12.pdf link]
17 11/11 7pm 380-380c Understanding Human Activities Sam James Corbett-Davies and Vivardhan Kanoria Joint estimation of object/scene affordances and human activities (I):
  • J. Gall, A. Fossati, and L. van Gool. "Functional Categorization of Objects Using Real-Time Markerless Motion Capture". CVPR2011 [http://www.vision.ee.ethz.ch/~gallju/download/jgall_dynamiccat_cvpr11.pdf link]
  • H. Koppula, R. Gupta, and A. Saxena. "Learning Human Activities and Object Affordances from RGB-D Videos". IJRR2013 [http://arxiv.org/pdf/1210.1207v2.pdf link]
18 11/13 Understanding Human Activities Christopher Choy, Ziang Xie, and Kyunghee Kim Joint estimation of object/scene affordances and human activities (II):
  • V. Delaitre, D. Fouhey, I. Laptev, J. Sivic, A. Gupta, and A. A. Efros. "Scene semantics from long term observation of people". ECCV2012 [http://repository.cmu.edu/cgi/viewcontent.cgi?article=1775&context=robotics link]
  • A. Gupta, S. Satkin, A. A. Efros, and M. Hebert. "From 3D scene geometry to Human workspace". CVPR2011 [http://www.cs.cmu.edu/~abhinavg/papers/0586.pdf link]
  • D. Fouhey, V. Delaitre, A. Gupta, A. A. Efros, I. Laptev, and J. Sivic. "People Watching: Human Actions as a Cue for Single View Geometry". ECCV2012 [http://graphics.cs.cmu.edu/projects/peopleWatching/dfouhey_people.pdf link]

  11/18 no class
  11/20 no class
  11/25 no class - Thanksgiving Break
  11/27 no class - Thanksgiving Break
19 12/2 Final Project Presentations
20 12/2 7pm @ Gates 219 Final Project Presentations
  12/4 no class due to ICCV
  12/13 Final Project Report Due


