Yuke Zhu
 

I am a PhD student at Stanford University. My research focuses on the principles and applications of computer vision, machine learning, and robotics, in particular visual knowledge and deep reinforcement learning. I work in Stanford Vision Lab with Prof. Fei-Fei Li. Prior to coming to Stanford, I received a BSc. degree from Simon Fraser University and a BEng. degree from Zhejiang University.

Email: yukez@cs.stanford.edu

Gates Computer Science Building, Room 242
353 Serra Mall, Stanford University
Stanford, CA 94305-9025, USA
[CV]


News

[new] We released our new paper on visual semantic planning using the extended THOR framework.

[new] We released our code and a new dataset for our CVPR'17 paper on scene graph generation.

I will be interning at Google DeepMind in London, June to September 2017.

Two papers got accepted to CVPR 2017. See you there in Honolulu, Hawaii.


Recent Papers

  • Visual Semantic Planning using Deep Successor Representations
    Yuke Zhu*, Daniel Gordon*, Eric Kolve, Dieter Fox, Li Fei-Fei, Abhinav Gupta, Roozbeh Mottaghi, Ali Farhadi
  • Scene Graph Generation by Iterative Message Passing
    Danfei Xu, Yuke Zhu, Christopher B. Choy, Li Fei-Fei
    CVPR 2017
  • Knowledge Acquisition for Visual Question Answering via Iterative Querying
    Yuke Zhu, Joseph J. Lim, Li Fei-Fei
    CVPR 2017
  • Target-driven Visual Navigation in Indoor Scenes using Deep Reinforcement Learning
    Yuke Zhu, Roozbeh Mottaghi, Eric Kolve, Joseph J. Lim, Abhinav Gupta, Li Fei-Fei, Ali Farhadi
    ICRA 2017
  • Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations
    Ranjay Krishna, Yuke Zhu, Oliver Groth, Justin Johnson, Kenji Hata, Joshua Kravitz, Stephanie Chen,
    Yannis Kalanditis, Li-Jia Li, David A. Shamma, Michael Bernstein, Li Fei-Fei
    IJCV 2017
  • Visual7W: Grounded Question Answering in Images
    Yuke Zhu, Oliver Groth, Michael Bernstein, Li Fei-Fei
    CVPR 2016
  • Action Recognition by Hierarchical Mid-level Action Elements
    Tian Lan*, Yuke Zhu*, Amir Roshan Zamir, Silvio Savarese [* indicates equal contribution]
    ICCV 2015
  • Building a Large-scale Multimodal Knowledge Base System for Answering Visual Queries
    Yuke Zhu, Ce Zhang, Christopher RĂ©, Li Fei-Fei
    arXiv:1507.05670
  • Reasoning About Object Affordances in a Knowledge Base Representation
    Yuke Zhu, Alireza Fathi, Li Fei-Fei
    ECCV 2014

Teaching Experience

  • Teaching Assistant

    Spring 2013-2014 | Stanford, CA, USA

    CS 431: High-Level Vision: Behaviors, Neurons and Computational Models

    Summer 2013-2014 | Stanford, CA, USA

    CS 193C: Client-Side Internet Technologies

    Fall 2014-2015 | Stanford, CA, USA

    CS 131: Computer Vision: Foundations and Applications

    Winter 2014-2015 | Stanford, CA, USA

    CS 231N: Convolutional Neural Networks for Visual Recognition

Working Experience

  • Research Intern

    Jun - Sept 2017 | London, England, United Kingdom

    Google Deepmind
  • Research Intern

    Jun - Sept 2016 | Seattle, WA, USA

    Allen Institute for Artificial Intelligence
  • Research Intern

    May - Aug 2015 | Venice, CA, USA

    Snapchat Inc.
  • Software Engineer Intern

    Apr - Jul 2013 | San Francisco, CA, USA

    Twitter Inc.
  • Research Assistant

    Dec 2011 - Apr 2013 | Vancouver, BC, Canada

    SFU Computational Logic Lab

    Jan 2012 - Apr 2013 | Vancouver, BC, Canada

    SFU Vision and Media Lab
  • Co-founder

    Aug 2011 - Aug 2013 | Hangzhou, Zhejiang, China

    Hangzhou Iserlohn Technology Co., Ltd.
  • Software Engineer Intern

    Jul - Aug 2011 | Qingdao, Shandong, China

    Qingdao Topscomm Communication Co., Ltd.