Yuke Zhu
 

I am a PhD student at Stanford University. My research focuses on the principles and applications of computer vision, machine learning, and robotics, in particular visual knowledge and deep reinforcement learning. I work in Stanford Vision Lab with Prof. Fei-Fei Li. Prior to coming to Stanford, I received a BSc. degree from Simon Fraser University and a BEng. degree from Zhejiang University.

Email: yukez@cs.stanford.edu

Gates Computer Science Building, Room 242
353 Serra Mall, Stanford University
Stanford, CA 94305-9025, USA
[CV]


News

[new] I will be interning at Google DeepMind in London, June to September 2017.

[new] Two papers got accepted to CVPR 2017. See you there in Honolulu, Hawaii.

Our new paper on generating structured scene graphs from images is on arXiv now.

Our paper on visual navigation using deep reinforcement learning has been accepted to ICRA 2017.

We released our new paper on learning robot navigation with deep reinforcement learning.


Recent Papers

  • Scene Graph Generation by Iterative Message Passing
    Danfei Xu, Yuke Zhu, Christopher B. Choy, Li Fei-Fei
    CVPR 2017 (To appear)
  • Knowledge Acquisition for Visual Question Answering via Iterative Querying
    Yuke Zhu, Joseph J. Lim, Li Fei-Fei
    CVPR 2017 (To appear)
  • Target-driven Visual Navigation in Indoor Scenes using Deep Reinforcement Learning
    Yuke Zhu, Roozbeh Mottaghi, Eric Kolve, Joseph J. Lim, Abhinav Gupta, Li Fei-Fei, Ali Farhadi
    ICRA 2017
  • Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations
    Ranjay Krishna, Yuke Zhu, Oliver Groth, Justin Johnson, Kenji Hata, Joshua Kravitz, Stephanie Chen,
    Yannis Kalanditis, Li-Jia Li, David A. Shamma, Michael Bernstein, Li Fei-Fei
    IJCV 2017
  • Visual7W: Grounded Question Answering in Images
    Yuke Zhu, Oliver Groth, Michael Bernstein, Li Fei-Fei
    CVPR 2016
  • Action Recognition by Hierarchical Mid-level Action Elements
    Tian Lan*, Yuke Zhu*, Amir Roshan Zamir, Silvio Savarese [* indicates equal contribution]
    ICCV 2015
  • Building a Large-scale Multimodal Knowledge Base System for Answering Visual Queries
    Yuke Zhu, Ce Zhang, Christopher RĂ©, Li Fei-Fei
    arXiv:1507.05670
  • Reasoning About Object Affordances in a Knowledge Base Representation
    Yuke Zhu, Alireza Fathi, Li Fei-Fei
    ECCV 2014

Teaching Experience

  • Teaching Assistant

    Spring 2013-2014 | Stanford, CA, USA

    CS 431: High-Level Vision: Behaviors, Neurons and Computational Models

    Summer 2013-2014 | Stanford, CA, USA

    CS 193C: Client-Side Internet Technologies

    Fall 2014-2015 | Stanford, CA, USA

    CS 131: Computer Vision: Foundations and Applications

    Winter 2014-2015 | Stanford, CA, USA

    CS 231N: Convolutional Neural Networks for Visual Recognition

Working Experience

  • Research Intern

    Jun - Sept 2017 | London, England, United Kingdom

    Google Deepmind
  • Research Intern

    Jun - Sept 2016 | Seattle, WA, USA

    Allen Institute for Artificial Intelligence
  • Research Intern

    May - Aug 2015 | Venice, CA, USA

    Snapchat Inc.
  • Software Engineer Intern

    Apr - Jul 2013 | San Francisco, CA, USA

    Twitter Inc.
  • Research Assistant

    Dec 2011 - Apr 2013 | Vancouver, BC, Canada

    SFU Computational Logic Lab

    Jan 2012 - Apr 2013 | Vancouver, BC, Canada

    SFU Vision and Media Lab
  • Co-founder

    Aug 2011 - Aug 2013 | Hangzhou, Zhejiang, China

    Hangzhou Iserlohn Technology Co., Ltd.
  • Software Engineer Intern

    Jul - Aug 2011 | Qingdao, Shandong, China

    Qingdao Topscomm Communication Co., Ltd.