Broad areas: machine learning, optimization, dynamic systems and control
Foci: reinforcement learning, approximate dynamic programming and bandits
Applications: smart grid, petroleum reservoir production, recommendation systems
B. Kveton, Z. Wen, A. Ashkan, H. Eydgahi, B. Eriksson, "Matroid Bandits: Fast Combinatorial Optimization with Learning", accepted by the Conference on Uncertainty in Artificial Intelligence (UAI), 2014 (plenary talk).
Z. Wen, "Efficient Reinforcement Learning with Value Function Generalization", PhD Dissertation, Stanford University.
Z. Wen and B. Van Roy, "Efficient Exploration and Value Function Generalization in Deterministic Systems", submitted to Mathematics of Operations Research.
An abridged version of this paper was published in Advances in Neural Information Processing Systems (NIPS) 26, MIT Press, 2013. [Link]
Z. Wen, D. O'Neill and H. R. Maei, "Optimal Demand Response Using Device Based Reinforcement Learning", submitted to IEEE Transactions on Smart Grid. [Appendix]
V. Gabillon, B. Kveton, Z. Wen, B. Eriksson and S. Muthukrishnan, "Adaptive Submodular Maximization in Bandit Setting", Advances in Neural Information Processing Systems (NIPS) 26, MIT Press, 2013. [Link]
Z. Wen, B. Kveton, B. Eriksson and S. Bhamidipati, "Sequential Bayesian Search", in Proceedings of the 30th International Conference on Machine Learning (ICML), Atlanta, Georgia, June 2013. [Appendix]
Z. Wen, L. J. Durlofsky, B. Van Roy and K. Aziz, "Approximate Dynamic Programming for Optimizing Oil Production", Chapter 25 in Reinforcement Learning and Approximate Dynamic Programming for Feedback Control, edited by F. L. Lewis and D. Liu, Wiley-IEEE Press, 2012.
Z. Wen, L. J. Durlofsky, B. Van Roy and K. Aziz, "Use of Approximate Dynamic Programming for Production Optimization", in Society of Petroleum Engineers (SPE) Proceedings, The Woodlands, Texas, February 2011.
Z. Wen, S. Roy and A. Saberi, "On the Dynamic Response of a Saturating Static-Feedback-Controlled Single Integrator Driven by White Noise", IEEE Transactions on Automatic Control, vol. 55, no. 4, pp. 959-965, April 2010.
Z. Wen, S. Roy and A. Saberi, "On the Disturbance Response and External Stability of a Saturating Static-Feedback-Controlled Double Integrator", Automatica, vol. 44, pp. 2191-2196, August 2008.
An earlier version of this paper was published in Proceedings of the American Control Conference, New York City, July 2007.
Dynamic Programming and Stochastic Control (MS&E 351), Fall 2013, Stanford University.
Teaching assistant for:
Dynamic Programming and Stochastic Control (MS&E 351), Fall 2012, Stanford University.
Approximate Dynamic Programming (MS&E 339), Fall 2009, Stanford University.
Linear Systems (EE 501), Introduction to Control Systems (EE 489), Circuits (EE 321), 2005-2007, Washington State University.
PhD minor, Business/Finance, Stanford Graduate School of Business, June 2013.
BE, Electrical Engineering, Harbin Institute of Technology, July 2005.
Undergraduate exchange student, Hong Kong University of Science and Technology, 2004.
Research scientist, Technicolor Research Lab, Palo Alto, CA, 2013-2014.
Research intern, Technicolor Research Lab, Palo Alto, CA, 2012.
STMicroelectronics Stanford Graduate Fellowship, Stanford University, 2008-2012.
Outstanding MS Graduate Student, Washington State University, May 2007.
Outstanding Student Scholarship, Harbin Institute of Technology, 2002.