<?xml version="1.0" encoding="utf-8" standalone="yes" ?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>Kapil Gupta | Nima Hamidi</title>
    <link>/~hamidi/author/kapil-gupta/</link>
      <atom:link href="/~hamidi/author/kapil-gupta/index.xml" rel="self" type="application/rss+xml" />
    <description>Kapil Gupta</description>
    <generator>Wowchemy (https://wowchemy.com)</generator><language>en-us</language><lastBuildDate>Fri, 01 Nov 2019 00:00:00 +0000</lastBuildDate>
    <image>
      <url>/~hamidi/images/icon_hu8a5816724c4a49e81ada0aad75c4f4f9_4068_512x512_fill_lanczos_center_2.png</url>
      <title>Kapil Gupta</title>
      <link>/~hamidi/author/kapil-gupta/</link>
    </image>
    
    <item>
      <title>Personalizing Many Decisions with High-Dimensional Covariates</title>
      <link>/~hamidi/publication/hamidi-2019-personalizing/</link>
      <pubDate>Fri, 01 Nov 2019 00:00:00 +0000</pubDate>
      <guid>/~hamidi/publication/hamidi-2019-personalizing/</guid>
      <description>&lt;p&gt;We consider the $k$-armed stochastic contextual bandit problem with $d$ dimensional features, when both $k$ and $d$ can be large. To the best of our knowledge, all existing algorithm for this problem have a regret bound that scale as polynomials of degree at least two in $k$ and $d$. The main contribution of this paper is to introduce and theoretically analyze a new algorithm (REAL-bandit) with a regret that scales by $r^2(k+d)$ when $r$ is rank of the $k\times d$ matrix of unknown parameters. REAL-bandit relies on ideas from low-rank matrix estimation literature and a new row-enhancement subroutine that yields sharper bounds for estimating each row of the parameter matrix that may be of independent interest.&lt;/p&gt;
</description>
    </item>
    
  </channel>
</rss>
