KERDREUX Thomas

< Back to ILB Patrimony
Affiliations
  • 2019 - 2021
    Département d'Informatique de l'Ecole Normale Supérieure
  • 2019 - 2020
    Ecole normale supérieure Paris
  • 2020 - 2021
    Technical University of Berlin
  • 2019 - 2020
    Apprentissage statistique et parcimonie
  • 2019 - 2020
    Sciences mathematiques de paris centre
  • 2021
  • 2020
  • Linear Bandits on Uniformly Convex Sets.

    Thomas KERDREUX, Christophe ROUX, Alexandre D ASPREMONT, Sebastian POKUTTA
    2021
    Linear bandit algorithms yield $\tilde{\mathcal{O}}(n\sqrt{T})$ pseudo-regret bounds on compact convex action sets $\mathcal{K}\subset\mathbb{R}^n$ and two types of structural assumptions lead to better pseudo-regret bounds. When $\mathcal{K}$ is the simplex or an $\ell_p$ ball with $p\in]1,2]$, there exist bandits algorithms with $\tilde{\mathcal{O}}(\sqrt{nT})$ pseudo-regret bounds. Here, we derive bandit algorithms for some strongly convex sets beyond $\ell_p$ balls that enjoy pseudo-regret bounds of $\tilde{\mathcal{O}}(\sqrt{nT})$, which answers an open question from [BCB12, \S 5.5.]. Interestingly, when the action set is uniformly convex but not necessarily strongly convex, we obtain pseudo-regret bounds with a dimension dependency smaller than $\mathcal{O}(\sqrt{n})$. However, this comes at the expense of asymptotic rates in $T$ varying between $\tilde{\mathcal{O}}(\sqrt{T})$ and $\tilde{\mathcal{O}}(T)$.
Affiliations are detected from the signatures of publications identified in scanR. An author can therefore appear to be affiliated with several structures or supervisors according to these signatures. The dates displayed correspond only to the dates of the publications found. For more information, see https://scanr.enseignementsup-recherche.gouv.fr