KAUFMANN Emilie

< Back to ILB Patrimony
Topics of productions
Affiliations
  • 2015 - 2020
    Séquential learning
  • 2020 - 2021
    Centrale Lille Institut
  • 2015 - 2020
    Centre de recherche en informatique, signal et automatique de Lille
  • 2017 - 2018
    Manouba University
  • 2012 - 2014
    Télécom ParisTech
  • 2012 - 2016
    Laboratoire traitement et communication de l'information
  • 2013 - 2014
    Informatique, telecommunications et electronique de paris
  • 2012 - 2013
    Laboratoire traitement du signal et de l'image
  • 2021
  • 2020
  • 2019
  • 2018
  • 2017
  • 2016
  • 2014
  • 2013
  • A kernel-based approach to non-stationary reinforcement learning in metric spaces.

    Omar DOMINGUES, Pierre MENARD, Matteo PIROTTA, Emilie KAUFMANN, Michal VALKO
    International Conference on Artificial Intelligence and Statistics | 2021
    In this work, we propose KeRNS: an algorithm for episodic reinforcement learning in nonstationary Markov Decision Processes (MDPs) whose state-action set is endowed with a metric. Using a non-parametric model of the MDP built with time-dependent kernels, we prove a regret bound that scales with the covering dimension of the state-action space and the total variation of the MDP with time, which quantifies its level of non-stationarity. Our method generalizes previous approaches based on sliding windows and exponential discounting used to handle changing environments. We further propose a practical implementation of KeRNS, we analyze its regret and validate it experimentally.
Affiliations are detected from the signatures of publications identified in scanR. An author can therefore appear to be affiliated with several structures or supervisors according to these signatures. The dates displayed correspond only to the dates of the publications found. For more information, see https://scanr.enseignementsup-recherche.gouv.fr