Emilie Kaufmann
Centre national de la recherche scientifique
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Emilie Kaufmann.
algorithmic learning theory | 2012
Emilie Kaufmann; Nathaniel Korda; Rémi Munos
The question of the optimality of Thompson Sampling for solving the stochastic multi-armed bandit problem had been open since 1933. In this paper we answer it positively for the case of Bernoulli rewards by providing the first finite-time analysis that matches the asymptotic rate given in the Lai and Robbins lower bound for the cumulative regret. The proof is accompanied by a numerical comparison with other optimal policies, experiments that have been lacking in the literature until now for the Bernoulli case.
Annals of Statistics | 2018
Emilie Kaufmann
This paper is about index policies for minimizing (frequentist) regret in a stochastic multi-armed bandit model, inspired by a Bayesian view on the problem. Our main contribution is to prove that the Bayes-UCB algorithm, which relies on quantiles of posterior distributions, is asymptotically optimal when the reward distributions belong to a one-dimensional exponential family, for a large class of prior distributions. We also show that the Bayesian literature gives new insight on what kind of exploration rates could be used in frequentist, UCB-type algorithms. Indeed, approximations of the Bayesian optimal solution or the Finite Horizon Gittins indices provide a justification for the kl-UCB+ and kl-UCB-H+ algorithms, whose asymptotic optimality is also established.
arXiv: Networking and Internet Architecture | 2017
Rémi Bonnefoi; Lilian Besson; Christophe Moy; Emilie Kaufmann; Jacques Palicot
Setting up the future Internet of Things (IoT) networks will require to support more and more communicating devices. We prove that intelligent devices in unlicensed bands can use Multi-Armed Bandit (MAB) learning algorithms to improve resource exploitation. We evaluate the performance of two classical MAB learning algorithms, UCB1 and Thompson Sampling, to handle the decentralized decision-making of Spectrum Access, applied to IoT networks; as well as learning performance with a growing number of intelligent end-devices. We show that using learning algorithms does help to fit more devices in such networks, even when all end-devices are intelligent and are dynamically changing channel. In the studied scenario, stochastic MAB learning provides a up to 16% gain in term of successful transmission probabilities, and has near optimal performance even in non-stationary and non-i.i.d. settings with a majority of intelligent devices.
algorithmic learning theory | 2016
Emilie Kaufmann; Thomas Bonald; Marc Lelarge
This paper presents a novel spectral algorithm with additive clustering, designed to identify overlapping communities in networks. The algorithm is based on geometric properties of the spectrum of the expected adjacency matrix in a random graph model that we call stochastic blockmodel with overlap (SBMO). An adaptive version of the algorithm, that does not require the knowledge of the number of hidden communities, is proved to be consistent under the SBMO when the degrees in the graph are (slightly more than) logarithmic. The algorithm is shown to perform well on simulated data and on real-world graphs with known overlapping communities.
Journal of Machine Learning Research | 2016
Emilie Kaufmann; Olivier Cappé; Aurélien Garivier
international conference on artificial intelligence and statistics | 2012
Emilie Kaufmann; Olivier Cappé; Aurélien Garivier
conference on learning theory | 2013
Emilie Kaufmann
neural information processing systems | 2016
Aurélien Garivier; Emilie Kaufmann; Tor Lattimore
conference on learning theory | 2014
Emilie Kaufmann; Olivier Cappé; Aurélien Garivier
conference on learning theory | 2016
Aurélien Garivier; Emilie Kaufmann