SequeL is a research group working in the field of machine learning; more specifically, SequeL is dedicated to the study of the problem of sequential decision making under uncertainty, that is, the study of how an “agent” having a goal to fullfill can learn an optimal behavior to achieve this goal in an unknown environment. SequeL is composed of two dozens members. Activities range from foundations of learning to algorithm design, and transfer towards companies. Questions are studied such as “What can a Turing machine learn efficiently? and in which conditions?”. Or, in a budget context, “Given an amount of computational resource, how close to the optimal behavior can an algorithm reach?”, finally application oriented questions such as those related to computational advertizing and recommendation systems for e-commerce websites, are also studied.
SequeL has led to the multi-awarded Crazy Stone go playing program. Some SequeL PhD students have been awarded the Gilles Kahn award, the Jacques Neveu award and the ECCAI award. We won the ICML 2011 Exploration vs. Exploitation challenge, and the ACM RecSYS 2014 challenge (both challenges on recommendation systems). SequeL expertize has led to collaborations with international companies like Orange Labs, Intel, Technicolor, Deezer and also with national and local SMEs.
Philippe Preux
Méthodes adaptatives pour l’optimisation dans un environnement stochastique 29/09/2021
On sampling determinantal point processes 19/05/2020
Sur le rôle de l'être humain dans le dialogue homme/machine 14/12/2018
Novel Learning and Exploration-Exploitation Methodes for Effective Recommender Systems 19/10/2018
Neural Machine Translation Architectures and Applications 15/06/2018
Efficient Sequential Learning in Structured and Constrained Environments 18/12/2017
REINFORCEMENT LEARNING: THE MULTIPLAYER CASE 18/12/2017
Exploration-Exploitation with Thompson Sampling in Linear Systems 13/12/2017
Sequential learning with similarities 28/11/2016
Usages of Graphs and Synthetic Data for Software Propagation Analysis 03/11/2016
Learning rational linear sequential systems using the method of moments 08/07/2016
Computational and sample complexity of planning and reinforcement learning algorithms 14/12/2015
Machine Learning for Decision-Making under Uncertainty 12/05/2015
Data-driven evaluation of Contextual Bandit algorithms and applications to Dynamic Recommendation. 18/12/2014
Maisqual : Amélioration de la qualité logicielle par fouille de données. 01/07/2014
Online Sequence Prediction 18/11/2013
Apprentissage incrémental en ligne sur flux de données 30/11/2012
De l'échantillonnage optimal en grande et petite dimension 05/10/2012
Compromis exploration - Exploitation en optimisation et contrôle 01/06/2012
Apprentissage Séquentiel : Bandits, Statistique et Renforcement 03/10/2011
Algorithmes d'Ensembles Actifs pour le LASSO 08/07/2011
Bandits Games and Clustering Foundations 10/06/2010