I am an assistant professor in Statistics at École Polytechnique, Palaiseau, in the Applied Mathematics Department.

Before that, I was a postdoctoral researcher at EPFL (École Polytechnique Fédérale de Lausanne), in the MLO team led by Martin Jaggi. Before that, I was a Ph.D. student in the Sierra team, which is part of the DI/ENS (Computer Science Department of École Normale Supérieure). I graduated from École Normale Supérieure de Paris (Ulm) in 2014 and obtained a Master's degree in Mathematics, Probability and Statistics at Université Paris-Sud, Orsay.

My Ph.D. was supervised by Francis Bach. My main research interests are statistics, optimization, stochastic approximation, high-dimensional learning, non-parametric statistics, and scalable kernel methods.

From March to August 2016, I was a visiting scholar at the University of California, Berkeley, under the supervision of Martin Wainwright.

News!

The Applied Mathematics Department at École Polytechnique has open positions for tenure-track professors, one in Statistics and the other in Statistics and Energy. These positions offer competitive conditions.

Our team is also hiring postdocs and research engineers, with very competitive conditions. If you have a PhD in statistics, optimization, or machine learning and are interested in joining a great team in Paris, send me an email.


10/03/2020: Optimization for Machine Learning workshop in Luminy. I presented On Convergence-Diagnostic based Step Sizes for Stochastic Gradient Descent; slides here!

26/01/2020: The third edition of the "Advances in Machine Learning, Theory Meets Practice" workshop at Applied Machine Learning Days (AMLD) in Lausanne, which we co-organized with Sebastian Stich, was a nice occasion to bring theoreticians and practitioners together: thanks to the speakers for their great talks!
See the workshop page for slides and details.

01/2020: Our paper on using optimal transport for NLP has been accepted to AISTATS 2020!

12/2019: Our paper on unsupervised time series representation has successfully passed the NeurIPS reproducibility challenge! About 70 papers published at NeurIPS 2019 were picked by independent researchers, who tried to reproduce the results, assessed whether the description of the framework was complete, and provided feedback. You can find the discussion on our paper here and the full 12-page report on our work here. F. Liljefors, M. M. Sorkhei and S. Broomé were able to reproduce and reimplement our methods from the description given in the paper, and to obtain the same results as in our original paper! We would like to thank them for their work!

12/2019: I will be presenting two papers at NeurIPS 2019 in Vancouver. The first one is on distributed optimization, more specifically Local SGD, with K. K. Patel; the second one is on a new method to generate representations of time series in an unsupervised fashion, with J.-Y. Franceschi and M. Jaggi. Links to the papers below!

12/2019: Constantin Philipenko is starting his PhD! Constantin will be working on federated learning, especially on problems arising from privacy concerns. He will be co-supervised by Éric Moulines and me, and will also be working with Richard Vidal and Leatitia Kameni from the research team at Accenture. Welcome to my first PhD student! :)

10/2019: I will be giving a lecture on Large Scale Learning at the Autumn School in Machine Learning, Tbilisi, Georgia. You can find the slides here.

Publications

Debiasing Stochastic Gradient Descent to handle missing values
with Aude Sportisse, Claire Boyer and Julie Josse.
arXiv preprint.
Unsupervised Scalable Representation Learning for Multivariate Time Series
with Jean-Yves Franceschi and Martin Jaggi.
NeurIPS 2019, arXiv:1901.10738.
Communication trade-offs for synchronized distributed SGD with large step size
with Kumar Kshitij Patel.
NeurIPS 2019, arXiv:1904.11325.
Context Mover's Distance & Barycenters: Optimal transport of contexts for building representations
with Sidak Pal Singh, Andreas Hug and Martin Jaggi.
ICLR 2019 workshop; accepted at AISTATS 2020.
Bridging the Gap between Constant Step Size Stochastic Gradient Descent and Markov Chains
Accepted to the Annals of Statistics, 2019, arXiv:1707.06386 [math.ST].
Harder, Better, Faster, Stronger Convergence Rates for Least-Squares Regression
Journal of Machine Learning Research (JMLR), arXiv:1602.05419 [math.ST].
Non-parametric Stochastic Approximation with Large Step Sizes
with Francis Bach.
Published in the Annals of Statistics.

Workshops

January 2020, Organizer of the "Theory Meets Practice" workshop, AMLD 2020, Lausanne. [Event page]
January 2019, Organizer of the "Advances in Machine Learning, Theory Meets Practice" workshop, AMLD 2019, Lausanne. [Event page]
December 2016, Harder, Better, Faster, Stronger Convergence Rates for Least-Squares Regression, with Nicolas Flammarion, NIPS OPT16 workshop. [abstract]
December 2015, Adaptivity of stochastic gradient descent, NIPS workshop. [abstract][slides]

Teaching

2019-2020: Statistics, 2A (~third year bachelor), MAP 433, Polytechnique.
2019-2020: Generalization Properties of Learning Algorithms, M2 Data Science.
2019-2020: Statistics, M2 Data Science for Business, Polytechnique.
2019-2020: Optimization and Deep Learning, M2 Data Science for Business, Polytechnique.
2018-2019: Probability and Statistics, 1A (~third year bachelor), MAP 361, Polytechnique.
2016-2017: Teaching assistant, Statistics, Master 1 (31NU02MS), University Paris Diderot.
2016-2017: Teaching assistant, Fundamental Statistics, Master 1 (ULMT42), University Paris Diderot.
2015-2016: Teaching assistant, Calculus (MM1), University Paris Diderot.
2014-2015: Teaching assistant, Linear Algebra (MM1), University Paris Diderot.
2010-2014: Oral examinations in Classes Préparatoires (PC, MP*).

Reviews and Committees

Reviewer for JMLR, AoS, COLT, IEEE, ACM, ICML, and Annales de l'IHP.

I was a member of the jury for Belhal Karimi's PhD defense, on September 19, 2019.

I am a referee for Luigi Carratino's PhD dissertation, which will be defended in early spring 2020.

I am part of the scientific committee for the seminar le Palaisien.

Some Talks

March 2020, On Convergence-Diagnostic based Step Sizes for Stochastic Gradient Descent, Optimization for Machine Learning workshop. Slides here!

March 2020, On Convergence-Diagnostic based Step Sizes for Stochastic Gradient Descent, Parisian Seminar of Optimization.

January 2020, On Convergence-Diagnostic based Step Sizes for Stochastic Gradient Descent, Inria Paris.

October 2019, Large Scale Learning and Optimization, Tbilisi, Georgia. Six-hour lecture at ASML; slides here!

April 2019, Journées Calcul et Apprentissage, Lyon

December 2018, Communication trade-offs for synchronized distributed SGD with large step size, CMStatistics 2018, Pisa, Italy [slides]

January 2018, Optimization tutorial, "YSP" days organized by the SFDS, Institut Henri Poincaré. [slides]

December 2017, Stochastic algorithms in machine learning, tutorial at the "journée algorithmes stochastiques", Paris Dauphine. [slides]

November 2017, Stochastic approximation and Markov chains, invited talk, Télécom ParisTech, Paris. [slides]

February 2017, Scalable methods for Statistics, a short presentation, Cambridge, UK. [slides]

March 2016, Non-parametric stochastic approximation, UC Berkeley.

October 2015, Tradeoffs of learning in Hilbert spaces, ENSAI Rennes. [slides]

June 2015, Non-parametric Stochastic Approximation, Machine Learning Summer School, Tübingen.

Thesis defense!

I defended my thesis on Thursday, September 28, at 2:30 pm, at INRIA.

You can download the final version of the manuscript (or here if you want to print it).

You can also have a look at the slides.