Michal Valko

Cited by

	All	Since 2019
Citations	11621	10774
h-index	43	39
i10-index	102	95

3500

1750

875

2625

2011201220132014201520162017201820192020202120222023202436 25 63 61 107 141 167 199 321 604 1367 2659 3427 2380

Public access

View all

53 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Rémi MunosGoogle DeepMindVerified email at inria.fr
Mohammad Gheshlaghi AzarCohereVerified email at google.com
Bilal PiotGoogle DeepmindVerified email at google.com
Daniele CalandrielloResearch Scientist, DeepMindVerified email at google.com
Corentin TallecDeepMindVerified email at google.com
Jean-bastien GrillVerified email at google.com
Zhaohan Daniel GuoDeepMindVerified email at google.com
Alessandro LazaricResearch Scientist, Facebook Artificial Intelligence ResearchVerified email at inria.fr
Florent AltchéResearch Engineer, DeepMindVerified email at google.com
Pierre MénardOvGU MagdeburgVerified email at inria.fr
Florian STRUBCohereVerified email at cohere.com
Pierre RichemondGoogle DeepMindVerified email at deepmind.com
Emilie KaufmannCNRS & Univ. Lille (CRIStAL)Verified email at inria.fr
Yunhao TangResearch Scientist, DeepMindVerified email at columbia.edu
Omar Darwiche DominguesCohereVerified email at cohere.com
Branislav KvetonAmazonVerified email at amazon.com
Milos HauskrechtProfessor of Computer Science, University of PittsburghVerified email at pitt.edu
Mark RowlandResearch Scientist, Google DeepMindVerified email at google.com
Matteo PirottaResearch Scientist, Meta (FAIR)Verified email at fb.com
Shantanu ThakoorResearch Engineer at DeepMindVerified email at google.com

Michal Valko

Llama @ Meta Paris & Inria & MVA - Ex: Gemini and BYOL @ Google DeepMind

Verified email at meta.com - Homepage

fine-tuning LLMs rl with human feedback deep reinforcement learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Bootstrap your own latent: A new approach to self-supervised learning JB Grill, F Strub, F Altché, C Tallec, PH Richemond, E Buchatskaya, ... Neural Information Processing Systems, 2020	6108	2020
Large-scale representation learning on graphs via bootstrapping S Thakoor, C Tallec, MG Azar, R Munos, P Veličković, M Valko International Conference on Learning Representations, 2022	393*	2022
Finite-time analysis of kernelised contextual bandits M Valko, N Korda, R Munos, I Flaounas, N Cristianini Uncertainty in Artificial Intelligence, 2013	275	2013
Outlier detection for patient monitoring and alerting M Hauskrecht, I Batal, M Valko, S Visweswaran, GF Cooper, G Clermont Journal of Biomedical Informatics, 2013	175	2013
A general theoretical paradigm to understand learning from human preferences MG Azar, M Rowland, B Piot, D Guo, D Calandriello, M Valko, R Munos International Conference on Artificial Intelligence and Statistics, 2024	159	2024
Online influence maximization under independent cascade model with semi-bandit feedback Z Wen, B Kveton, M Valko, S Vaswani Neural Information Processing Systems, 2017	148*	2017
Stochastic simultaneous optimistic optimization M Valko, A Carpentier, R Munos International Conference on Machine Learning, 2013	139	2013
Spectral bandits for smooth graph functions M Valko, R Munos, B Kveton, T Kocák International Conference on Machine Learning, 2014	132	2014
Broaden your views for self-supervised video learning A Recasens, P Luc, JB Alayrac, L Wang, F Strub, C Tallec, M Malinowski, ... International Conference on Computer Vision, 2021	129	2021
Efficient learning by implicit exploration in bandit problems with side observations T Kocák, G Neu, M Valko, R Munos Neural Information Processing Systems, 2014	128	2014
Episodic reinforcement learning in finite MDPs: Minimax lower bounds revisited O Darwiche Domingues, P Ménard, E Kaufmann, M Valko Algorithmic Learning Theory, 2021	117	2021
Black-box optimization of noisy functions with unknown smoothness JB Grill, M Valko, R Munos Neural Information Processing Systems, 2015	110	2015
Simple regret for infinitely many armed bandits A Carpentier, M Valko International Conference on Machine Learning, 2015	102	2015
Game Plan: What AI can do for Football, and What Football can do for AI K Tuyls, S Omidshafiei, P Muller, Z Wang, J Connor, D Hennes, I Graham, ... Journal of Artificial Intelligence Research 71, 41-88, 2021	95	2021
BYOL works even without batch statistics PH Richemond, JB Grill, F Altché, C Tallec, F Strub, A Brock, S Smith, ... NeurIPS 2020 Workshop: Self-Supervised Learning - Theory and Practice, 2020	94	2020
Adaptive reward-free exploration E Kaufmann, P Ménard, OD Domingues, A Jonsson, E Leurent, M Valko Algorithmic Learning Theory, 2021	89	2021
Gamification of pure exploration for linear bandits R Degenne, P Ménard, X Shang, M Valko International Conference on Machine Learning, 2020	88	2020
Gaussian process optimization with adaptive sketching: Scalable and no regret D Calandriello, L Carratino, A Lazaric, M Valko, L Rosasco Conference on Learning Theory, 2019	83	2019
Fast active learning for pure exploration in reinforcement learning P Ménard, OD Domingues, A Jonsson, E Kaufmann, E Leurent, M Valko International Conference on Machine Learning, 2021	78	2021
Monte-Carlo tree search as regularized policy optimization JB Grill, F Altché, Y Tang, T Hubert, M Valko, I Antonoglou, R Munos International Conference on Machine Learning, 2020	73	2020

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors