Follow
Gabriel Dulac-Arnold
Gabriel Dulac-Arnold
Google DeepMind
Verified email at squirrelsoup.net - Homepage
Title
Cited by
Cited by
Year
Learning from demonstrations for real world reinforcement learning
T Hester, M Vecerik, O Pietquin, M Lanctot, T Schaul, B Piot, A Sendonaris, ...
arXiv preprint arXiv:1704.03732, 2017
1449*2017
Deep reinforcement learning in large discrete action spaces
G Dulac-Arnold, R Evans, H van Hasselt, P Sunehag, T Lillicrap, J Hunt, ...
arXiv preprint arXiv:1512.07679, 2015
7532015
Challenges of real-world reinforcement learning
G Dulac-Arnold, D Mankowitz, T Hester
ICML Workshop on Real-Life RL, 2019
6752019
Challenges of real-world reinforcement learning: definitions, benchmarks and analysis
G Dulac-Arnold, N Levine, DJ Mankowitz, J Li, C Paduraru, S Gowal, ...
Machine Learning 110 (9), 2419-2468, 2021
607*2021
The predictron: End-to-end learning and planning
D Silver, H Hasselt, M Hessel, T Schaul, A Guez, T Harley, ...
International Conference on Machine Learning, 3191-3199, 2017
3082017
Acme: A research framework for distributed reinforcement learning
MW Hoffman, B Shahriari, J Aslanides, G Barth-Maron, N Momchev, ...
arXiv preprint arXiv:2006.00979, 2020
2552020
Rl unplugged: A suite of benchmarks for offline reinforcement learning
C Gulcehre, Z Wang, A Novikov, T Paine, S Gómez, K Zolna, R Agarwal, ...
Advances in Neural Information Processing Systems 33, 7248-7259, 2020
207*2020
Model-based offline planning
A Argenson, G Dulac-Arnold
ICLR 2021, 2020
1472020
Learning to run a power network challenge: a retrospective analysis
A Marot, B Donnot, G Dulac-Arnold, A Kelly, A O’Sullivan, J Viebahn, ...
NeurIPS 2020 Competition and Demonstration Track, 112-132, 2021
722021
Datum-wise classification: a sequential approach to sparsity
G Dulac-Arnold, L Denoyer, P Preux, P Gallinari
Machine Learning and Knowledge Discovery in Databases: European Conference …, 2011
702011
Deep reinforcement learning with attention for slate markov decision processes with high-dimensional states and actions
P Sunehag, R Evans, G Dulac-Arnold, Y Zwols, D Visentin, B Coppin
arXiv preprint arXiv:1512.01124, 2015
592015
AI-based mobile application to fight antibiotic resistance
M Pascucci, G Royer, J Adamek, MA Asmar, D Aristizabal, L Blanche, ...
Nature communications 12 (1), 1173, 2021
582021
Differentiable deep clustering with cluster size constraints
A Genevay, G Dulac-Arnold, JP Vert
arXiv preprint arXiv:1910.09036, 2019
462019
Text classification: A sequential reading approach
G Dulac-Arnold, L Denoyer, P Gallinari
European Conference on Information Retrieval, 411-423, 2011
402011
Barkour: Benchmarking animal-level agility with quadruped robots
K Caluwaerts, A Iscen, JC Kew, W Yu, T Zhang, D Freeman, KH Lee, ...
arXiv preprint arXiv:2305.14654, 2023
392023
Deep multi-class learning from label proportions
G Dulac-Arnold, N Zeghidour, M Cuturi, L Beyer, JP Vert
arXiv preprint arXiv:1905.12909, 2019
392019
Fast reinforcement learning with large action sets using error-correcting output codes for mdp factorization
G Dulac-Arnold, L Denoyer, P Preux, P Gallinari
Joint European Conference on Machine Learning and Knowledge Discovery in …, 2012
282012
Robovqa: Multimodal long-horizon reasoning for robotics
P Sermanet, T Ding, J Zhao, F Xia, D Dwibedi, K Gopalakrishnan, C Chan, ...
2024 IEEE International Conference on Robotics and Automation (ICRA), 645-652, 2024
242024
Learning dynamics models for model predictive agents
M Lutter, L Hasenclever, A Byravan, G Dulac-Arnold, P Trochim, N Heess, ...
arXiv preprint arXiv:2109.14311, 2021
242021
Residual reinforcement learning from demonstrations
M Alakuijala, G Dulac-Arnold, J Mairal, J Ponce, C Schmid
arXiv preprint arXiv:2106.08050, 2021
242021
The system can't perform the operation now. Try again later.
Articles 1–20