Sebastian Borgeaud
Sebastian Borgeaud
Verified email at
Cited by
Cited by
Flamingo: a visual language model for few-shot learning
JB Alayrac, J Donahue, P Luc, A Miech, I Barr, Y Hasson, K Lenc, ...
Advances in neural information processing systems 35, 23716-23736, 2022
Emergent abilities of large language models
J Wei, Y Tay, R Bommasani, C Raffel, B Zoph, S Borgeaud, D Yogatama, ...
arXiv preprint arXiv:2206.07682, 2022
Training compute-optimal large language models
J Hoffmann, S Borgeaud, A Mensch, E Buchatskaya, T Cai, E Rutherford, ...
arXiv preprint arXiv:2203.15556, 2022
Scaling language models: Methods, analysis & insights from training gopher
JW Rae, S Borgeaud, T Cai, K Millican, J Hoffmann, F Song, J Aslanides, ...
arXiv preprint arXiv:2112.11446, 2021
Improving language models by retrieving from trillions of tokens
S Borgeaud, A Mensch, J Hoffmann, T Cai, E Rutherford, K Millican, ...
arXiv preprint arXiv:2112.04426, 2021
Perceiver io: A general architecture for structured inputs & outputs
A Jaegle, S Borgeaud, JB Alayrac, C Doersch, C Ionescu, D Ding, ...
arXiv preprint arXiv:2107.14795, 2021
Gemini: a family of highly capable multimodal models
G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ...
arXiv preprint arXiv:2312.11805, 2023
OpenSpiel: A framework for reinforcement learning in games
M Lanctot, E Lockhart, JB Lespiau, V Zambaldi, S Upadhyay, J Pérolat, ...
arXiv preprint arXiv:1908.09453, 2019
Unsupervised learning of object keypoints for perception and control
TD Kulkarni, A Gupta, C Ionescu, S Borgeaud, M Reynolds, A Zisserman, ...
Advances in neural information processing systems 32, 2019
Accelerating large language model decoding with speculative sampling
C Chen, S Borgeaud, G Irving, JB Lespiau, L Sifre, J Jumper
arXiv preprint arXiv:2302.01318, 2023
General-purpose, long-context autoregressive modeling with perceiver AR
C Hawthorne, A Jaegle, C Cangea, S Borgeaud, C Nash, M Malinowski, ...
International Conference on Machine Learning, 8535-8558, 2022
Unified scaling laws for routed language models
A Clark, D de Las Casas, A Guy, A Mensch, M Paganini, J Hoffmann, ...
International conference on machine learning, 4057-4086, 2022
Gemma: Open models based on gemini research and technology
G Team, T Mesnard, C Hardin, R Dadashi, S Bhupatiraju, S Pathak, ...
arXiv preprint arXiv:2403.08295, 2024
Spriteworld: A flexible, configurable reinforcement learning environment
N Watters, L Matthey, S Borgeaud, R Kabra, A Lerchner
Leveraging Sentence Similarity in Natural Language Generation: Improving Beam Search using Range Voting
S Borgeaud, G Emerson
arXiv preprint arXiv:1908.06288, 2019
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
M Reid, N Savinov, D Teplyashin, D Lepikhin, T Lillicrap, J Alayrac, ...
arXiv preprint arXiv:2403.05530, 2024
Human-agent cooperation in bridge bidding
E Lockhart, N Burch, N Bard, S Borgeaud, T Eccles, L Smaira, R Smith
arXiv preprint arXiv:2011.14124, 2020
RecurrentGemma: Moving Past Transformers for Efficient Open Language Models
A Botev, S De, SL Smith, A Fernando, GC Muraru, R Haroun, L Berrada, ...
arXiv preprint arXiv:2404.07839, 2024
The system can't perform the operation now. Try again later.
Articles 1–18