Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, JB Alayrac, J Yu, R Soricut, J Schalkwyk, ... arXiv preprint arXiv:2312.11805, 2023 | 2236 | 2023 |
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context G Team, P Georgiev, VI Lei, R Burnell, L Bai, A Gulati, G Tanzer, ... arXiv preprint arXiv:2403.05530, 2024 | 720 | 2024 |
Multilingual universal sentence encoder for semantic retrieval Y Yang, D Cer, A Ahmad, M Guo, J Law, N Constant, GH Abrego, S Yuan, ... arXiv preprint arXiv:1907.04307, 2019 | 560 | 2019 |
Character-level language modeling with deeper self-attention R Al-Rfou, D Choe, N Constant, M Guo, L Jones Proceedings of the AAAI conference on artificial intelligence 33 (01), 3159-3166, 2019 | 471 | 2019 |
LongT5: Efficient text-to-text transformer for long sequences M Guo, J Ainslie, D Uthus, S Ontañón, J Ni, YH Sung, Y Yang arXiv preprint arXiv:2112.07916, 2021 | 276 | 2021 |
Effective parallel corpus mining using bilingual sentence embeddings M Guo, Q Shen, Y Yang, H Ge, D Cer, GH Abrego, K Stevens, N Constant, ... arXiv preprint arXiv:1807.11906, 2018 | 129 | 2018 |
Improving multilingual sentence embedding using bi-directional dual encoder with additive margin softmax Y Yang, GH Abrego, S Yuan, M Guo, Q Shen, D Cer, YH Sung, B Strope, ... arXiv preprint arXiv:1902.08564, 2019 | 125 | 2019 |
Wiki-40B: Multilingual language model dataset M Guo, Z Dai, D Vrandečić, R Al-Rfou Proceedings of the Twelfth Language Resources and Evaluation Conference …, 2020 | 112 | 2020 |
MURAL: Multimodal, multitask retrieval across languages A Jain, M Guo, K Srinivasan, T Chen, S Kudugunta, C Jia, Y Yang, ... arXiv preprint arXiv:2109.05125, 2021 | 84 | 2021 |
CoLT5: Faster long-range transformers with conditional computation J Ainslie, T Lei, M de Jong, S Ontañón, S Brahma, Y Zemlyanskiy, ... arXiv preprint arXiv:2303.09752, 2023 | 59 | 2023 |
TextSETTR: Few-shot text style extraction and tunable targeted restyling P Riley, N Constant, M Guo, G Kumar, D Uthus, Z Parekh arXiv preprint arXiv:2010.03802, 2020 | 45 | 2020 |
Neural retrieval for question answering with cross-attention supervised data augmentation Y Yang, N Jin, K Lin, M Guo, D Cer arXiv preprint arXiv:2009.13815, 2020 | 32 | 2020 |
Hierarchical document encoder for parallel corpus mining M Guo, Y Yang, K Stevens, D Cer, H Ge, Y Sung, B Strope, R Kurzweil arXiv preprint arXiv:1906.08401, 2019 | 23 | 2019 |
MultiReQA: A cross-domain evaluation for retrieval question answering models M Guo, Y Yang, D Cer, Q Shen, N Constant Proceedings of the Second Workshop on Domain Adaptation for NLP, 94-104, 2021 | 22 | 2021 |
MultiReQA: A cross-domain evaluation for retrieval question answering models M Guo, Y Yang, D Cer, Q Shen, N Constant arXiv preprint arXiv:2005.02507, 2020 | 21 | 2020 |
CoBIT: A contrastive bi-directional image-text generation model H You, M Guo, Z Wang, KW Chang, J Baldridge, J Yu arXiv preprint arXiv:2303.13455, 2023 | 18 | 2023 |
Bridging the gap for tokenizer-free language models D Choe, R Al-Rfou, M Guo, H Lee, N Constant arXiv preprint arXiv:1908.10322, 2019 | 18 | 2019 |
TextSETTR: Label-free text style extraction and tunable targeted restyling P Riley, N Constant, M Guo, G Kumar, D Uthus, Z Parekh arXiv preprint arXiv:2010.03802, 2020 | 15 | 2020 |
Imagen 3 J Baldridge, J Bauer, M Bhutani, N Brichtova, A Bunner, K Chan, Y Chen, ... arXiv preprint arXiv:2408.07009, 2024 | 9 | 2024 |
mLongT5: A multilingual and efficient text-to-text transformer for longer sequences D Uthus, S Ontañón, J Ainslie, M Guo arXiv preprint arXiv:2305.11129, 2023 | 8 | 2023 |