Follow
Ona de Gibert Bonet
Ona de Gibert Bonet
Verified email at helsinki.fi
Title
Cited by
Cited by
Year
Bloom: A 176b-parameter open-access multilingual language model
T Le Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ...
12872023
Hate speech dataset from a white supremacy forum
O De Gibert, N Perez, A García-Pablos, M Cuadros
arXiv preprint arXiv:1809.04444, 2018
4872018
Are multilingual models the best choice for moderately under-resourced languages? A comprehensive assessment for Catalan
J Armengol-Estapé, CP Carrino, C Rodriguez-Penagos, OG Bonet, ...
arXiv preprint arXiv:2107.07903, 2021
342021
On the multilingual capabilities of very large-scale English language models
J Armengol-Estapé, OG Bonet, M Melero
arXiv preprint arXiv:2108.13349, 2021
222021
Spanish biomedical crawled corpus: A large, diverse dataset for spanish biomedical language models
CP Carrino, J Armengol-Estapé, OG Bonet, A Gutiérrez-Fandiño, ...
arXiv preprint arXiv:2109.07765, 2021
162021
Spanish biomedical and clinical language embeddings
A Gutiérrez-Fandino, J Armengol-Estapé, CP Carrino, O De Gibert, ...
arXiv preprint arXiv:2102.12843, 2021
92021
Estrategia multidimensional para la selección de candidatos de traducción automática para posedición
N Aranberri, O de Gibert
Linguamática 11 (2), 3-16, 2019
82019
Quality versus Quantity: Building Catalan-English MT Resources
O de Gibert, K Kharitonova, BC Figueras, J Armengol-Estapé, M Melero
5*2022
Automatic removal of identifying information in official eu languages for public administrations: The mapa project
L Gianola, Ē Ajausks, V Arranz, C Bendahman, L Bié, C Borg, A Cerdà, ...
Legal Knowledge and Information Systems, 223-226, 2020
52020
The catalan language club
C Rodriguez-Penagos, C Armentano-Oller, M Villegas, M Melero, ...
arXiv preprint arXiv:2112.01894, 2021
42021
The OPUS-MT Dashboard–A Toolkit for a Systematic Evaluation of Open Machine Translation Models
J Tiedemann, O De Gibert
Proceedings of the 61st Annual Meeting of the Association for Computational …, 2023
32023
Spanish Datasets for Sensitive Entity Detection in the Legal Domain
O de Gibert, A Garcıa-Pablos, M Cuadros, M Melero
3*2022
Four Approaches to Low-Resource Multilingual NMT: The Helsinki Submission to the AmericasNLP 2023 Shared Task
O De Gibert, R Vázquez, M Aulamo, Y Scherrer, S Virpioja, J Tiedemann
Proceedings of the Workshop on Natural Language Processing for Indigenous …, 2023
22023
Unsupervised Machine Translation in Real-World Scenarios
O de Gibert, I Goenaga, J Armengol-Estapé, O Perez-de-Vinaspre
2*2022
to post-edit or to translate... That is the question: a case study of a recommender system for Quality Estimation of Machine Translation based on linguistic features
O de Gibert Bonet
22018
MAMMOTH: Massively Multilingual Modular Open Translation@ Helsinki
T Mickus, SA Grönroos, J Attieh, M Boggia, O De Gibert, S Ji, NA Lopi, ...
arXiv preprint arXiv:2403.07544, 2024
12024
Sequence-to-Sequence Resources for Catalan
O de Gibert, K Kharitonova, BC Figueras, J Armengol-Estapé, M Melero
arXiv preprint arXiv:2202.06871, 2022
12022
Transfer Learning with Shallow Decoders: BSC at WMT2021’s Multilingual Low-Resource Translation for Indo-European Languages Shared Task
K Kharitonova, O de Gibert Bonet, J Armengol-Estapé, MR i Alvarez, ...
Proceedings of the Sixth Conference on Machine Translation, 362-367, 2021
12021
A New Massive Multilingual Dataset for High-Performance Language Technologies
O De Gibert, G Nail, N Arefyev, M Bañón, J Van Der Linde, S Ji, ...
arXiv preprint arXiv:2403.14009, 2024
2024
HPLT’s First Release of Data and Models
N Arefyev, M Aulamo, P Chen, O de Gibert, B Haddow, J Helcl, B Malik, ...
2024
The system can't perform the operation now. Try again later.
Articles 1–20