Follow
Francisco (Paco) Guzmán
Francisco (Paco) Guzmán
Research Scientist - Facebook AI
No verified email
Title
Cited by
Cited by
Year
Unsupervised cross-lingual representation learning at scale
A Conneau
arXiv preprint arXiv:1911.02116, 2019
60732019
No language left behind: Scaling human-centered machine translation
N Team, MR Costa-jussŕ, J Cross, O Çelebi, M Elbayad, K Heafield, ...
arXiv preprint arXiv:2207.04672, 2022
6012022
CCNet: Extracting high quality monolingual datasets from web crawl data
G Wenzek, MA Lachaux, A Conneau, V Chaudhary, F Guzmán, A Joulin, ...
arXiv preprint arXiv:1911.00359, 2019
5802019
The flores-101 evaluation benchmark for low-resource and multilingual machine translation
N Goyal, C Gao, V Chaudhary, PJ Chen, G Wenzek, D Ju, S Krishnan, ...
Transactions of the Association for Computational Linguistics 10, 522-538, 2022
3882022
Wikimatrix: Mining 135m parallel sentences in 1620 language pairs from wikipedia
H Schwenk, V Chaudhary, S Sun, H Gong, F Guzmán
arXiv preprint arXiv:1907.05791, 2019
3362019
The flores evaluation datasets for low-resource machine translation: Nepali-english and sinhala-english
F Guzmán, PJ Chen, M Ott, J Pino, G Lample, P Koehn, V Chaudhary, ...
arXiv preprint arXiv:1902.01382, 2019
3042019
The llama 3 herd of models
A Dubey, A Jauhri, A Pandey, A Kadian, A Al-Dahle, A Letman, A Mathur, ...
arXiv preprint arXiv:2407.21783, 2024
2202024
CCAligned: A massive collection of cross-lingual web-document pairs
A El-Kishky, V Chaudhary, F Guzmán, P Koehn
arXiv preprint arXiv:1911.06154, 2019
1672019
The AMARA corpus: Building parallel language resources for the educational domain
A Abdelali, F Guzmán, H Sajjad, S Vogel
Proceedings of the 9th International Conference on Language Resources and …, 2014
1452014
Unsupervised quality estimation for neural machine translation
M Fomicheva, S Sun, L Yankovskaya, F Blain, F Guzmán, M Fishel, ...
Transactions of the Association for Computational Linguistics 8, 539-555, 2020
1442020
Using Discourse Structure Improves Machine Translation Evaluation
F Guzmán, S Joty, L Mŕrquez, P Nakov
Proceedings of the 52nd Annual Meeting of the Association for Computational …, 2014
1222014
Unsupervised cross-lingual representation learning at scale. CoRR abs/1911.02116 (2019)
A Conneau, K Khandelwal, N Goyal, V Chaudhary, G Wenzek, F Guzmán, ...
URL: http://arxiv. org/abs/1911.02116, 1911
901911
TICO-19: the translation initiative for COvid-19
A Anastasopoulos, A Cattelan, ZY Dou, M Federico, C Federman, ...
arXiv preprint arXiv:2007.01788, 2020
842020
Low-resource corpus filtering using multilingual sentence embeddings
V Chaudhary, Y Tang, F Guzmán, H Schwenk, P Koehn
arXiv preprint arXiv:1906.08885, 2019
832019
Findings of the WMT 2019 shared task on parallel corpus filtering for low-resource conditions
P Koehn, F Guzmán, V Chaudhary, J Pino
Proceedings of the Fourth Conference on Machine Translation (Volume 3 …, 2019
812019
Findings of the WMT 2020 shared task on parallel corpus filtering and alignment
P Koehn, V Chaudhary, A El-Kishky, N Goyal, PJ Chen, F Guzmán
Proceedings of the Fifth Conference on Machine Translation, 726-742, 2020
792020
Optimizing for Sentence-Level BLEU+1 Yields Short Translations
P Nakov, F Guzmán, S Vogel
COLING, 2012
732012
SeamlessM4T-Massively Multilingual & Multimodal Machine Translation
L Barrault, YA Chung, MC Meglioli, D Dale, N Dong, PA Duquenne, ...
arXiv preprint arXiv:2308.11596, 2023
722023
Seamless: Multilingual Expressive and Streaming Speech Translation
L Barrault, YA Chung, MC Meglioli, D Dale, N Dong, M Duppenthaler, ...
arXiv preprint arXiv:2312.05187, 2023
692023
MLQE-PE: A multilingual quality estimation and post-editing dataset
M Fomicheva, S Sun, E Fonseca, C Zerva, F Blain, V Chaudhary, ...
arXiv preprint arXiv:2010.04480, 2020
582020
The system can't perform the operation now. Try again later.
Articles 1–20