Taja Kuzman

Cited by

	All	Since 2019
Citations	283	266
h-index	7	6
i10-index	7	6

Citations per year
Year	Citations
2016	6
2017	5
2018	5
2019	9
2020	6
2021	13
2022	23
2023	114
2024	94

Public access

11 articles available
1 article not available

Based on funding mandates

Co-authors

Nikola Ljubešić	Researcher at Jožef Stefan Institute	Verified email at ijs.si
Peter Rupnik	Jožef Stefan Institute	Verified email at ijs.si
Rik van Noord	Postdoc in the MaCoCu project, University of Groningen	Verified email at rug.nl
Antonio Toral	Assistant Professor, University of Groningen	Verified email at rug.nl
Gema Ramírez-Sánchez	CEO at Prompsit Language Engineering, computational linguist	Verified email at prompsit.com
Vít Suchomel	Masaryk University and Lexical Computing Ltd.	Verified email at mail.muni.cz
Mikel L. Forcada (ORCID 0000-0003-0...	Professor of Computer Languages and Systems, Universitat d'Alacant	Verified email at ua.es
Leopoldo Pla Sempere	Universidad de Alicante	Verified email at dlsi.ua.es
Marta Bañón	Prompsit Language Engineering	Verified email at prompsit.com
Simon Krek	Researcher at Jožef Stefan Institute	Verified email at ijs.si
Miquel Esplà-Gomis	Universitat d'Alacant	Verified email at dlsi.ua.es
Tomaž Erjavec	Jožef Stefan Institute	Verified email at ijs.si
Jaka Čibej	Researcher and Teaching Assistant, University of Ljubljana	Verified email at ff.uni-lj.si
Polona Gantar	Researcher, Faculty of Arts, Ljubljana, Slovenia	Verified email at guest.arnes.si
Špela Vintar	Full Professor, University of Ljubljana	Verified email at ff.uni-lj.si
Špela Arhar Holdt	Research Associate at University of Ljubljana, Slovenia	Verified email at cjvt.si
Kaja Dobrovoljc	Research Associate, University of Ljubljana, Jožef Stefan Institute	Verified email at ijs.si
Mihael Arcan	Co-Founder and Chief Scientific Officer (CSO) @ Lua Health	Verified email at luahealth.io
Darja Fišer	Assistant Professor, University of Ljubljana	Verified email at ff.uni-lj.si
Mojca Brglez	Junior Researcher / PhD student, University of Ljubljana	Verified email at ff.uni-lj.si

Taja Kuzman

Department of Knowledge Technologies, Jožef Stefan Institute

Verified email at ijs.si

computational linguistics, language technology, natural language processing, web corpora, genre identification


Title	Cited by	Year
Automatic genre identification: a survey. T Kuzman, N Ljubešić. Language Resources and Evaluation, 1-34, 2023	82	2023
Neural machine translation of literary texts from English to Slovene. T Kuzman, Š Vintar, M Arcan. Proceedings of the qualities of literary machine translation, 1-9, 2019	33	2019
Linguistically annotated multilingual comparable corpora of parliamentary debates ParlaMint.ana 2.0. T Erjavec, M Ogrodniczuk, P Osenova, N Ljubešić, K Simov, V Grigorova, ... CLARIN ERIC, 2021	26*	2021
ChatGPT: Beginning of an End of Manual Linguistic Data Annotation? Use Case of Automatic Genre Identification. T Kuzman, I Mozetič, N Ljubešić. arXiv preprint arXiv:2303.03953, 2023	24*	2023
Training corpus ssj500k 1.3. Slovenian language resource repository CLARIN.SI. S Krek, T Erjavec, K Dobrovoljc, S Može, N Ledinek, N Holz	20	2013
MaCoCu: Massive collection and curation of monolingual and bilingual data: focus on under-resourced languages. M Bañón, M Esplà-Gomis, ML Forcada, C García-Romero, T Kuzman, ... 23rd Annual Conference of the European Association for Machine Translation …, 2022	15	2022
The GINCO training dataset for web genre identification of documents out in the wild. T Kuzman, P Rupnik, N Ljubešić. arXiv preprint arXiv:2201.03857, 2022	12	2022
Automatic genre identification for robust enrichment of massive text collections: Investigation of classification methods in the era of large language models. T Kuzman, I Mozetič, N Ljubešić. Machine Learning and Knowledge Extraction 5 (3), 1149-1175, 2023	6	2023
BENCHić-lang: A benchmark for discriminating between Bosnian, Croatian, Montenegrin and Serbian. P Rupnik, T Kuzman, N Ljubešić. Tenth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial …, 2023	6	2023
Assessing comparability of genre datasets via cross-lingual and cross-dataset experiments. T Kuzman, N Ljubešić, S Pollak. Jezikovne tehnologije in digitalna humanistika: zbornik konference …, 2022	6	2022
Verbal multiword expressions in Slovene. P Gantar, S Krek, T Kuzman. Computational and Corpus-Based Phraseology: Second International Conference …, 2017	6	2017
Choice of plausible alternatives dataset in Serbian COPA-SR. N Ljubešić, M Starović, T Kuzman, T Samardžić. Jožef Stefan Institute, 2022	5	2022
Exploring the Impact of Lexical and Grammatical Features on Automatic Genre Identification. T Kuzman, N Ljubešić. Proceedings of the Odkrivanje Znanja in Podatkovna Skladišča - SiKDD …, 2022	5	2022
Get to know your parallel data: Performing English variety and genre classification over MaCoCu corpora. T Kuzman, P Rupnik, N Ljubešić. Tenth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial …, 2023	4	2023
Slovene-English parallel corpus MaCoCu-sl-en 2.0. M Bañón, M Chichirau, M Esplà-Gomis, ML Forcada, A Galiano-Jiménez, ... Jožef Stefan Institute, 2023	4	2023
Training corpus ssj500k 2.2. S Krek, K Dobrovoljc, T Erjavec, S Može, N Ledinek, N Holz, K Zupan, ... Centre for Language Resources and Technologies, University of Ljubljana, 2019	3	2019
Glagolske večbesedne enote v učnem korpusu ssj500k 2.1. P Gantar, ŠA Holdt, J Čibej, T Kuzman, T Kavčič. Proceedings of the conference on Language Technologies & Digital Humanities …, 2018	3	2018
Serbian web corpus CLASSLA-web.sr 1.0. N Ljubešić, P Rupnik, T Kuzman. Jožef Stefan Institute, 2024	2*	2024
The ParlaMint Project: Ever-growing Family of Comparable and Interoperable Parliamentary Corpora. M Ogrodniczuk, P Osenova, T Erjavec, D Fišer, N Ljubešić, Ç Çöltekin, ... CLARIN Annual Conference Proceedings, 62, 2023	2	2023
Montenegrin-English parallel corpus MaCoCu-cnr-en 1.0. M Bañón, M Chichirau, M Esplà-Gomis, ML Forcada, A Galiano-Jiménez, ... Jožef Stefan Institute, 2023	2*	2023
