Fine-grained robust prosody transfer for single-speaker neural text-to-speech V Klimkov, S Ronanki, J Rohnke, T Drugman arXiv preprint arXiv:1907.02479, 2019 | 103 | 2019 |
Copycat: Many-to-many fine-grained prosody transfer for neural text-to-speech S Karlapati, A Moinet, A Joly, V Klimkov, D Sáez-Trigueros, T Drugman arXiv preprint arXiv:2004.14617, 2020 | 85 | 2020 |
Effect of data reduction on sequence-to-sequence neural TTS J Latorre, J Lachowicz, J Lorenzo-Trueba, T Merritt, T Drugman, ... ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 80 | 2019 |
Universal neural vocoding with parallel wavenet Y Jiao, A Gabryś, G Tinchev, B Putrycz, D Korzekwa, V Klimkov ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 57 | 2021 |
Traditional machine learning for pitch detection T Drugman, G Huybrechts, V Klimkov, A Moinet IEEE Signal Processing Letters 25 (11), 1745-1749, 2018 | 42 | 2018 |
Phrase break prediction for long-form reading TTS: Exploiting text structure information V Klimkov, A Nadolski, A Moinet, B Putrycz, R Barra-Chicote, T Merritt, ... | 34 | 2018 |
Non-autoregressive TTS with explicit duration modelling for low-resource highly expressive speech R Shah, K Pokora, A Ezzerg, V Klimkov, G Huybrechts, B Putrycz, ... arXiv preprint arXiv:2106.12896, 2021 | 31 | 2021 |
Comprehensive evaluation of statistical speech waveform synthesis T Merritt, B Putrycz, A Nadolski, T Ye, D Korzekwa, W Dolecki, T Drugman, ... 2018 IEEE Spoken Language Technology Workshop (SLT), 325-331, 2018 | 22 | 2018 |
Text-to-speech (TTS) processing with transfer of vocal characteristics V Klimkov, TR Drugman, A Galkin, S Ronanki US Patent 11,410,684, 2022 | 17 | 2022 |
Text-to-speech (TTS) processing JL Trueba, TR Drugman, V Klimkov, S Ronanki, TE Merritt, AP Breen, ... US Patent 10,741,169, 2020 | 14 | 2020 |
Varying speaking styles with neural text-to-speech T Wood, T Merritt Alexa Blogs, Nov 19, 2018 | 12 | 2018 |
Contextual text-to-speech processing RB Chicote, J Latorre, AF Nadolski, V Klimkov, TE Merritt US Patent 10,475,438, 2019 | 10 | 2019 |
Enhancing audio quality for expressive neural text-to-speech A Ezzerg, A Gabrys, B Putrycz, D Korzekwa, D Saez-Trigueros, ... arXiv preprint arXiv:2108.06270, 2021 | 9 | 2021 |
Homograph disambiguation with contextual word embeddings for TTS systems M Nicolis, V Klimkov | 7 | 2021 |
Parameter generation algorithms for text-to-speech synthesis with recurrent neural networks V Klimkov, A Moinet, A Nadolski, T Drugman 2018 IEEE Spoken Language Technology Workshop (SLT), 626-631, 2018 | 7 | 2018 |
On granularity of prosodic representations in expressive text-to-speech M Babiański, K Pokora, R Shah, R Sienkiewicz, D Korzekwa, V Klimkov 2022 IEEE Spoken Language Technology Workshop (SLT), 892-899, 2023 | 6 | 2023 |
Text-to-speech (TTS) processing JL Trueba, TR Drugman, V Klimkov, S Ronanki, TE Merritt, AP Breen, ... US Patent 11,410,639, 2022 | 5 | 2022 |
Neural text-to-speech makes speech synthesizers much more versatile JL Trueba, V Klimkov Amazon Science, 2019 | 5 | 2019 |
Expressive machine dubbing through phrase-level cross-lingual prosody transfer J Swiatkowski, D Wang, M Babianski, G Coccia, PL Tobing, R Vipperla, ... arXiv preprint arXiv:2306.11662, 2023 | 4 | 2023 |
Improving the expressiveness of neural vocoding with non-affine Normalizing Flows A Gabryś, Y Jiao, V Klimkov, D Korzekwa, R Barra-Chicote arXiv preprint arXiv:2106.08649, 2021 | 3 | 2021 |