CopyCat: Many-to-Many Fine-Grained Prosody Transfer for Neural Text-to-Speech S Karlapati, A Moinet, A Joly, V Klimkov, D Sáez-Trigueros, T Drugman arXiv preprint arXiv:2004.14617, 2020 | 72 | 2020 |
CAMP: a Two-Stage Approach to Modelling Prosody in Context Z Hodari, A Moinet, S Karlapati, J Lorenzo-Trueba, T Merritt, A Joly, ... ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 30 | 2021 |
Prosodic Representation Learning and Contextual Sampling for Neural Text-to-Speech S Karlapati, A Abbas, Z Hodari, A Moinet, A Joly, P Karanasou, ... ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 19 | 2021 |
Simple and Effective Multi-sentence TTS with Expressive and Coherent Prosody P Makarov, A Abbas, M Łajszczak, A Joly, S Karlapati, A Moinet, ... arXiv preprint arXiv:2206.14643, 2022 | 14 | 2022 |
CopyCat2: A Single Model for Multi-Speaker TTS and Many-to-Many Fine-Grained Prosody Transfer S Karlapati, P Karanasou, M Lajszczak, A Abbas, A Moinet, P Makarov, ... arXiv preprint arXiv:2206.13443, 2022 | 11 | 2022 |
A learned conditional prior for the VAE acoustic space of a TTS system P Karanasou, S Karlapati, A Moinet, A Joly, A Abbas, S Slangen, ... | 9 | 2021 |
Expressive, Variable, and Controllable Duration Modelling in TTS A Abbas, T Merritt, A Moinet, S Karlapati, E Muszynska, S Slangen, E Gatti, ... arXiv preprint arXiv:2206.14165, 2022 | 8 | 2022 |
Voicy: Zero-Shot Non-Parallel Voice Conversion in Noisy Reverberant Environments A Mottini, J Lorenzo-Trueba, SVK Karlapati, T Drugman arXiv preprint arXiv:2106.08873, 2021 | 7 | 2021 |
Predicting deformation mechanisms in architected metamaterials using GNN PP Indurkar, S Karlapati, AJD Shaikeea, VS Deshpande arXiv preprint arXiv:2202.09427, 2022 | 3 | 2022 |
Hash based frequent pattern mining approach to text compression C Oswald, S Srinidhi, KS Vishnu, TV Vishal, B Sivaselvan First EAI International Conference on Computer Science and Engineering, 228-238, 2017 | 3 | 2017 |
eCat: An end-to-end model for multi-speaker TTS & many-to-many fine-grained prosody transfer A Abbas, S Karlapati, B Schnell, P Karanasou, MG Moya, A Nagaraj, ... arXiv preprint arXiv:2306.11327, 2023 | 1 | 2023 |
Learned condition text-to-speech synthesis P Karanasou, SVK Karlapati, AP Moinet, AVPY Joly, SA Abbas, ... US Patent 11,830,476, 2023 | | 2023 |
Synthetic speech processing JL Trueba, ARM D'Oliveira, TR Drugman, SVK KARLAPATI US Patent 11,735,156, 2023 | | 2023 |
Synthetic speech processing JL Trueba, ARM D'Oliveira, TR Drugman, SVK KARLAPATI US Patent App. 18/305,456, 2023 | | 2023 |
Multi-scale spectrogram text-to-speech SA Abbas, B Bollepalli, AP Moinet, TR Drugman, AVPY Joly, P Karanasou, ... US Patent 11,694,674, 2023 | | 2023 |
A Comparative Analysis of Pretrained Language Models for Text-to-Speech MG Moya, P Karanasou, S Karlapati, B Schnell, N Peinelt, A Moinet, ... 12th Speech Synthesis Workshop (SSW) 2023, 2023 | | 2023 |
Synthetic speech processing AVPY Joly, P Karanasou, APJ Moinet, TR Drugman, SVK Karlapati, ... US Patent 11,574,624, 2023 | | 2023 |
NEURAL NETWORK MEMORY FOR AUDIO SVK Karlapati, P Karanasou, AVPY Joly, AP Moinet, TR Drugman, ... US Patent App. 17/357,585, 2022 | | 2022 |
Multi-Scale Spectrogram Modelling for Neural Text-to-Speech A Abbas, B Bollepalli, A Moinet, A Joly, P Karanasou, P Makarov, ... arXiv preprint arXiv:2106.15649, 2021 | | 2021 |
Triah: an intelligent guiding system for the visually impaired V Thanvantri Vasudevan, S Sridharan, S Swaminathan, SVK Karlapati, ... CSI transactions on ICT 4, 165-171, 2016 | | 2016 |