Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context G Team, P Georgiev, VI Lei, R Burnell, L Bai, A Gulati, G Tanzer, ... arXiv preprint arXiv:2403.05530, 2024 | 684 | 2024 |
Voicefilter: Targeted voice separation by speaker-conditioned spectrogram masking Q Wang, H Muckenhirn, K Wilson, P Sridhar, Z Wu, J Hershey, ... arXiv preprint arXiv:1810.04826, 2018 | 437 | 2018 |
Towards directly modeling raw speech signal for speaker verification using CNNs H Muckenhirn, M Magimai-Doss, S Marcel 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 164 | 2018 |
AudioPaLM: A large language model that can speak and listen PK Rubenstein, C Asawaroengchai, DD Nguyen, A Bapna, Z Borsos, ... arXiv preprint arXiv:2306.12925, 2023 | 152 | 2023 |
Overview of BTAS 2016 speaker anti-spoofing competition P Korshunov, S Marcel, H Muckenhirn, AR Gonçalves, AGS Mello, ... 2016 IEEE 8th international conference on biometrics theory, applications …, 2016 | 102 | 2016 |
End-to-end convolutional neural network-based voice presentation attack detection H Muckenhirn, M Magimai-Doss, S Marcel 2017 IEEE international joint conference on biometrics (IJCB), 335-341, 2017 | 76 | 2017 |
Long-term spectral statistics for voice presentation attack detection H Muckenhirn, P Korshunov, M Magimai-Doss, S Marcel IEEE/ACM Transactions on Audio, Speech, and Language Processing 25 (11 …, 2017 | 59 | 2017 |
On Learning to Identify Genders from Raw Speech Signal Using CNNs SH Kabil, H Muckenhirn, M Magimai-Doss | 57 | 2018 |
Understanding and Visualizing Raw Waveform-Based CNNs. H Muckenhirn, V Abrol, M Magimai-Doss, S Marcel Interspeech, 2345-2349, 2019 | 41 | 2019 |
Presentation attack detection using long-term spectral statistics for trustworthy speaker verification H Muckenhirn, M Magimai-Doss, S Marcel 2016 International Conference of the Biometrics Special Interest Group …, 2016 | 21 | 2016 |
On Learning Vocal Tract System Related Speaker Discriminative Information from Raw Signal Using CNNs H Muckenhirn, M Magimai-Doss, S Marcel Proc. of Interspeech, 2018 | 19 | 2018 |
Voicefilter: Targeted voice separation by speaker-conditioned spectrogram masking HR Muckenhirn, IL Moreno, J Hershey, K Wilson, P Sridhar, Q Wang, ... conference of the international speech communication association, 2019 | 5 | 2019 |
Gradient-based spectral visualization of CNNs using raw waveforms H Muckenhirn, V Abrol, M Magimai-Doss, S Marcel | 5 | 2018 |
Targeted voice separation by speaker conditioned on spectrogram masking Q Wang, P Sridhar, IL Moreno, H Muckenhirn US Patent 11,217,254, 2022 | 3 | 2022 |
Targeted voice separation by speaker conditioned on spectrogram masking Q Wang, P Sridhar, IL Moreno, H Muckenhirn US Patent 11,922,951, 2024 | 2 | 2024 |
Generating audio waveforms using encoder and decoder neural networks Y Li, M Tagliasacchi, D Roblek, F de Chaumont Quitry, B Gfeller, ... US Patent App. 17/856,292, 2023 | 2 | 2023 |
Long-term Spectral Statistics For Voice Presentation Attack Detection H Muckenhirn, P Korshunov, M Magimai-Doss, S Marcel IEEE …, 2017 | 2 | 2017 |
CycleGAN-Based Unpaired Speech Dereverberation H Muckenhirn, A Safin, H Erdogan, FC Quitry, M Tagliasacchi, S Wisdom, ... arXiv preprint arXiv:2203.15652, 2022 | 1 | 2022 |
Trustworthy speaker recognition with minimal prior knowledge using neural networks H Muckenhirn EPFL, 2019 | | 2019 |
Type of publication: Idiap-RR Citation: Muckenhirn_Idiap-RR-30-2017 Number: Idiap-RR-30-2017 Year: 2017 Month: 11 H Muckenhirn | | 2017 |