Follow
Florian Metze
Florian Metze
Verified email at andrew.cmu.edu - Homepage
Title
Cited by
Cited by
Year
EESEN: End-to-end speech recognition using deep RNN models and WFST-based decoding
Y Miao, M Gowayyed, F Metze
2015 IEEE workshop on automatic speech recognition and understanding (ASRU …, 2015
8762015
Extracting deep bottleneck features using stacked auto-encoders
J Gehring, Y Miao, F Metze, A Waibel
2013 IEEE international conference on acoustics, speech and signal …, 2013
3522013
Learning joint embedding with multimodal cues for cross-modal video-text retrieval
NC Mithun, J Li, F Metze, AK Roy-Chowdhury
Proceedings of the 2018 ACM on International Conference on Multimedia …, 2018
2552018
Videoclip: Contrastive pre-training for zero-shot video-text understanding
H Xu, G Ghosh, PY Huang, D Okhonko, A Aghajanyan, F Metze, ...
arXiv preprint arXiv:2109.14084, 2021
2502021
A one-pass decoder based on polymorphic linguistic context assignment
H Soltau, F Metze, C Fugen, A Waibel
IEEE Workshop on Automatic Speech Recognition and Understanding, 2001. ASRU …, 2001
2502001
How2: a large-scale dataset for multimodal language understanding
R Sanabria, O Caglayan, S Palaskar, D Elliott, L Barrault, L Specia, ...
arXiv preprint arXiv:1811.00347, 2018
2282018
Support-set bottlenecks for video-text representation learning
M Patrick, PY Huang, Y Asano, F Metze, A Hauptmann, J Henriques, ...
arXiv preprint arXiv:2010.02824, 2020
2032020
Comparison of four approaches to age and gender recognition for telephone applications
F Metze, J Ajmera, R Englert, U Bub, F Burkhardt, J Stegmann, C Muller, ...
2007 IEEE International Conference on Acoustics, Speech and Signal …, 2007
1972007
A comparison of five multiple instance learning pooling functions for sound event detection with weak labeling
Y Wang, J Li, F Metze
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
1802019
Advances in automatic meeting record creation and access
A Waibel, M Bett, F Metze, K Ries, T Schaaf, T Schultz, H Soltau, H Yu, ...
2001 IEEE International Conference on Acoustics, Speech, and Signal …, 2001
1802001
Session independent non-audible speech recognition using surface electromyography
L Maier-Hein, F Metze, T Schultz, A Waibel
IEEE Workshop on Automatic Speech Recognition and Understanding, 2005., 331-336, 2005
1652005
A comparison of deep learning methods for environmental sound detection
J Li, W Dai, F Metze, S Qu, S Das
2017 IEEE International conference on acoustics, speech and signal …, 2017
1492017
Keeping your eye on the ball: Trajectory attention in video transformers
M Patrick, D Campbell, Y Asano, I Misra, F Metze, C Feichtenhofer, ...
Advances in neural information processing systems 34, 12493-12506, 2021
1442021
Speaker adaptive training of deep neural network acoustic models using i-vectors
Y Miao, H Zhang, F Metze
IEEE/ACM Transactions on Audio, Speech, and Language Processing 23 (11 …, 2015
1372015
Deep maxout networks for low-resource speech recognition
Y Miao, F Metze, S Rawat
2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 398-403, 2013
1222013
Anger recognition in speech using acoustic and linguistic cues
T Polzehl, A Schmitt, F Metze, M Wagner
Speech Communication 53 (9-10), 1198-1209, 2011
1162011
A summary of the 2012 JHU CLSP workshop on zero resource speech technologies and models of early language acquisition
A Jansen, E Dupoux, S Goldwater, M Johnson, S Khudanpur, K Church, ...
2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013
1142013
A flexible stream architecture for ASR using articulatory features
F Metze, A Waibel
Seventh International Conference on Spoken Language Processing, 2002
1122002
Automatically assessing personality from speech
T Polzehl, S Möller, F Metze
2010 IEEE fourth international conference on semantic computing, 134-140, 2010
1112010
Towards speaker adaptive training of deep neural network acoustic models
Y Miao, H Zhang, F Metze
Fifteenth annual conference of the international speech communication …, 2014
1062014
The system can't perform the operation now. Try again later.
Articles 1–20