Follow
Zhenyao Zhu
Zhenyao Zhu
Verified email at google.com
Title
Cited by
Cited by
Year
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin
D Amodei, R Anubhai, E Battenberg, C Case, J Casper, B Catanzaro, ...
International Conference on Machine Learning (ICML), 2015
37732015
Deep Speaker: an End-to-End Neural Speaker Embedding System
C Li, X Ma, B Jiang, X Li, X Zhang, X Liu, Y Cao, A Kannan, Z Zhu
arXiv preprint arXiv:1705.02304, 2017
5752017
Deep learning identity-preserving face space
Z Zhu, P Luo, X Wang, X Tang
2013 IEEE International Conference on Computer Vision (ICCV), 113-120, 2013
4042013
Multi-view perceptron: a deep model for learning face identity and view representations
Z Zhu, P Luo, X Wang, X Tang
Advances in Neural Information Processing Systems (NIPS), 217-225, 2014
3052014
Fully supervised speaker diarization
A Zhang, Q Wang, Z Zhu, J Paisley, C Wang
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
2672019
Exploring Neural Transducers for End-to-End Speech Recognition
E Battenberg, J Chen, R Child, A Coates, Y Gaur, Y Li, H Liu, S Satheesh, ...
Automatic Speech Recognition and Understanding (ASRU) 2017, 2017
2592017
Face Model Compression by Distilling Knowledge from Neurons.
P Luo, Z Zhu, Z Liu, X Wang, X Tang
The AAAI Conference on Artificial Intelligence (AAAI) 2016, 3560-3566, 2015
2542015
DeepID-Net: multi-stage and deformable deep convolutional neural networks for object detection
W Ouyang, P Luo, X Zeng, S Qiu, Y Tian, H Li, S Yang, Z Wang, Y Xiong, ...
arXiv preprint arXiv:1409.3505, 2014
1762014
Recover canonical-view faces in the wild with deep neural networks
Z Zhu, P Luo, X Wang, X Tang
arXiv preprint arXiv:1404.3543, 2014
1512014
Deployed end-to-end speech recognition
B Catanzaro, J Chen, M Chrzanowski, E Elsen, J Engel, C Fougner, ...
US Patent App. 10/319,374, 2019
1332019
End-to-end speech recognition
B Catanzaro, J Chen, M Chrzanowski, E Elsen, J Engel, C Fougner, ...
US Patent 10,332,509, 2019
1072019
Learning Multiscale Features Directly From Waveforms
Z Zhu, JH Engel, A Hannun
International Speech Communication Association (Interspeech) 2016, 2016
892016
Gram-CTC: Automatic Unit Selection and Target Decomposition for Sequence Labelling
H Liu, Z Zhu, X Li, S Satheesh
International Conference on Machine Learning (ICML), 2017, 2017
642017
Methods and systems for verifying face images based on canonical images
X Tang, ZHU Zhenyao, P Luo, X Wang
US Patent 10,037,457, 2018
60*2018
Deep learning multi-view representation for face recognition
Z Zhu, P Luo, X Wang, X Tang
arXiv preprint arXiv:1406.6947, 2014
432014
Systems and methods for principled bias reduction in production speech models
E Battenberg, R CHILD, A Coates, C Fougner, G Yashesh, J Huang, ...
US Patent App. 15/884,239, 2018
142018
Deep Generative and Discriminative Domain Adaptation
H Zhao, J Hu, Z Zhu, A Coates, G Gordon
Proceedings of the 18th International Conference on Autonomous Agents and …, 2019
92019
Reducing Bias in Production Speech Models
E Battenberg, R Child, A Coates, C Fougner, Y Gaur, J Huang, H Jun, ...
arXiv preprint arXiv:1705.04400, 2017
82017
Systems and methods for automatic unit selection and target decomposition for sequence labelling
H Liu, ZHU Zhenyao, S Satheesh
US Patent 10,373,610, 2019
72019
Method and system for exacting face features from data of face images
X Tang, ZHU Zhenyao, P Luo, X Wang
US Patent 9,710,697, 2017
72017
The system can't perform the operation now. Try again later.
Articles 1–20