Yu Zhang

Cited by

	All	Since 2019
Citations	23448	21692
h-index	57	56
i10-index	113	103

7000

3500

1750

5250

2015201620172018201920202021202220232024101 264 417 865 1426 2350 4035 5148 6471 2206

Public access

View all

7 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Yonghui WuGoogle BrainVerified email at google.com
Chung-Cheng ChiuAppleVerified email at apple.com
Wei HanVerified email at illinois.edu
Ye JiaMetaVerified email at google.com
Ron J WeissGoogleVerified email at google.com
William ChanIdeogramVerified email at ideogram.ai
Heiga ZenPrincipal Scientist (Director), Google DeepMindVerified email at google.com
Ruoming Pang (庞若鸣)Apple AI/MLVerified email at apple.com
James GlassMIT Computer Science and Artificial Intelligence LaboratoryVerified email at mit.edu
James QinGoogleVerified email at google.com
Bo LiGoogleVerified email at google.com
Jonathan ShenGoogleVerified email at google.com
Dong Yu (俞栋)Distinguished Scientist @ Tencent AI Lab, ACM/IEEE/ISCA FellowVerified email at global.tencent.com
Quoc V. LeResearch Scientist, GoogleVerified email at stanford.edu
Daniel S. ParkGoogle BrainVerified email at google.com
Tara SainathPrincipal Research Scientist, GoogleVerified email at google.com
Zhifeng ChenGoogle Inc.Verified email at google.com
Wei-Ning HsuFacebook AI Research (FAIR)Verified email at csail.mit.edu
Yuxuan WangByteDanceVerified email at cse.ohio-state.edu
Anmol GulatiResearcher, Google DeepmindVerified email at google.com

Yu Zhang

OpenAI

Verified email at csail.mit.edu - Homepage

Speech Recognition Speech Synthesis


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Specaugment: A simple data augmentation method for automatic speech recognition DS Park, W Chan, Y Zhang, CC Chiu, B Zoph, ED Cubuk, QV Le arXiv preprint arXiv:1904.08779, 2019	3767	2019
Natural tts synthesis by conditioning wavenet on mel spectrogram predictions J Shen, R Pang, RJ Weiss, M Schuster, N Jaitly, Z Yang, Z Chen, Y Zhang, ... 2018 IEEE international conference on acoustics, speech and signal …, 2018	3016	2018
Conformer: Convolution-augmented transformer for speech recognition A Gulati, J Qin, CC Chiu, N Parmar, Y Zhang, J Yu, W Han, S Wang, ... arXiv preprint arXiv:2005.08100, 2020	2648	2020
Style tokens: Unsupervised style modeling, control and transfer in end-to-end speech synthesis Y Wang, D Stanton, Y Zhang, RJS Ryan, E Battenberg, J Shor, Y Xiao, ... International conference on machine learning, 5180-5189, 2018	894	2018
Transfer learning from speaker verification to multispeaker text-to-speech synthesis Y Jia, Y Zhang, R Weiss, Q Wang, J Shen, F Ren, P Nguyen, R Pang, ... Advances in neural information processing systems 31, 2018	894	2018
Libritts: A corpus derived from librispeech for text-to-speech H Zen, V Dang, R Clark, Y Zhang, RJ Weiss, Y Jia, Z Chen, Y Wu arXiv preprint arXiv:1904.02882, 2019	721	2019
Wavegrad: Estimating gradients for waveform generation N Chen, Y Zhang, H Zen, RJ Weiss, M Norouzi, W Chan arXiv preprint arXiv:2009.00713, 2020	614	2020
Very deep convolutional networks for end-to-end speech recognition Y Zhang, W Chan, N Jaitly 2017 IEEE international conference on acoustics, speech and signal …, 2017	552	2017
An introduction to computational networks and the computational network toolkit MS Dong Yu, Adam Eversole, Mike Seltzer, Kaisheng Yao, Zhiheng Huang, Brian ... Tech. Rep. MSR, Microsoft Research, 2014, http://codebox/cntk, 2014	467*	2014
Unsupervised learning of disentangled and interpretable representations from sequential data WN Hsu, Y Zhang, J Glass Advances in neural information processing systems 30, 2017	401	2017
Spoken language understanding using long short-term memory neural networks K Yao, B Peng, Y Zhang, D Yu, G Zweig, Y Shi IEEE SLT, 2014	399	2014
Highway long short-term memory rnns for distant speech recognition Y Zhang, G Chen, D Yu, K Yao, S Khudanpur, J Glass 2016 IEEE international conference on acoustics, speech and signal …, 2016	360	2016
Advances in joint CTC-attention based end-to-end speech recognition with a deep CNN encoder and RNN-LM T Hori, S Watanabe, Y Zhang, W Chan arXiv preprint arXiv:1706.02737, 2017	345	2017
Pushing the limits of semi-supervised learning for automatic speech recognition Y Zhang, J Qin, DS Park, W Han, CC Chiu, R Pang, QV Le, Y Wu arXiv preprint arXiv:2010.10504, 2020	324	2020
Simple recurrent units for highly parallelizable recurrence T Lei, Y Zhang, SI Wang, H Dai, Y Artzi arXiv preprint arXiv:1709.02755, 2017	316	2017
W2v-bert: Combining contrastive learning and masked language modeling for self-supervised speech pre-training YA Chung, Y Zhang, W Han, CC Chiu, J Qin, R Pang, Y Wu 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021	312	2021
ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech X Wang, J Yamagishi, M Todisco, H Delgado, A Nautsch, N Evans, ... Computer Speech & Language 64, 101114, 2020	298	2020
Contextnet: Improving convolutional neural networks for automatic speech recognition with global context W Han, Z Zhang, Y Zhang, J Yu, CC Chiu, J Qin, A Gulati, R Pang, Y Wu arXiv preprint arXiv:2005.03191, 2020	280	2020
Hierarchical generative modeling for controllable speech synthesis WN Hsu, Y Zhang, RJ Weiss, H Zen, Y Wu, Y Wang, Y Cao, Y Jia, Z Chen, ... arXiv preprint arXiv:1810.07217, 2018	269	2018
Improved noisy student training for automatic speech recognition DS Park, Y Zhang, Y Jia, W Han, CC Chiu, B Li, Y Wu, QV Le arXiv preprint arXiv:2005.09629, 2020	241	2020

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors