Streaming end-to-end speech recognition for mobile devices Y He, TN Sainath, R Prabhavalkar, I McGraw, R Alvarez, D Zhao, ... ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 754 | 2019 |
Residual belief propagation: Informed scheduling for asynchronous message passing G Elidan, I McGraw, D Koller arXiv preprint arXiv:1206.6837, 2012 | 381 | 2012 |
A streaming on-device end-to-end model surpassing server-side conventional model quality and latency TN Sainath, Y He, B Li, A Narayanan, R Pang, A Bruguier, S Chang, W Li, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 230 | 2020 |
Personalized speech recognition on mobile devices I McGraw, R Prabhavalkar, R Alvarez, MG Arenas, K Rao, D Rybach, ... 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016 | 230 | 2016 |
Lingvo: a modular and scalable framework for sequence-to-sequence modeling J Shen, P Nguyen, Y Wu, Z Chen, MX Chen, Y Jia, A Kannan, T Sainath, ... arXiv preprint arXiv:1902.08295, 2019 | 214 | 2019 |
Two-pass end-to-end speech recognition TN Sainath, R Pang, D Rybach, Y He, R Prabhavalkar, W Li, M Visontai, ... arXiv preprint arXiv:1908.10992, 2019 | 173 | 2019 |
Tool for selecting ink and other objects in an electronic document AJ Simmons, IC McGraw, PL Engrav, B Barabe, OC Braun US Patent 7,454,702, 2008 | 136 | 2008 |
A summary of the 2012 JHU CLSP workshop on zero resource speech technologies and models of early language acquisition A Jansen, E Dupoux, S Goldwater, M Johnson, S Khudanpur, K Church, ... 2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013 | 122 | 2013 |
On the compression of recurrent neural networks with an application to LVCSR acoustic modeling for embedded speech recognition R Prabhavalkar, O Alsharif, A Bruguier, L McGraw 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016 | 120 | 2016 |
The WAMI toolkit for developing, deploying, and evaluating web-accessible multimodal interfaces A Gruenstein, I McGraw, I Badr Proceedings of the 10th international conference on Multimodal interfaces …, 2008 | 113 | 2008 |
Streaming small-footprint keyword spotting using sequence-to-sequence models Y He, R Prabhavalkar, K Rao, W Li, A Bakhtin, I McGraw 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017 | 103 | 2017 |
Collecting Voices from the Cloud. I McGraw, C Lee, IL Hetherington, S Seneff, JR Glass LREC, 1576-1583, 2010 | 100 | 2010 |
Optimizing speech recognition for the edge Y Shangguan, J Li, Q Liang, R Alvarez, I McGraw arXiv preprint arXiv:1909.12408, 2019 | 72 | 2019 |
Speech-enabled card games for incidental vocabulary acquisition in a foreign language I McGraw, B Yoshimoto, S Seneff Speech Communication 51 (10), 1006-1023, 2009 | 72 | 2009 |
Learning lexicons from speech using a pronunciation mixture model I McGraw, I Badr, JR Glass IEEE Transactions on Audio, Speech, and Language Processing 21 (2), 357-366, 2012 | 58 | 2012 |
A Conversational Movie Search System Based on Conditional Random Fields. J Liu, S Cyphers, P Pasupat, I McGraw, JR Glass Interspeech, 2454-2457, 2012 | 57 | 2012 |
A self-transcribing speech corpus: collecting continuous speech with an online educational game. A Gruenstein, I McGraw, AM Sutherland SLaTE, 109-112, 2009 | 56 | 2009 |
An Efficient Streaming Non-Recurrent On-Device End-to-End Model with Improvements to Rare-Word Modeling. TN Sainath, Y He, A Narayanan, R Botros, R Pang, D Rybach, C Allauzen, ... Interspeech 8, 1777-1781, 2021 | 46 | 2021 |
A self-labeling speech corpus: collecting spoken words with an online educational game. I McGraw, A Gruenstein, AM Sutherland Interspeech, 3031-3034, 2009 | 36 | 2009 |
Learning word-level confidence for subword end-to-end ASR D Qiu, Q Li, Y He, Y Zhang, B Li, L Cao, R Prabhavalkar, D Bhatia, W Li, ... ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 31 | 2021 |