Consistency Trajectory Models: Learning Probability Flow ODE Trajectory of Diffusion D Kim, CH Lai, WH Liao, N Murata, Y Takida, T Uesaka, Y He, Y Mitsufuji, ... International Conference on Learning Representations (ICLR), 2023 | 103 | 2023 |
SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization Y Takida, T Shibuya, WH Liao, CH Lai, J Ohmura, T Uesaka, N Murata, ... International Conference on Machine Learning (ICML), 20987-21012, 2022 | 58 | 2022 |
GibbsDDRM: A Partially Collapsed Gibbs Sampler for Solving Blind Inverse Problems with Denoising Diffusion Restoration N Murata, K Saito, CH Lai, Y Takida, T Uesaka, Y Mitsufuji, S Ermon International Conference on Machine Learning (ICML), 25501-25522, 2023 | 37 | 2023 |
FP-Diffusion: Improving Score-based Diffusion Models by Enforcing the Underlying Score Fokker-Planck Equation CH Lai, Y Takida, N Murata, T Uesaka, Y Mitsufuji, S Ermon International Conference on Machine Learning (ICML), 18365-18398, 2023 | 36* | 2023 |
Manifold Preserving Guided Diffusion Y He, N Murata, CH Lai, Y Takida, T Uesaka, D Kim, WH Liao, Y Mitsufuji, ... International Conference on Learning Representations (ICLR), 2023 | 31 | 2023 |
Preventing oversmoothing in VAE via generalized variance parameterization Y Takida, WH Liao, CH Lai, T Uesaka, S Takahashi, Y Mitsufuji Neurocomputing 509, 137-156, 2022 | 27* | 2022 |
Unsupervised vocal dereverberation with diffusion-based generative models K Saito, N Murata, T Uesaka, CH Lai, Y Takida, T Fukui, Y Mitsufuji IEEE International Conference on Acoustics, Speech and Signal Processing …, 2023 | 23 | 2023 |
Automatic Piano Transcription with Hierarchical Frequency-Time Transformer K Toyama, T Akama, Y Ikemiya, Y Takida, WH Liao, Y Mitsufuji 24th International Society for Music Information Retrieval Conference (ISMIR), 2023 | 21 | 2023 |
Diffiner: A Versatile Diffusion-based Generative Refiner for Speech Enhancement R Sawata, N Murata, Y Takida, T Uesaka, T Shibuya, S Takahashi, ... Interspeech, 2023 | 16* | 2023 |
Exterior and interior sound field separation using convex optimization: Comparison of signal models Y Takida, S Koyama, H Saruwatari 2018 26th European Signal Processing Conference (EUSIPCO), 2549-2553, 2018 | 15 | 2018 |
BigVSAN: Enhancing GAN-based Neural Vocoders with Slicing Adversarial Network T Shibuya, Y Takida, Y Mitsufuji ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2023 | 13 | 2023 |
HQ-VAE: Hierarchical Discrete Representation Learning with Variational Bayes Y Takida, Y Ikemiya, T Shibuya, K Shimada, W Choi, CH Lai, N Murata, ... Transactions on Machine Learning Research (TMLR), 2023 | 8 | 2023 |
On the equivalence of consistency-type models: Consistency models, consistent diffusion models, and Fokker-Planck regularization CH Lai, Y Takida, T Uesaka, N Murata, Y Mitsufuji, S Ermon International Conference on Machine Learning 2023 Workshop SPIGM, 2023 | 8 | 2023 |
SAN: Inducing Metrizability of GAN with Discriminative Normalized Linear Layer Y Takida, M Imaizumi, T Shibuya, CH Lai, T Uesaka, N Murata, Y Mitsufuji International Conference on Learning Representations (ICLR), 2023 | 8 | 2023 |
PaGoDA: Progressive Growing of a One-Step Generator from a Low-Resolution Diffusion Teacher D Kim, CH Lai, WH Liao, Y Takida, N Murata, T Uesaka, Y Mitsufuji, ... arXiv preprint arXiv:2405.14822, 2024 | 5 | 2024 |
Reciprocity gap functional in spherical harmonic domain for gridless sound field decomposition Y Takida, S Koyama, N Ueno, H Saruwatari Signal Processing 169, 107383, 2020 | 5 | 2020 |
Robust gridless sound field decomposition based on structured reciprocity gap functional in spherical harmonic domain Y Takida, S Koyama, N Ueno, H Saruwatari ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 5 | 2019 |
Gridless sound field decomposition based on reciprocity gap functional in spherical harmonic domain Y Takida, S Koyama, N Ueno, H Saruwatari 2018 IEEE 10th Sensor Array and Multichannel Signal Processing Workshop (SAM …, 2018 | 5 | 2018 |
SoundCTM: Uniting Score-based and Consistency Models for Text-to-Sound Generation K Saito, D Kim, T Shibuya, CH Lai, Z Zhong, Y Takida, Y Mitsufuji arXiv preprint arXiv:2405.18503, 2024 | 3 | 2024 |
Array-geometry-aware spatial active noise control based on direction-of-arrival weighting Y Maeno, Y Takida, N Murata, Y Mitsufuji ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 3 | 2020 |