Yuhta Takida
Sony AI
Verified email at sony.com
Title
Cited by
Year
Consistency Trajectory Models: Learning Probability Flow ODE Trajectory of Diffusion
D Kim, CH Lai, WH Liao, N Murata, Y Takida, T Uesaka, Y He, Y Mitsufuji, ...
International Conference on Learning Representations (ICLR), 2023
Cited by: 103; Year: 2023
SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization
Y Takida, T Shibuya, WH Liao, CH Lai, J Ohmura, T Uesaka, N Murata, ...
International Conference on Machine Learning (ICML), 20987-21012, 2022
Cited by: 58; Year: 2022
GibbsDDRM: A Partially Collapsed Gibbs Sampler for Solving Blind Inverse Problems with Denoising Diffusion Restoration
N Murata, K Saito, CH Lai, Y Takida, T Uesaka, Y Mitsufuji, S Ermon
International Conference on Machine Learning (ICML), 25501-25522, 2023
Cited by: 37; Year: 2023
FP-Diffusion: Improving Score-based Diffusion Models by Enforcing the Underlying Score Fokker-Planck Equation
CH Lai, Y Takida, N Murata, T Uesaka, Y Mitsufuji, S Ermon
International Conference on Machine Learning (ICML), 18365-18398, 2023
Cited by: 36*; Year: 2023
Manifold Preserving Guided Diffusion
Y He, N Murata, CH Lai, Y Takida, T Uesaka, D Kim, WH Liao, Y Mitsufuji, ...
International Conference on Learning Representations (ICLR), 2023
Cited by: 31; Year: 2023
Preventing oversmoothing in VAE via generalized variance parameterization
Y Takida, WH Liao, CH Lai, T Uesaka, S Takahashi, Y Mitsufuji
Neurocomputing 509, 137-156, 2022
Cited by: 27*; Year: 2022
Unsupervised vocal dereverberation with diffusion-based generative models
K Saito, N Murata, T Uesaka, CH Lai, Y Takida, T Fukui, Y Mitsufuji
IEEE International Conference on Acoustics, Speech and Signal Processing …, 2023
Cited by: 23; Year: 2023
Automatic Piano Transcription with Hierarchical Frequency-Time Transformer
K Toyama, T Akama, Y Ikemiya, Y Takida, WH Liao, Y Mitsufuji
24th International Society for Music Information Retrieval Conference (ISMIR), 2023
Cited by: 21; Year: 2023
Diffiner: A Versatile Diffusion-based Generative Refiner for Speech Enhancement
R Sawata, N Murata, Y Takida, T Uesaka, T Shibuya, S Takahashi, ...
Interspeech, 2023
Cited by: 16*; Year: 2023
Exterior and interior sound field separation using convex optimization: Comparison of signal models
Y Takida, S Koyama, H Saruwatari
2018 26th European Signal Processing Conference (EUSIPCO), 2549-2553, 2018
Cited by: 15; Year: 2018
BigVSAN: Enhancing GAN-based Neural Vocoders with Slicing Adversarial Network
T Shibuya, Y Takida, Y Mitsufuji
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 13; Year: 2023
HQ-VAE: Hierarchical Discrete Representation Learning with Variational Bayes
Y Takida, Y Ikemiya, T Shibuya, K Shimada, W Choi, CH Lai, N Murata, ...
Transactions on Machine Learning Research (TMLR), 2023
Cited by: 8; Year: 2023
On the equivalence of consistency-type models: Consistency models, consistent diffusion models, and fokker-planck regularization
CH Lai, Y Takida, T Uesaka, N Murata, Y Mitsufuji, S Ermon
International Conference on Machine Learning 2023 Workshop SPIGM, 2023
Cited by: 8; Year: 2023
SAN: Inducing Metrizability of GAN with Discriminative Normalized Linear Layer
Y Takida, M Imaizumi, T Shibuya, CH Lai, T Uesaka, N Murata, Y Mitsufuji
International Conference on Learning Representations (ICLR), 2023
Cited by: 8; Year: 2023
PaGoDA: Progressive Growing of a One-Step Generator from a Low-Resolution Diffusion Teacher
D Kim, CH Lai, WH Liao, Y Takida, N Murata, T Uesaka, Y Mitsufuji, ...
arXiv preprint arXiv:2405.14822, 2024
Cited by: 5; Year: 2024
Reciprocity gap functional in spherical harmonic domain for gridless sound field decomposition
Y Takida, S Koyama, N Ueno, H Saruwatari
Signal Processing 169, 107383, 2020
Cited by: 5; Year: 2020
Robust gridless sound field decomposition based on structured reciprocity gap functional in spherical harmonic domain
Y Takida, S Koyama, N Ueno, H Saruwatari
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Cited by: 5; Year: 2019
Gridless sound field decomposition based on reciprocity gap functional in spherical harmonic domain
Y Takida, S Koyama, N Ueno, H Saruwatari
2018 IEEE 10th Sensor Array and Multichannel Signal Processing Workshop (SAM …, 2018
Cited by: 5; Year: 2018
SoundCTM: Uniting Score-based and Consistency Models for Text-to-Sound Generation
K Saito, D Kim, T Shibuya, CH Lai, Z Zhong, Y Takida, Y Mitsufuji
arXiv preprint arXiv:2405.18503, 2024
Cited by: 3; Year: 2024
Array-geometry-aware spatial active noise control based on direction-of-arrival weighting
Y Maeno, Y Takida, N Murata, Y Mitsufuji
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by: 3; Year: 2020