Shusuke Takahashi
Sony Group Corporation
Verified email at sony.com
Title
Cited by
Year
ACCDOA: Activity-coupled cartesian direction of arrival representation for sound event localization and detection
K Shimada, Y Koyama, N Takahashi, S Takahashi, Y Mitsufuji
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by: 112; Year: 2021
Multi-ACCDOA: Localizing and detecting overlapping sounds from the same class with auxiliary duplicating permutation invariant training
K Shimada, Y Koyama, S Takahashi, N Takahashi, E Tsunoo, Y Mitsufuji
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 90; Year: 2022
STARSS22: A dataset of spatial recordings of real scenes with spatiotemporal annotations of sound events
A Politis, K Shimada, P Sudarsanam, S Adavanne, D Krause, Y Koyama, ...
arXiv preprint arXiv:2206.01948, 2022
Cited by: 87; Year: 2022
All for one and one for all: Improving music separation by bridging networks
R Sawata, S Uhlich, S Takahashi, Y Mitsufuji
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by: 63; Year: 2021
SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization
Y Takida, T Shibuya, WH Liao, CH Lai, J Ohmura, T Uesaka, N Murata, ...
arXiv preprint arXiv:2205.07547, 2022
Cited by: 58; Year: 2022
STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events
K Shimada, A Politis, P Sudarsanam, D Krause, K Uchida, S Adavanne, ...
arXiv preprint arXiv:2306.09126, 2023
Cited by: 37; Year: 2023
Ensemble of ACCDOA- and EINV2-based systems with D3Nets and impulse response simulation for sound event localization and detection
K Shimada, N Takahashi, Y Koyama, S Takahashi, E Tsunoo, ...
arXiv preprint arXiv:2106.10806, 2021
Cited by: 29; Year: 2021
Sound event localization and detection using activity-coupled cartesian DOA vector and RD3NET
K Shimada, N Takahashi, S Takahashi, Y Mitsufuji
arXiv preprint arXiv:2006.12014, 2020
Cited by: 22; Year: 2020
Elementary real-time implementation of a virtual acoustic display based on ADVISE
S Takane, S Takahashi, Y Suzuki, T Miyajima
Acoustical science and technology 24 (5), 304-310, 2003
Cited by: 19; Year: 2003
Diffroll: Diffusion-Based Generative Music Transcription with Unsupervised Pretraining Capability
KW Cheuk, R Sawata, T Uesaka, N Murata, N Takahashi, S Takahashi, ...
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 17; Year: 2023
Preventing posterior collapse induced by oversmoothing in Gaussian VAE
Y Takida, WH Liao, T Uesaka, S Takahashi, Y Mitsufuji
arXiv preprint arXiv:2102.08663, 2021
Cited by: 17; Year: 2021
Spatial data augmentation with simulated room impulse responses for sound event localization and detection
Y Koyama, K Shigemi, M Takahashi, K Shimada, N Takahashi, E Tsunoo, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 15; Year: 2022
Improving Character Error Rate is Not Equal to Having Clean Speech: Speech Enhancement for ASR Systems with Black-Box Acoustic Models
R Sawata, Y Kashiwagi, S Takahashi
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 15; Year: 2022
An Attention-Based Approach to Hierarchical Multi-Label Music Instrument Classification
Z Zhong, M Hirano, K Shimada, K Tateishi, S Takahashi, Y Mitsufuji
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 12; Year: 2023
Preventing oversmoothing in VAE via generalized variance parameterization
Y Takida, WH Liao, CH Lai, T Uesaka, S Takahashi, Y Mitsufuji
Neurocomputing 509, 137-156, 2022
Cited by: 11; Year: 2022
Diffusion-Based Speech Enhancement with Joint Generative and Predictive Decoders
H Shi, K Shimada, M Hirano, T Shibuya, Y Koyama, Z Zhong, S Takahashi, ...
arXiv preprint arXiv:2305.10734, 2023
Cited by: 10; Year: 2023
A Versatile Diffusion-based Generative Refiner for Speech Enhancement
R Sawata, N Murata, Y Takida, T Uesaka, T Shibuya, S Takahashi, ...
arXiv preprint arXiv:2210.17287, 2022
Cited by: 9; Year: 2022
Information processing device, information processing method, program, recording medium, and information processing system
K Matsumoto, S Takahashi, C Kemmochi, A Inoue
US Patent App. 13/719,652, 2013
Cited by: 9; Year: 2013
Sound processing apparatus, sound processing method and program
R Namba, M Abe, A Inoue, K Toyama, S Takahashi, M Nishiguchi
US Patent 9,762,193, 2017
Cited by: 8; Year: 2017
Diffiner: A Versatile Diffusion-based Generative Refiner for Speech Enhancement
R Sawata, N Murata, Y Takida, T Uesaka, T Shibuya, S Takahashi, ...
Cited by: 8*