2024
Journal Papers
- Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo & Shogo Seki (2024). VoiceGrad: Non-Parallel Any-to-Many Voice Conversion with Annealed Langevin Dynamics. IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), 32, 2213-2226.
- Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada & Kunio Kashino (2024). Masked Modeling Duo: Towards a Universal Audio Pre-training Framework. IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), 32, 2391-2406.
Peer-reviewed Conference Papers
- Chihiro Watanabe & Hirokazu Kameoka (2024). GE2E-AC: Generalized End-to-End Loss Training for Accent Classification. 2024 Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC). Macau, China.
- Xiao Zhang, Haoran Xing, Mingxue Song, Daiki Takeuchi, Noboru Harada & Shoji Makino (2024). Prediction-Error-Based Adaptive SpecAugment for Fine-tuning the Masked Model on Audio Classification Tasks. 2024 Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC). Macau, China.
- Yuto Kondo, Hirokazu Kameoka, Kou Tanaka, Takuhiro Kaneko & Noboru Harada (2024). LEARNING TO ASSESS SUBJECTIVE IMPRESSIONS CONVEYED THROUGH SPEECH. European Signal Processing Conference (EUSIPCO). Lyon, France.
- Shunsuke Tsugaki, Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada & Keisuke Imoto (2024). Refining knowledge transfer on audio-image temporal agreement for audio-text cross retrieval. European Signal Processing Conference (EUSIPCO). Lyon, France.
- Bo He, Shiqi Zhang, Xianrui Wang, Zheng Qiu, Daiki Takeuchi, Daisuke Niizumi, Noboru Harada & Shoji Makino (2024). Light Gated Multi Mini-patch Extractor for Audio Classification. ICASSP2024 Satellite Workshop on Hands-free Speech Communication and Microphone Arrays (HSCMA 2024).
- Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada & Kunio Kashino (2024). Exploring Pre-trained General-purpose Audio Representations for Heart Murmur Detection. IEEE Engineering in Medicine and Biology Society (EMBC). Orlando, Florida, USA.
- Takuhiro Kaneko, Hirokazu Kameoka & Kou Tanaka (2024). Training Generative Adversarial Network-Based Vocoder with Limited Data Using Augmentation-Conditional Discriminator. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Seoul, Korea.
- Yuto Kondo, Hirokazu Kameoka, Kou Tanaka & Takuhiro Kaneko (2024). SELECTING N-LOWEST SCORES FOR TRAINING MOS PREDICTION MODELS. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Seoul, Korea.
- Shiqi Zhang, Zheng Qiu, Daiki Takeuchi, Noboru Harada & Shoji Makino (2024). Unrestricted Global-Phase-Bias Aware Single-channel Speech Enhancement with Conformer-based Metric GAN. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Seoul, Korea.
- Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka & Yuto Kondo (2024). FastVoiceGrad: One-step Diffusion-Based Voice Conversion with Adversarial Conditional Diffusion Distillationa. Interspeech2024. Kos Island, Greece.
- Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada, Masahiro Yasuda, Shunsuke Tsubaki & Keisuke Imoto (2024). M2D-CLAP: Masked Modeling Duo Meets CLAP for Learning General-purpose Audio-Language Representation. Interspeech2024. Kos Island, Greece.
- Kou Tanaka, Hirokazu Kameoka, Takuhiro Kaneko & Yuto Kondo (2024). PRVAE-VC2: Non-Parallel Voice Conversion by Distillation of Speech Representations. Interspeech2024. Kos Island, Greece.
- Daisuke Niizumi, Noboru Harada, Yasunori Ohishi, Daiki Takeuchi & Masahiro Yasuda (2024). ToyADMOS2#: Yet Another Dataset for The DCASE2024 Challenge Task 2 First-Shot Anomalous Sound Detection. Proceedings of the Detection and Classification of Acoustic Scenes and Events 2024 Workshop (DCASE2024). Tokyo, Japan.
- Tomoya Nishida, Noboru Harada, Daisuke Niizumi, Davide Albertini, Roberto Sannino, Simone Pradolini, Filippo Augusti, Keisuke Imoto, Kota Dohi, Harsh Purohit, Takashi Endo & Yohei Kawaguchi (2024). Description and Discussion on DCASE 2024 Challenge Task 2: First-Shot Unsupervised Anomalous Sound Detection for Machine Condition Monitoring. Proceedings of the Detection and Classification of Acoustic Scenes and Events 2024 Workshop (DCASE2024). Tokyo, Japan.
- Daiki Takeuchi, Masahiro Yasuda, Daisuke Niizumi & Noboru Harada (2024). Towards Learning a Difference-Aware General-Purpose Audio Representation. Proceedings of the Detection and Classification of Acoustic Scenes and Events 2024 Workshop (DCASE2024). Tokyo, Japan.

















