2024
論文
- Seiseki Akibue, Go Kato & Seiichiro Tani (2024). Probabilistic unitary synthesis with optimal accuracy. ACM Transactions on Quantum Computing, 5 (3), 1-27.
- Ryo Nishikimi, Masahiro Nakano, Kunio Kashino & Shingo Tuskada (2024). Variational Autoencoder-Based Neural Electrocardiogram Synthesis Trained by FEM-Based Heart Simulator. Cardiovascular Digital Health Journal, 5 (1), 19-28.
- Moyu Hasegawa, Kenji Miki, Takuji Kawamura, Ikue Sasozaki, Yuki Hikashiyama, Masaru Tuchida, Kunio Kashino, Masaki Taira, Emiko Ito, Maki Tkeda, Hidekazu Ishida, Shuichiro Higo, Yasushi Sakata & Shigeru Miyagawa (2024). Gene correction and overexpression of TNNI3 improve impaired relaxation in engineered heart tissue model of pediatric restrictive cardiomyopathy. Development, Growth & Differentiation, 66 (2), 119-132.
- Tetsuya Ueda, Tomohiro Nakatani, Rintaro Ikeshita, Shoko Araki & Shoji Makino (2024). DOA-Informed Switching Independent Vector Extraction and Beamforming for Speech Enhancement in Underdetermined Situations. EURASIP Journal on Audio, Speech, and Music Processing, 2024.
- Takanori Ashihara, Marc Delcroix, Yusuke Ijima & Makio Kashino (2024). Unveiling the Linguistic Capabilities of a Self-Supervised Speech Model Through Cross-Lingual Benchmark and Layer- Wise Similarity Analysis. IEEE Access, 12, 98835-98855.
- Reinhold Haeb-Umbach, Tomohiro Nakatani, Marc Delcroix, Christoph Boeddeker & Tsubasa Ochiai (2024). Microphone Array Signal Processing and Deep Learning for Speech Enhancement: Combining model-based and data-driven approaches to parameter estimation and filtering. IEEE Signal Processing Magazine, 41 (6), 12-23.
- Rintaro Ikeshita & Tomohiro Nakatani (2024). Geometrically-Regularized Fast Independent Vector Extraction by Pure Majorization-Minimization. IEEE Transactions on Signal Processing, 72, 1560-1575.
- Tsubasa Ochiai, Kazuma Iwamoto, Marc Delcroix, Rintaro Ikeshita, Hiroshi Sato, Shoko Araki & Shigeru Katagiri (2024). Rethinking Processing Distortions: Disentangling the Impact of Speech Enhancement Errors on Speech Recognition Performance. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 32, 3589-3602.
- Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo & Shogo Seki (2024). VoiceGrad: Non-Parallel Any-to-Many Voice Conversion with Annealed Langevin Dynamics. IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), 32, 2213-2226.
- Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada & Kunio Kashino (2024). Masked Modeling Duo: Towards a Universal Audio Pre-training Framework. IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), 32, 2391-2406.
- Tetsuya Ueda, Tomohiro Nakatani, Rintaro Ikeshita, Keisuke Kinoshita, Shoko Araki & Shoji Makino (2024). Blind and Spatially-Regularized Online Joint Optimization of Source Separation, Dereverberation, and Noise Reduction. IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 32, 1157-1172.
- Kazufumi Kimoto & Masato Wakayama (2024). Partition functions for non-commutative harmonic oscillators and related divergent series. Indagation Mathematicae.
- Cid Reyes-Bustos & Masato Wakayama (2024). Zeta Limits for The Spectrum of Quantum Rabi Models. Journal of Mathematical Physics, 65 (9).
- Linh Thi Hoai Nguyen, Cid Reyes-Bustos, Daniel Braak & Masato Wakayama (2024). Spacing Distribution for Quantum Rabi Models. Journal of Physics A: Mathematical and Theoretical, 57 (29), 295201.
- Koizumi Junnosuke & Miyazaki Hiroyasu (2024). A motivic construction of the de Rham-Witt complex. Journal of Pure and Applied Algebra, 228 (6), 107602.
- Ryosuke Nakahama (2024). Representation theory of sl(2,R). Mathematical Foundations for Post-Quantum Cryptography.
- Cid Reyes (2024). Towards hash functions based on group-subgroup pair graphs. Mathematical Foundations for Post-Quantum Cryptography.
- Hiroto Kasai, Yuki Takeuchi, Yuichiro Matsuzaki & Yasuhiro Tokura (2024). Direct Moment Estimation of Intensity Distribution of Magnetic Fields with Quantum Sensing Network. New Journal of Physics, 26 (12).
- Jisho Miyazaki & Seiseki Akibue (2024). Non-locality of conjugation symmetry: characterization and examples in quantum network sensing. New Journal of Physics, 26 (5), 053017.
- Yu Mitsuzumi, Go Irie, Akisato Kimura & Atsushi Nakazawa (2024). Phase Randomization: A Data Augmentation for Domain Adaptation in Human Action Recognition. Pattern Recognition, 146.
- Cid Reyes-Bustos, Naoya Yamaguchi & Yuka Yamaguchi (2024). Wolstenholme Primes and Group Determinants of Cyclic Groups. Proceedings of the Japan Academy. Series. A, Mathematical Sciences, 100 (9), 51-55.
- Seiseki Akibue, Go Kato & Seiichiro Tani (2024). Probabilistic state synthesis based on optimal convex approximation. Quantum Information, 10 (1).
国際会議予稿
- Chihiro Watanabe & Hirokazu Kameoka (2024). GE2E-AC: Generalized End-to-End Loss Training for Accent Classification. 2024 Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC). Macau, China.
- Xiao Zhang, Haoran Xing, Mingxue Song, Daiki Takeuchi, Noboru Harada & Shoji Makino (2024). Prediction-Error-Based Adaptive SpecAugment for Fine-tuning the Masked Model on Audio Classification Tasks. 2024 Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC). Macau, China.
- Yu Mitsuzumi, Akisato Kimura, Go Irie & Atsushi Nakazawa (2024). Cross-Action Cross-Subject Skeleton Action Recognition Via Simultaneous Action-Subject Learning With Two-Step Feature Removal. 2024 IEEE International Conference on Image Processing (ICIP). Abu Dhabi, United Arab Emirates.
- Akihiro Mizutani, Yuki Takeuchi & Kiyoshi Tamaki (2024). Finite-key Security Analysis of Differential-Phase-Shift QKD. 24th Asian Quantum Information Science Conference(AQIS). Sapporo, Japan.
- Yusuke Oumi, Yuto Shibata, Go Irie, Akisato Kimura, Yoshimitsu Aoki & Mariko Isogawa (2024). Acoustic-Based 3D Human Pose Estimation Robust to Human Position. 35th British Machine Vision Conference 2024,(BMVC). Glasgow, UK.
- Seiseki Akibue, Go Kato & Seiichiro Tani (2024). Probabilistic Unitary and State Synthesis with Optimal Accuracy. 6th International Workshop on Quantum Compilation. Berlin, Germany.
- Yasuhiro Fujiwara, Atsutoshi Kumagai, Yasutoshi Ida, Masahiro Nakano, Makoto Nakatsuji & Akisato Kimura (2024). Efficient Algorithm for K-Multiple-Means. ACM SIGMOD International Conference on Management of Data. Santiago, Chile.
- Yuto Kondo, Hirokazu Kameoka, Kou Tanaka, Takuhiro Kaneko & Noboru Harada (2024). LEARNING TO ASSESS SUBJECTIVE IMPRESSIONS CONVEYED THROUGH SPEECH. European Signal Processing Conference (EUSIPCO). Lyon, France.
- Shunsuke Tsugaki, Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada & Keisuke Imoto (2024). Refining knowledge transfer on audio-image temporal agreement for audio-text cross retrieval. European Signal Processing Conference (EUSIPCO). Lyon, France.
- Hao Shi, Naoyuki Kamo, Marc Delcroix, Tomohiro Nakatani & Shoko Araki (2024). ENSEMBLE INFERENCE FOR DIFFUSION MODEL-BASED SPEECH ENHANCEMENT. ICASSP2024 Satellite Workshop on Hands-Free Speech Communication and Microphone Array (HSCMA). Seoul, Korea.
- Thilo von Neumann, Christoph Cord-Landwehr Boeddeker, Marc Delcroix & Reinhold Haeb-Umbach (2024). Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization. ICASSP2024 Satellite Workshop on Hands-Free Speech Communication and Microphone Array (HSCMA). Seoul, Korea.
- Rino Kimura, Tomohiro Nakatani, Naoyuki Kamo, Delcroix Marc, Shoko Araki, Tetsuya Ueda & Shoji Makino (2024). Diffusion model-based MIMO speech denoising and dereverberation. ICASSP2024 Satellite Workshop on Hands-Free Speech Communication and Microphone Array (HSCMA) Workshop. Seoul, Korea.
- Bo He, Shiqi Zhang, Xianrui Wang, Zheng Qiu, Daiki Takeuchi, Daisuke Niizumi, Noboru Harada & Shoji Makino (2024). Light Gated Multi Mini-patch Extractor for Audio Classification. ICASSP2024 Satellite Workshop on Hands-free Speech Communication and Microphone Arrays (HSCMA 2024).
- Junyi Peng, Marc Delcroix, Tsubasa Ochiai, Oldrich Plchot, Takanori Ashihara, Shoko Araki & Jan Cernocky (2024). Probing Self-supervised Learning Models with Target Speech Extraction. ICASSP2024 Satellite Workshop on Self-supervision in Audio, Speech, and Beyond (SASB). Seoul, Korea.
- Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada & Kunio Kashino (2024). Exploring Pre-trained General-purpose Audio Representations for Heart Murmur Detection. IEEE Engineering in Medicine and Biology Society (EMBC). Orlando, Florida, USA.
- Takanori Ashihara, Marc Delcroix, Takafumi Moriya, Kohei Matsuura, Taichi Asami & Yusuke Ijima (2024). What do self-supervised speech and speaker models learn? New findings from a cross model layer-wise analysis. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Seoul, Korea.
- William Chen, Takatomo Kano, Atsunori Ogawa, Marc Delcroix & Shinji Watanabe (2024). Train Long and Test Long: Leveraging Full Document Contexts in Speech Processing. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Seoul, Korea.
- Kenichi Fujita, Hiroshi Sato, Takanori Ashihara, Hiroki Kanagawa, Marc Delcroix, Takafumi Moriya & Yusuke Ijima (2024). Noise-robust zero-shot text-to-speech synthesis conditioned on self-supervised speech-representation model with adapters. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Seoul, Korea.
- Kazuma Iwamoto, Tsubasa Ochiai, Marc Delcroix, Rintaro Ikeshita, Hiroshi Sato, Shoko Araki & Shigeru Katagiri (2024). How does end-to-end speech recognition training impact speech enhancement artifacts?. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Seoul, Korea.
- Takuhiro Kaneko, Hirokazu Kameoka & Kou Tanaka (2024). Training Generative Adversarial Network-Based Vocoder with Limited Data Using Augmentation-Conditional Discriminator. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Seoul, Korea.
- Dominik Klement, Mireia Diez, Federico Landini, Lukáš Burget, Anna Silnova, Marc Delcroix & Naohiro Tawara (2024). Discriminative Training of VBx Diarization. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Seoul, Korea.
- Yuto Kondo, Hirokazu Kameoka, Kou Tanaka & Takuhiro Kaneko (2024). SELECTING N-LOWEST SCORES FOR TRAINING MOS PREDICTION MODELS. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Seoul, Korea.
- Masahiro Nakano, Ryohei Shibue & Kunio Kashino (2024). Sunflower Strategy for Bayesian Relational Data Analysis. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Seoul, Korea.
- Yasunori Ohishi, Marc Delcroix, Tsubasa Ochiai, Shoko Araki, Daiki Takeuchi, Daisuke Niizumi, Akisato Kimura, Noboru Harada & Kunio Kashino (2024). Target Speech Spotting and Extraction Based on ConceptBeam. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Seoul, Korea.
- Junyi Peng, Marc Delcroix, Tsubasa Ochiai, Oldrich Plchot, Shoko Araki & Jan Cernocky (2024). Target Speech Extraction with pre-trained self-supervised learning models. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Seoul, Korea.
- Hanako Segawa, Tsubasa Ochiai, Marc Delcroix, Tomohiro Nakatani, Rintaro Ikeshita, Shoko Araki, Takeshi Yamada & Shoji Makino (2024). Neural network-based virtual microphone estimation with virtual microphone and beamformer-level multi-task loss. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Seoul, Korea.
- Naohiro Tawara, Marc Delcroix, Atsushi Ando & Atsunori Ogawa (2024). NTT speaker diarization system for CHiME-7: multi-domain, multi-microphone End-to-end and vector clustering diarization. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Seoul, Korea.
- Keigo Wakayama, Tsubasa Ochiai, Marc Delcroix, Masahiro Yasuda, Shoichiro Saito, Shoko Araki & Akira Nakayama (2024). Online Target Sound Extraction with Knowledge Distillation from Partially Non-Causal Teacher. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Seoul, Korea.
- Shiqi Zhang, Zheng Qiu, Daiki Takeuchi, Noboru Harada & Shoji Makino (2024). Unrestricted Global-Phase-Bias Aware Single-channel Speech Enhancement with Conformer-based Metric GAN. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Seoul, Korea.
- Takuhiro kaneko (2024). Improving Physics Augmented Continuum Neural Radiance Fileds-Based Geometry-Agnostic System Identification with Lagrangian Particle Optimization. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Seattle, WA, USA.
- Yu Mitsuzumi, Akisato Kimura & Hisashi Kashima (2024). Understanding and Improving Source-free Domain Adaptation from a Theoretical Perspective. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Seattle, USA.
- Kunio Kashino, Akisato Kimura & Shoji Matsuya (2024). Detection of acute myeloid leukemia without labeling individual blood cells. International Conference of the IEEE Engineering in Medicine and Biology Society. Orlando, USA.
- Masahiro Nakano, Hiroki Sakuma, Ryo Nishikimi, Ryohei Shibue, Takashi Sato & Kunio Kashino (2024). Warped Diffusion for Latent Differentiation Inference. International Conference on Artificial Intelligence and Statistics (AISTATS). Valencia, Spain.
- Kenichi Fujita, Takanori Ashihara, Marc Delcroix & Yusuke Ijima (2024). Lightweight Zero-shot Text-to-Speech with Mixture of Adapters. Interspeech2024. Kos Island, Greece.
- Keigo Hojo, Yukoh Wakabayashi, Kengo Ohta, Atsunori Ogawa & Norihide Kitaoka (2024). Boosting CTC-based ASR using inter-layer attention-based CTC loss. Interspeech2024. Kos Island, Greece.
- Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka & Yuto Kondo (2024). FastVoiceGrad: One-step Diffusion-Based Voice Conversion with Adversarial Conditional Diffusion Distillationa. Interspeech2024. Kos Island, Greece.
- Kohei Matsuura, Takanori Ashihara, Takafumi Moriya, Masato Mimura, Takatomo Kano, Atsunori Ogawa & Marc Delcroix (2024). Sentence-wise Speech Summarization: Task, Datasets, and End-to-End Modeling with LM Knowledge Distillation. Interspeech2024. Kos Island, Greece.
- Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada, Masahiro Yasuda, Shunsuke Tsubaki & Keisuke Imoto (2024). M2D-CLAP: Masked Modeling Duo Meets CLAP for Learning General-purpose Audio-Language Representation. Interspeech2024. Kos Island, Greece.
- Hiroshi Sato, Takafumi Moriya, Masato Mimura, Shota Horiguchi, Tsubasa Ochiai, Takanori Ashihara, Atsushi Ando, Kentaro Shinayama & Marc Delcroix (2024). SpeakerBeam-SS: Real-time Target Speaker Extraction with Lightweight Conv-TasNet and State Space Modeling. Interspeech2024. Kos Island, Greece.
- Tatsunari Takagi, Yukoh Wakabayashi, Atsunori Ogawa & Norihide Kitaoka (2024). Text-only domain adaptation for CTC-based speech recognition through substitution of implicit linguistic information in the search space. Interspeech2024. Kos Island, Greece.
- Marvin Tammen, Tsubasa Ochiai, Marc Delcroix, Tomohiro Nakatani, Shoko Araki & Simon Doclo (2024). Array Geometry-Robust Attention-Based Neural Beamformer for Moving Speakers. Interspeech2024. Kos Island, Greece.
- Kou Tanaka, Hirokazu Kameoka, Takuhiro Kaneko & Yuto Kondo (2024). PRVAE-VC2: Non-Parallel Voice Conversion by Distillation of Speech Representations. Interspeech2024. Kos Island, Greece.
- Daisuke Niizumi, Noboru Harada, Yasunori Ohishi, Daiki Takeuchi & Masahiro Yasuda (2024). ToyADMOS2#: Yet Another Dataset for The DCASE2024 Challenge Task 2 First-Shot Anomalous Sound Detection. Proceedings of the Detection and Classification of Acoustic Scenes and Events 2024 Workshop (DCASE2024). Tokyo, Japan.
- Tomoya Nishida, Noboru Harada, Daisuke Niizumi, Davide Albertini, Roberto Sannino, Simone Pradolini, Filippo Augusti, Keisuke Imoto, Kota Dohi, Harsh Purohit, Takashi Endo & Yohei Kawaguchi (2024). Description and Discussion on DCASE 2024 Challenge Task 2: First-Shot Unsupervised Anomalous Sound Detection for Machine Condition Monitoring. Proceedings of the Detection and Classification of Acoustic Scenes and Events 2024 Workshop (DCASE2024). Tokyo, Japan.
- Daiki Takeuchi, Masahiro Yasuda, Daisuke Niizumi & Noboru Harada (2024). Towards Learning a Difference-Aware General-Purpose Audio Representation. Proceedings of the Detection and Classification of Acoustic Scenes and Events 2024 Workshop (DCASE2024). Tokyo, Japan.
- Thilo von Neumann, Christoph Boeddeker, Marc Delcroix & Reinhold Haeb-Umbach (2024). MeetEval, Show Me the Errors! Interactive Visualization of Transcript Alignments for the Analysis of Conversational ASR. Show & Tell Demo, IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). Seoul, Korea.
Misc
- 石川憲治, 谷川理佐子, 阿久津真理子, 及川靖広. (2024). 干渉縞で音場を見る : 光学的音響計測を用いた音響現象の可視化と解明. 光アライアンス, 35 (8), 1-5.





























































