List of publications in fiscal year 2026
Journal papers
- J. Mi, X. Shi, D. Ma, J. He, T. Fujimura, T. Toda, "Robust speech emotion recognition under human speech noise," Computer Speech and Language, Vol. 100, Article 101987, pp. 1-16, Apr. 2026.
- T. Komatsu, H. Munakata, Y. Ishikawa, K. Takeda, T. Toda, "Semi-supervised text-audio contrastive learning method using pseudo-text input," APSIPA Transactions on Signal and Information Processing, Vol. 15, No. 1, pp. 183-198, Apr. 2026.
- Y. Hashizume, T. Toda, "Investigation of perceptual music similarity based on individual instrumental parts by large-scale listening test," APSIPA Transactions on Signal and Information Processing, Vol. 15, No. 1, pp. 249-269, Apr. 2026.
- J. Feng, Y. Yasuda, T. Toda, "An investigation of the robustness of flow- and diffusion-based speech generation models on noisy transcriptions," APSIPA Transactions on Signal and Information Processing, Vol. 15, No. 1, pp. 270-292, Apr. 2026.
- W.-C. Huang, E. Cooper, T. Toda, "MOS-Bench: benchmarking generalization abilities of subjective speech quality assessment models," IEEE Transactions on Audio, Speech and Language Processing, Vol. 34, pp. 2385-2397, Apr. 2026.
- X. Shi, X. Li, T. Toda, "Emotion similarity and shift: modeling temporal dynamic interactions for emotion prediction in conversation," IEEE Transactions on Audio, Speech and Language Processing, Vol. 34, pp. 2552-2567, Apr. 2026.
International conferences
- T. Imamura, T. Komatsu, H. Munakata, T. Toda, "Audio-visual feature fusion for calibrating relevance scores of video moment retrieval," Proc. IEEE ICASSP, pp. 5551-5555, Barcelona, Spain, May 2026.
- L.P. Violeta, X. Zhang, J. Shi, Y. Yasuda, W.-C. Huang, Z. Wu, T. Toda, "The singing voice conversion challenge 2025: from singer identity conversion to singing style conversion," Proc. IEEE ICASSP, pp. 17707-17711, Barcelona, Spain, May 2026.
- J. Wang, T. Toda, "From fixed positions to free-form signals: virtual microphone signal estimation for general-purpose spatial audio processing," Proc. IEEE ICASSP, pp. 21011-21015, Barcelona, Spain, May 2026.
- H. Munakata, T. Imamura, T. Nishimura, T. Komatsu, "CASTELLA: long audio dataset with captions and temporal boundaries," Proc. IEEE ICASSP, pp. 15352-15356, Barcelona, Spain, May 2026.
Other presentations
- S. Chen, T. Toda, "QHARMA-GAN: quasi-harmonic neural vocoder based on autoregressive moving average model," IEEE ICASSP, SPS journal paper presentation, Barcelona, Spain, May 2026.
- D. Ma, L.P. Violeta, K. Kobayashi, T. Toda, "Pretraining and fine-tuning techniques for electrolaryngeal speech enhancement based on sequence-to-sequence voice conversion," IEEE ICASSP, SPS journal paper presentation, Barcelona, Spain, May 2026.
- J. He, X. Shi, C.-H. Hu, J. Mi, X. Li, T. Toda, "M4SER: multimodal, multirepresentation, multitask, and multistrategy learning for speech emotion recognition," IEEE ICASSP, SPS journal paper presentation, Barcelona, Spain, May 2026.
- B.M. Halpern, T.B. Tienkamp, T. Rebernik, R.J.J.H. van Son, S.A.H.J. de Visscher, M.J.H. Witjes, D. Abur, T. Toda, "XPPG-PCA: reference-free automatic speech severity evaluation with principal components," IEEE ICASSP, SPS journal paper presentation, Barcelona, Spain, May 2026.