発表文献

2026年度に発表された文献の一覧

学術論文誌

J. Mi, X. Shi, D. Ma, J. He, T. Fujimura, T. Toda, "Robust speech emotion recognition under human speech noise," Computer Speech and Language, Vol. 100, Article 101987, pp. 1-16, Apr. 2026.
T. Komatsu, H. Munakata, Y. Ishikawa, K. Takeda, T. Toda, "Semi-supervised text-audio contrastive learning method using pseudo-text input," APSIPA Transactions on Signal and Information Processing, Vol. 15, No. 1, pp. 183-198, Apr. 2026.
Y. Hashizume, T. Toda, "Investigation of perceptual music similarity based on individual instrumental parts by large-scale listening test," APSIPA Transactions on Signal and Information Processing, Vol. 15, No. 1, pp. 249-269, Apr. 2026.
J. Feng, Y. Yasuda, T. Toda, "An investigation of the robustness of flow- and diffusion-based speech generation models on noisy transcriptions," APSIPA Transactions on Signal and Information Processing, Vol. 15, No. 1, pp. 270-292, Apr. 2026.
W.-C. Huang, E. Cooper, T. Toda, "MOS-Bench: benchmarking generalization abilities of subjective speech quality assessment models," IEEE Transactions on Audio, Speech and Language Processing, Vol. 34, pp. 2385-2397, Apr. 2026.
X. Shi, X. Li, T. Toda, "Emotion similarity and shift: modeling temporal dynamic interactions for emotion prediction in conversation," IEEE Transactions on Audio, Speech and Language Processing, Vol. 34, pp. 2552-2567, Apr. 2026.

　

国際会議

T. Imamura, T. Komatsu, H. Munakata, T. Toda, "Audio-visual feature fusion for calibrating relevance scores of video moment retrieval," Proc. IEEE ICASSP, pp. 5551-5555, Barcelona, Spain, May 2026.
L.P. Violeta, X. Zhang, J. Shi, Y. Yasuda, W.-C. Huang, Z. Wu, T. Toda, "The singing voice conversion challenge 2025: from singer identity conversion to singing style conversion," Proc. IEEE ICASSP, pp. 17707-17711, Barcelona, Spain, May 2026.
J. Wang, T. Toda, "From fixed positions to free-form signals: Virtual Microphone signal estimation for general-purpose spatial audio processing," Proc. IEEE ICASSP, pp. 21011-21015, Barcelona, Spain, May 2026.
H. Munakata, T. Imamura, T. Nishimura, T. Komatsu, "CASTELLA: long audio dataset with captions and temporal boundaries," Proc. IEEE ICASSP, pp. 15352-15356, Barcelona, Spain, May 2026.

　

その他発表

S. Chen, T. Toda, "QHARMA-GAN: quasi-harmonic neural vocoder based on autoregressive moving average model," IEEE ICASSP, SPS journal paper presentation, Barcelona, Spain, May 2026.
D. Ma, L.P. Violeta, K. Kobayashi, T. Toda, "Pretraining and fine-tuning techniques for electrolaryngeal speech enhancement based on sequence-to-sequence voice conversion," IEEE ICASSP, SPS journal paper presentation, Barcelona, Spain, May 2026.
J. He, X. Shi, C.-H. Hu, J. Mi, X. Li, T. Toda, "M4SER: multimodal, multirepresentation, multitask, and multistrategy learning for speech emotion recognition," IEEE ICASSP, SPS journal paper presentation, Barcelona, Spain, May 2026.
B.M. Halpern, T.B. Tienkamp, T. Rebernik, R.J.J.H. van Son, S.A.H.J. de Visscher, M.J.H. Witjes, D. Abur, T. Toda, "XPPG-PCA: reference-free automatic speech severity evaluation with principal components," IEEE ICASSP, SPS journal paper presentation, Barcelona, Spain, May 2026.

　

他の年度はこちら