2025年度に発表された文献の一覧
学術論文誌
- S. Luan, Y. Wakabayashi, T. Toda, "Generalized sound field interpolation for freely spaced microphone arrays in rotation-robust beamforming," Applied Acoustics, Vol. 236, Article 110706, pp. 1-15. Apr. 2025.
- M. Eshghi, T. Toda, "Predicting fundamental frequency patterns in electrolaryngeal speech using automated phoneme extraction," IEEE Access, Vol. 13, pp. 73831-73847, Apr. 2025.
- Y. Ohtani, T. Okamoto, T. Toda, H. Kawai, "Fast neural vocoder with fundamental frequency control using finite impulse response filters," IEEE Transactions on Audio, Speech and Language Processing, Vol. 33, pp. 1893-1906, Apr. 2025.
- D. Ma, Y. Choi, T. Fujimura, F. Li, C. Xie, K. Kobayashi, T. Toda, "Sequence-to-sequence voice conversion-based techniques for electrolaryngeal speech enhancement in noisy and reverberant conditions," APSIPA Transactions on Signal and Information Processing, Vol. 14, No. 1, e8, pp. 1-40, May 2025.
- C. Xie, T. Toda, "An investigation of noisy-to-noisy voice conversion performance in various noisy conditions," APSIPA Transactions on Signal and Information Processing, Vol. 14, No. 1, e10, pp. 1-30, June 2025.
- T. Fujimura, T. Toda, "Analysis and extension of noisy-target training for unsupervised target signal enhancement," APSIPA Transactions on Signal and Information Processing, Vol. 14, No. 1, e12, pp. 1-27, June 2025.
- I. Kuroyanagi, T. Fujimura, K. Takeda, T. Toda, "Improving anomalous sound detection through pseudo-anomalous set selection and pseudo-label utilization under unlabeled conditions," APSIPA Transactions on Signal and Information Processing, Vol. 14, No. 1, e13, pp. 1-28, June 2025.
- J. He, T. Toda, "PMF-CEC: phoneme-augmented multimodal fusion for context-aware ASR error correction with error-specific selective decoding," IEEE Transactions on Audio, Speech and Language Processing, Vol. 33, pp. 2402-2417, June 2025.
- Y. Choi, C. Xie, T. Toda, "Noise and reverberation-controllable voice conversion," IEEE Transactions on Audio, Speech and Language Processing, Vol. 33, pp. 2430-2443, June 2025.
国際会議
- Y. Hashizume, T. Toda, "Investigation of perceptual music similarity focusing on each instrumental part," Proc. IEEE ICASSP, 5 pages, Hyderabad, India, Apr. 2025.
- T. Fujimura, I. Kuroyanagi, T. Toda, "Improvements of discriminative feature space training for anomalous sound detection in unlabeled conditions," Proc. IEEE ICASSP, 5 pages, Hyderabad, India, Apr. 2025.
- K. Nishizawa, R. Yamamoto, W.-C. Huang, T. Toda, "Investigating factors related to the naturalness of synthesized unison singing," Proc. IEEE ICASSP, 5 pages, Hyderabad, India, Apr. 2025.
- T. Ogura, T. Okamoto, Y. Ohtani, E. Cooper, T. Toda, H. Kawai, "Mora-level prosody prediction for text-to-speech using Japanese BERT without accentual labels," Proc. IEEE ICASSP, 5 pages, Hyderabad, India, Apr. 2025.
招待講演
- 米山 怜於, "ニューラルボコーダ概説:生成モデルと実用性の観点から," 音学シンポジウム, 招待講演, 東京, June 2025.
- 戸田 智基, "音声研究の知見がニューラルボコーダの発展にもたらす効果," 音学シンポジウム, 招待講演, 東京, June 2025.
研究会
- 藤村 拓弥, "ICASSP2025における異常音検知の動向," 信学技報, Vol. 125, No. 36, EA2025-1, pp. 1-6, May 2025.
応用音響研究会, オーガナイズドセッション, May 2025.
- 橋爪 優果, "ICASSP2025における音楽情報処理の動向," 信学技報, Vol. 125, No. 36, EA2025-3, pp. 13-17, May 2025.
- 米山 怜於, "ニューラルボコーダ概説:生成モデルと実用性の観点から," 情報処理研報, Vol. 2025-SLP-156, No. 3, 1 page, June 2025.
- 戸田 智基, "音声研究の知見がニューラルボコーダの発展にもたらす効果," 情報処理研報, Vol. 2025-SLP-156, No. 4, 1 page, June 2025.
- 宮司 光梨, 澤田 桂都, ホワン ウェンチン, 戸田 智基, "制御性の高いピアノ自動編曲に向けた楽曲難易度指標の設計," 情報処理研報, Vol. 2025-MUS-143, No. 8, pp. 1-7, June 2025.
- 山下 陽生, 岡本 拓磨, 高島 遼一, 大谷 大和, 滝口 哲也, 戸田 智基, 河井 恒, "重み付きAttentionのアライメント機構を用いた系列変換型声質変換," 情報処理研報, Vol. 2025-SLP-143, No. 75, pp. 1-6, June 2025.【音学シンポジウム2025優秀発表賞(受賞者:山下 陽生)】
- 服部 公宏, ホワン ウェンチン, 武田 一哉, 戸田 智基, "多様なシミュレーション音場における教師あり仮想マイクアレイ信号推定の汎化性能評価," 信学技報, Vol. 125, No. 74, SP2025-20, pp. 107-112, June 2025.
- W.-C. Huang, L.P. Violeta, T. Toda, "JATTS: a comparison-oriented Japanese text-to-speech open-sourced toolkit," 信学技報, Vol. 125, No. 74, SP2025-22, pp. 119-124, June 2025.
その他発表
- 西尾 直樹, 小林 和弘, 戸田 智基, 横井 紗矢香, 向山 宣昭, 和田 明久, 横井 麻衣, 重山 真由, 三谷 壮平, 曾根 三千彦, "電気のコエから自分のコエへ -Save the Voice Project-," 日本気管食道科学会会報, 特集5 パネルディスカッション1:喉頭摘出後のコミュニケーション支援, Vol. 76, No. 2, p. 108, Apr. 2025.
- W.-C. Huang, "Automatic quality assessment for speech and beyond," Talk, Conversational AI Reading Group, Mila/Concordia University, May 2025.