2018年度に発表された文献の一覧
学術論文誌
- K. Kobayashi, T. Toda, S. Nakamura, "Intra-gender statistical singing voice conversion with direct waveform modification using log-spectral differential," Speech Communication, Vol. 99, pp. 211-220, May 2018.
- S. Seki, T. Toda, K. Takeda, "Stereophonic music separation based on non-negative tensor factorization with cepstral distance regularization," IEICE Transactions on Fundamentals, Vol. E101-A, No. 7, pp. 1057-1064, July 2018.
- T. Kano, S. Takamichi, S. Sakti, G. Neubig, T. Toda, S. Nakamura, "An end-to-end model for cross-lingual transformation of paralinguistic information," Machine Translation, Vol. 32, No. 4, pp. 353-368, Dec. 2018.
- A. Tamamori, T. Hayashi, T. Toda, K. Takeda, "Daily activity recognition based on recurrent neural network using multi-modal signals," APSIPA Transactions on Signal and Information Processing, Vol. 7, e21, pp. 1-11, Dec. 2018.
国際会議
- T. Okamoto, K. Tachibana, T. Toda, Y. Shiga, H. Kawai, "An investigation of subband WaveNet vocoder covering entire audible frequency range with limited acoustic features," Proc. IEEE ICASSP, pp. 5654-5658, Calgary, Canada, Apr. 2018.
- K. Tachibana, T. Toda, Y. Shiga, H. Kawai, "An investigation of noise shaping with perceptual weighting for WaveNet-based speech generation," Proc. IEEE ICASSP, pp. 5664-5668, Calgary, Canada, Apr. 2018.
- K. Tanaka, H. Kameoka, K. Morikawa, "VAE-SPACE: deep generative model of voice fundamental frequency contours," Proc. IEEE ICASSP, pp. 5779-5783, Calgary, Canada, Apr. 2018.
- S. Seiya, R. Ito, K. Okamoto, U. Tanikawa, S. Ohira, D. Deguchi, T. Toda, "Development of "KamiRepo" system with automatic student identification to handle handwritten assignments on LMS," Proc. IEEE EDUCON, pp. 841-848, Santa Cruz de Tenerife, Spain, Apr. 2018.
- T. Kinnunen, J. Lorenzo-Trueba, J. Yamagishi, T. Toda, D. Saito, F. Villavicencio, Z. Ling, "A spoofing benchmark for the 2018 voice conversion challenge: leveraging from spoofing countermeasures for speech artifact assessment," Proc. Odyssey 2018, pp. 187-194, Les Sables d'Olonne, France, June 2018.
- J. Lorenzo-Trueba, J. Yamagishi, T. Toda, D. Saito, F. Villavicencio, T. Kinnunen, Z. Ling, "The voice conversion challenge 2018: promoting development of parallel and nonparallel methods," Proc. Odyssey 2018, pp. 195-202, Les Sables d'Olonne, France, June 2018.
- K. Kobayashi, T. Toda, "sprocket: open-source voice conversion software," Proc. Odyssey 2018, pp. 203-210, Les Sables d'Olonne, France, June 2018.
- Y.-C. Wu, P.L. Tobing, T. Hayashi, K. Kobayashi, T. Toda, "The NU non-parallel voice conversion system for the voice conversion challenge 2018," Proc. Odyssey 2018, pp. 211-218, Les Sables d'Olonne, France, June 2018.
- P.L. Tobing, Y.-C. Wu, T. Hayashi, K. Kobayashi, T. Toda, "NU voice conversion system for the voice conversion challenge 2018," Proc. Odyssey 2018, pp. 219-226, Les Sables d'Olonne, France, June 2018.
- T. Hayashi, S. Watanabe, T. Toda, K. Takeda, "Multi-Head Decoder for end-to-end speech recognition," Proc. INTERSPEECH, pp. 801-805, Hyderabad, India, Sep. 2018.
- Y.-C. Wu, K. Kobayashi, T. Hayashi, P.L. Tobing, T. Toda, "Collapsed segment detection and reduction for WaveNet vocoder," Proc. INTERSPEECH, pp. 1988-1992, Hyderabad, India, Sep. 2018.
- H. Kawahara, K. Sakakibara, M. Morise, H. Banno, T. Toda, T. Irino, "Frequency domain variants of velvet noise and their application to speech processing and synthesis," Proc. INTERSPEECH, pp. 2027-2031, Hyderabad, India, Sep. 2018.
- S. Tamura, K. Horio, H. Endo, S. Hayamizu, T. Toda, "Audio-visual voice conversion using deep canonical correlation analysis for deep bottleneck features," Proc. INTERSPEECH, pp. 2469-2473, Hyderabad, India, Sep. 2018.
- F. Ahmadi, T. Toda, "Designing a pneumatic bionic voice prosthesis - statistical approach for source excitation generation," Proc. INTERSPEECH, pp. 3142-3146, Hyderabad, India, Sep. 2018.
- K. Miyazaki, T. Hayashi, T. Toda, K. Takeda, "Connectionist temporal classification-based sound event encoder for converting sound events into onomatopoeia representations," Proc. EUSIPCO, pp. 857-861, Rome, Italy, Sep. 2018.
- K. Kobayashi, T. Toda, "Electrolarygeal speech enhancement with statistical voice conversion based on CLDNN," Proc. EUSIPCO, pp. 2129-2133, Rome, Italy, Sep. 2018.
- T. Hayashi, T. Komatsu, R. Kondo, T. Toda, K. Takeda, "Anomalous sound event detection based on WaveNet," Proc. EUSIPCO, pp. 2508-2512, Rome, Italy, Sep. 2018.
- M. Takada, S. Seki, T. Toda, "Self-produced speech enhancement and suppression method using air- and body-conductive microphones," Proc. APSIPA ASC, pp. 1240-1245, Hawaii, USA, 2018.
- P.L. Tobing, T. Hayashi, Y.-C. Wu, K. Kobayashi, T. Toda, "An evaluation of deep spectral mappings and WaveNet vocoder for voice conversion," Proc. IEEE SLT, pp. 297-303, Athens, Greece, Dec. 2018.
- T. Okamoto, T. Toda, Y. Shiga, H. Kawai, "Improving FFTNet vocoder with noise shaping and subband approaches," Proc. IEEE SLT, pp. 304-311, Athens, Greece, Dec. 2018.
- T. Hayashi, S. Watanabe, Y. Zhang, T. Toda, T. Hori, R. Astudillo, K. Takeda, "Back-translation-style data augmentation for end-to-end ASR," Proc. IEEE SLT, pp. 426-433, Athens, Greece, Dec. 2018.
著書・解説
- 中村 哲, Sakriani Sakti, Graham Neubig, 戸田 智基, 高道 慎之介, "音声言語の自動翻訳 -コンピュータによる自動翻訳を目指して-," 日本音響学会(編), コロナ社, June 2018.
- 高道 慎之介, 戸田 智基, "音声翻訳システムにおける音声変換の利用," 日本音響学会誌, Vol. 74, No. 9, pp. 535-538, Sep. 2018.
- K. Vijayan, H. Li, T. Toda, "Speech-to-singing voice conversion: the challenges and strategies for improving vocal conversion processes," IEEE Signal Processing Magazine, Vol. 36, No. 1, pp. 95-102, Jan. 2019.
- K. Miyazaki, T. Toda, T. Hayashi, K. Takeda, "Environmental sound processing and its applications," IEEJ Transactions on Electronics, Information and Systems, Vol. 14, No. 3, pp. 340-351, Mar. 2019.
講習会
- T. Toda, "Advanced Voice Conversion," Speech Processing Courses in Crete (SPCC), University of Crete, Heraklion, Greece, July 2018.
- T. Toda, "Hands on Voice Conversion," Speech Processing Courses in Crete (SPCC), University of Crete, Heraklion, Greece, July 2018.
- 戸田 智基, "音声分析・合成," 音声認識・音声対話技術講習会, 高度言語情報融合フォーラム(ALAGIN)技術開発部会 音声処理分科会, 京都大学, Aug. 2018.
招待講演
- 戸田 智基, "音声変換による発声機能の拡張," 東京大学ヒューマンオーグメンテーション学第4回セミナー, Nov. 2018.
- T. Toda, "Augmented vocal production towards new singing style development," Dagstuhl Seminar, Stimulus Talk at Seminar 19052: computational methods for melody and voice processing in music recordings, Wadern, Germany, Jan. 2019.
研究会
- 田村 哲嗣, 堀尾 健斗, 遠藤 肇, 速水 悟, 戸田 智基, "深層ボトルネック特徴と深層正準相関分析を用いたマルチモーダル声質変換," 信学技報, Vol. 118, No. 112, SP2018-4, pp. 13-18, June 2018.
- 高田 萌絵, 関 翔悟, 戸田 智基, "ウェアラブルな空気/体内伝導マイクロフォンを用いた自己発声音強調/抑圧法," 信学技報, Vol. 118, No. 190, EA2018-29, pp. 7-12, Aug. 2018.
- 内野 達貴, 橋詰 淳, 勝野 雅央, 戸田 智基, "嚥下障害診断における嚥下音からの咽頭残留判定," 信学技報, Vol. 118, No. 198, SP2018-27, pp. 23-27, Aug. 2018.
- 宮崎 晃一, 林 知樹, 戸田 智基, 武田 一哉, "End-to-Endアプローチに基づく音イベントの擬音語表現への記号化," 信学技報, Vol. 118, No. 198, SP2018-30, pp. 37-42, Aug. 2018.
- 山岸 順一, 安田 裕介, Y. Zhao, T. Warnita, F. Fang, Y. Peng, 田中 智宏, B. Zhuang, Y.-C. Wu, 須田 仁志, H.-T. Luong, P.L. Tobing, 高島 悠樹, "SLP研究会の新たな試み:国際会議既発表セッション," 情報処理研報, Vol. 2019-SLP-126, No. 7, pp. 1-6, Feb. 2019.
- 栗田 優佑, 小林 和弘, 武田 一哉, 戸田 智基, "波形加工に基づく統計的声質変換の外部雑音に対する頑健性," 信学技報, Vol. 118, No. 497, SP2018-115, pp. 317-322, Mar. 2019.
- 関 翔悟, 亀岡 弘和, 李 莉, 戸田 智基, 武田 一哉, "多チャンネル変分自己符号化器に基づく劣決定音源分離の評価," 信学技報, Vol. 118, No. 497, SP2018-116, pp. 323-328, Mar. 2019.
大会講演
- 高田 萌絵, 関 翔悟, 戸田 智基, "空気/体内伝導マイクロフォンを用いた雑音環境下における自己発声音強調/抑圧法," 音講論, 3-1-13, pp. 225-226, Sep. 2018.
- 関 翔悟, 林 知樹, 武田 一哉, 戸田 智基, "WaveNetに基づく振幅スペクトログラムからの波形生成," 音講論, 1-P-14, pp. 281-282, Sep. 2018.
- 林 知樹, 渡部 晋治, 戸田 智基, 武田 一哉, "End-to-End音声認識ためのMulti-Head Decoderネットワーク," 音講論, 1-2-9, pp. 925-926, Sep. 2018.
- M. Eshghi, S. Seki, K. Kobayashi, T. Toda, "Electrolaryngeal Speech Enhancement by Using Attached Microphones onto Electrolarynx," 音講論, 1-R-26, pp. 1023-1024, Sep. 2018.
- 岡本 拓磨, 戸田 智基, 志賀 芳則, 河井 恒, "FFTNetボコーダの高品質化に関する検討," 音講論, 1-R-39, pp. 1179-1182, Sep. 2018.
- 内野 達貴, 橋詰 淳, 勝野 雅央, 戸田 智基, "嚥下音を利用した嚥下障害診断のための咽頭残留推定法," 音講論, 2-5-4, pp. 1307-1308, Sep. 2018.
- 田村 哲嗣, 堀尾 健斗, 遠藤 肇, 速水 悟, 戸田 智基, "深層ボトルネック特徴と深層正準相関分析を用いたマルチモーダル声質変換," サイレント音声認識ワークショップ, No. 6, Sep. 2018.
- 山田 智也, 関 翔悟, 小林 和弘, 戸田 智基, "楽曲中歌声加工における声質変換精度向上のための歌声・伴奏分離法," 信号処理シンポジウム, B6-3, pp. 258-263, Nov. 2018.
- 出口 大輔, 清谷 竣也, 大平 茂輝, 戸田 智基, "手書きレポートとLMSの連携を実現する名大版紙レポシステムの全学運用," 大学ICT推進協議会 2018年度年次大会, MP-24, 3 pages, Nov. 2018.【大学ICT推進協議会2018年度年次大会 優秀ポスター賞】
- 関 翔悟, 亀岡 弘和, 李 莉, 戸田 智基, 武田 一哉, "多チャンネル変分自己符号化器を用いた劣決定音源分離," 音講論, 1-6-20, pp. 229-230, Mar. 2019.
- 岡田 慎太郎, 安藤 厚志, 戸田 智基, "音素事後確率を利用した表現学習に基づく発話感情認識," 音講論, 2-9-7, pp. 881-882, Mar. 2019.【第19回日本音響学会 学生優秀発表賞(受賞者:岡田 慎太郎)】
- 栗田 優佑, 小林 和弘, 武田 一哉, 戸田 智基, "雑音環境下における統計的声質変換の頑健性に関する調査," 音講論, 1-10-2, pp. 1017-1018, Mar. 2019.
- 岡本 拓磨, 戸田 智基, 志賀 芳則, 河井 恒, "基本周波数とメルケプストラムを用いたリアルタイムニューラルボコーダの検討," 音講論, 3-10-3, pp. 1057-1060, Mar. 2019.
- W.-C. Huang, Y.-C. Wu, H.-T. Hwang, P.L. Tobing, T. Hayashi, K. Kobayashi, T. Toda, Y. Tsao, H.-M. Wang, "Reducing mismatch of WaveNet vocoder for variational autoencoder based voice conversion," 音講論, 3-5-14, pp. 1317-1318, Mar. 2019.
- P.L. Tobing, Y.-C. Wu, T. Hayashi, K. Kobayashi, T. Toda, "Voice conversion with cyclic recurrent neural network for WaveNet fine-tuning," 音講論, 3-5-15, pp. 1319-1320, Mar. 2019.
- 安原 和輝, "End-to-End 型テキスト音声合成におけるWaveNetボコーダの学習に関する調査," 平成30年度電子情報通信学会東海支部卒業研究発表会, Po-077, 1 page, Mar. 2019.
その他発表
- Y.-C. Wu, K. Kobayashi, T. Hayashi, P.L. Tobing, T. Toda, "Collapsed speech segment detection and suppression for WaveNet vocoder," Google's 3rd Speech Technology Summit, London, UK, May 2018.
- P.L. Tobing, Y.-C. Wu, T. Hayashi, K. Kobayashi, T. Toda, "Development of NU voice conversion system 2018," Google's 3rd Speech Technology Summit, London, UK, May 2018.
- 内野 達貴, "嚥下音を利用した嚥下障害診断のための咽頭残留推定法," 第22回東海地区音声関連研究室修士論文中間発表会, 愛知, Aug. 2018.
- 山田 智也, "複数の歌声分離法に基づく楽曲中の歌声加工システムの提案," 第22回東海地区音声関連研究室修士論文中間発表会, 愛知, Aug. 2018.
- 戸田 智基, "発声者の協力的動作を活用した音声生成機能の拡張技術," JSTフェア2018, 東京, Aug. 2018.
- Z.-H. Ling, J. Yamagishi, J. Lorenzo-Trueba, T. Toda, D. Saito, F. Villavicencio, T. Kinnunen, "Voice Conversion Challenge 2018," ISCSLP, Demonstration session, D-4, Taipei, Taiwan, Nov. 2018.
修士論文
- 山田 智也, "楽曲中の歌声加工のための楽音信号分離・変換技術" 情報学研究科知能システム学専攻修士論文, Feb. 2019.
卒業論文
- 大竹 徹郎, "楽曲音源分離における各種音源抽出ネットワークの統合法" 平成30年度情報工学コース卒業研究報告, Feb. 2019.
- 多賀 遥香, "ユーザの協力的動作を活用したリアルタイム声質変換" 平成30年度情報工学コース卒業研究報告, Feb. 2019.
- 安原 和輝, "End-to-End 型テキスト音声合成におけるWaveNetボコーダの学習に関する調査," 平成30年度情報工学コース卒業研究報告, Feb. 2019.