Real-Time Sound Ability Augmentation

Real-Time Sound Communication Ability Augmentation System Based on Small-Data Machine Learning

Title

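Real-Time Sound Communication Ability Augmentation System Based on Small-Data Machine Learning (2023-2025, Tateishi Science and Technology Foundation Research Grant (S), co-investigator)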

Publications

(Saito et al., 2024)
(Seki et al., 2024)
(Saito et al., 2024, in Japanese)
(Ishikawa et al., 2024)
(Ishikawa et al., 2024, in Japanese)

References

2024

Corpus

SRC4VC: Smartphone-Recorded Corpus for Voice Conversion Benchmark

Yuki Saito, Takuto Igarashi, Kentaro Seki, Shinnosuke Takamichi, Ryuichi Yamamoto, Kentaro Tachibana, and Hiroshi Saruwatari

In Proceedings of Interspeech, 2024

@inproceedings{saito24interspeech_src4vc,
  abbr_publisher = {Proceedings of Interspeech},
  booktitle = {Proceedings of Interspeech},
  title = {SRC4VC: Smartphone-Recorded Corpus for Voice Conversion Benchmark},
  author = {Saito, Yuki and Igarashi, Takuto and Seki, Kentaro and Takamichi, Shinnosuke and Yamamoto, Ryuichi and Tachibana, Kentaro and Saruwatari, Hiroshi},
  year = {2024},
  memo = {This research was conducted as joint research between LY Corporation and Saruwatari-Takamichi Laboratory of The University of Tokyo, Japan. This work was supported by Research Grant S of the Tateishi Science and Technology Foundation.}
}

Voice Conversion

Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals

Kentaro Seki, Shinnosuke Takamichi, Norihiro Takamune, Yuki Saito, Kanami Imamura, and Hiroshi Saruwatari

In Proceedings of Interspeech, 2024

@inproceedings{seki24interspeech_spatial-voice-conversion,
  abbr_publisher = {Proceedings of Interspeech},
  booktitle = {Proceedings of Interspeech},
  title = {Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals},
  author = {Seki, Kentaro and Takamichi, Shinnosuke and Takamune, Norihiro and Saito, Yuki and Imamura, Kanami and Saruwatari, Hiroshi},
  year = {2024},
  memo = {This work is supported by Research Grant S of the Tateishi Science and Technology Foundation.}
}

Corpus

SRC4VC Dataset: A Speech Corpus Recorded on Real Devices for Benchmarking Multi-Speaker Voice Conversion Models

Yuki Saito, Takuto Igarashi, Kentaro Seki, Shinnosuke Takamichi, Ryuichi Yamamoto, Kentaro Tachibana, and Hiroshi Saruwatari

In IEICE Technical Committee on Speech (SP), Mar 2024

@inproceedings{saito24sp_real-environment-vc,
  abbr_publisher = {電子情報通信学会 音声研究会},
  booktitle = {電子情報通信学会 音声研究会},
  title = {{SRC4VC}データセット：多話者音声変換モデルのベンチマークを目的とした実デバイス収録音声コーパス},
  author = {齋藤, 佑樹 and 五十嵐, 琢斗 and 関, 健太郎 and 高道, 慎之介 and 山本, 龍一 and 橘, 健太郎 and 猿渡, 洋},
  year = {2024},
  memo = {This research was conducted as joint research between LY Corporation and Saruwatari-Takamichi Laboratory of The University of Tokyo, Japan. This work was supported by Research Grant S of the Tateishi Science and Technology Foundation.}
}

Real-Time Noise Estimation for Lombard-Effect Speech Synthesis in Human–Avatar Dialogue Systems

Yuto Ishikawa, Osamu Take, Tomohiko Nakamura, Norihiro Takamune, Yuki Saito, Shinnosuke Takamichi, and Hiroshi Saruwatari

In Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2024

@inproceedings{ishikawa24apsipa_lombard,
  abbr_publisher = {Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)},
  booktitle = {Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)},
  title = {Real-Time Noise Estimation for Lombard-Effect Speech Synthesis in Human–Avatar Dialogue Systems},
  author = {Ishikawa, Yuto and Take, Osamu and Nakamura, Tomohiko and Takamune, Norihiro and Saito, Yuki and Takamichi, Shinnosuke and Saruwatari, Hiroshi},
  year = {2024}
}

A Study on Lombard-Effect-Simulating Speech Synthesis Using Real-Time Noise Estimated under Diffuse Noise in Human-Avatar Dialogue Systems

Yuto Ishikawa, Osamu Take, Tomohiko Nakamura, Norihiro Takamune, Yuki Saito, Shinnosuke Takamichi, and Hiroshi Saruwatari

In Autumn Meeting of the Acoustical Society of Japan, 2024

@inproceedings{ishikawa24asja_lombard,
  abbr_publisher = {日本音響学会秋季研究発表会},
  booktitle = {日本音響学会秋季研究発表会},
  title = {人間とアバターとの対話システムにおける拡散性雑音下リアルタイム推定雑音を用いたLombard効果模擬音声合成のための検討},
  author = {石川, 悠人 and 武, 伯寒 and 中村, 友彦 and 高宗, 典玄 and 齋藤, 佑樹 and 高道, 慎之介 and 猿渡, 洋},
  year = {2024}
}