5 papers accepted for Interspeech 2026!

The following papers have been accepted for Interspeech 2026. Congrats to all!

  • YODAS v3: Over 1 Million Hours of High-Bandwidth, Stereophonic, Multilingual Speech
  • ELSA: Acoustic Event-Level Semantic Alignment for Fine-Grained Reference-Free Text-to-Audio Evaluation
  • What Makes Us Hate Our Own Voice? Large-scale experiments on Playback–Imagery Gaps and Individual–Speech Feature Effects
  • On the Effect of Segmentation Width and Cluster Size on Speech Resynthesis and Continuation in Generative Spoken Language Models
  • Do speech foundation models perceive speaker similarity as humans do?

References

2026

  1. YODAS v3: Over 1 Million Hours of High-Bandwidth, Stereophonic, Multilingual Speech
    William Chen, Shinnosuke Takamichi, Sayaka Shiota, Satoru Fukayama, Samuele Cornell, and Shinji Watanabe
    In Proceedings of Interspeech, Sep 2026
  2. ELSA: Acoustic Event-Level Semantic Alignment for Fine-Grained Reference-Free Text-to-Audio Evaluation
    Shuntaro Suzuki, Kento Tokura, Daichi Yashima, Kanon Amemiya, Komei Sugiura, and Shinnosuke Takamichi
    In Proceedings of Interspeech, Sep 2026
  3. What Makes Us Hate Our Own Voice? Large-scale experiments on Playback–Imagery Gaps and Individual–Speech Feature Effects
    Koki Fukuda and Shinnosuke Takamichi
    In Proceedings of Interspeech, Sep 2026
  4. On the Effect of Segmentation Width and Cluster Size on Speech Resynthesis and Continuation in Generative Spoken Language Models
    Shunsuke Kando, Wataru Nakata, Shinnosuke Takamichi, and Yusuke Miyao
    In Proceedings of Interspeech, Sep 2026
  5. Do speech foundation models perceive speaker similarity as humans do?
    Minoru Kishi, Hayato Yagi, Shinnosuke Takamichi, and Yuki Saito
    In Proceedings of Interspeech, Sep 2026