5 papers accepted for Interspeech 2026!

June 03, 2026

The following papers have been accepted for Interspeech 2026. Congrats to all!

YODAS v3: Over 1 Million Hours of High-Bandwidth, Stereophonic, Multilingual Speech
- (Chen et al., 2026)
ELSA: Acoustic Event-Level Semantic Alignment for Fine-Grained Reference-Free Text-to-Audio Evaluation
- (Suzuki et al., 2026)
What Makes Us Hate Our Own Voice? Large-scale experiments on Playback–Imagery Gaps and Individual–Speech Feature Effects
- (Fukuda & Takamichi, 2026)
On the Effect of Segmentation Width and Cluster Size on Speech Resynthesis and Continuation in Generative Spoken Language Models
- (Kando et al., 2026)
Do speech foundation models perceive speaker similarity as humans do?
- (Kishi et al., 2026)

References

2026

YODAS v3: Over 1 Million Hours of High-Bandwidth, Stereophonic, Multilingual Speech

William Chen, Shinnosuke Takamichi, Sayaka Shiota, Satoru Fukayama, Samuele Cornell, and Shinji Watanabe

In Proceedings of Interspeech, Sep 2026

@inproceedings{chen26interspeech_yodas-v3,
  abbr_publisher = {Proceedings of Interspeech},
  booktitle = {Proceedings of Interspeech},
  title = {YODAS v3: Over 1 Million Hours of High-Bandwidth, Stereophonic, Multilingual Speech},
  author = {Chen, William and Takamichi, Shinnosuke and Shiota, Sayaka and Fukayama, Satoru and Cornell, Samuele and Watanabe, Shinji},
  year = {2026},
  month = sep
}

ELSA: Acoustic Event-Level Semantic Alignment for Fine-Grained Reference-Free Text-to-Audio Evaluation

Shuntaro Suzuki, Kento Tokura, Daichi Yashima, Kanon Amemiya, Komei Sugiura, and Shinnosuke Takamichi

In Proceedings of Interspeech, Sep 2026

@inproceedings{suzuki26interspeech_elsa,
  abbr_publisher = {Proceedings of Interspeech},
  booktitle = {Proceedings of Interspeech},
  title = {ELSA: Acoustic Event-Level Semantic Alignment for Fine-Grained Reference-Free Text-to-Audio Evaluation},
  author = {Suzuki, Shuntaro and Tokura, Kento and Yashima, Daichi and Amemiya, Kanon and Sugiura, Komei and Takamichi, Shinnosuke},
  year = {2026},
  month = sep
}

What Makes Us Hate Our Own Voice? Large-scale experiments on Playback–Imagery Gaps and Individual–Speech Feature Effects

Koki Fukuda and Shinnosuke Takamichi

In Proceedings of Interspeech, Sep 2026

@inproceedings{fukuda26interspeech_own-voice,
  abbr_publisher = {Proceedings of Interspeech},
  booktitle = {Proceedings of Interspeech},
  title = {What Makes Us Hate Our Own Voice? Large-scale experiments on Playback--Imagery Gaps and Individual--Speech Feature Effects},
  author = {Fukuda, Koki and Takamichi, Shinnosuke},
  year = {2026},
  month = sep
}

On the Effect of Segmentation Width and Cluster Size on Speech Resynthesis and Continuation in Generative Spoken Language Models

Shunsuke Kando, Wataru Nakata, Shinnosuke Takamichi, and Yusuke Miyao

In Proceedings of Interspeech, Sep 2026

@inproceedings{kando26interspeech_speech-resynthesis,
  abbr_publisher = {Proceedings of Interspeech},
  booktitle = {Proceedings of Interspeech},
  title = {On the Effect of Segmentation Width and Cluster Size on Speech Resynthesis and Continuation in Generative Spoken Language Models},
  author = {Kando, Shunsuke and Nakata, Wataru and Takamichi, Shinnosuke and Miyao, Yusuke},
  year = {2026},
  month = sep
}

Do speech foundation models perceive speaker similarity as humans do?

Minoru Kishi, Hayato Yagi, Shinnosuke Takamichi, and Yuki Saito

In Proceedings of Interspeech, Sep 2026

@inproceedings{kishi26interspeech_speaker-similarity,
  abbr_publisher = {Proceedings of Interspeech},
  booktitle = {Proceedings of Interspeech},
  title = {Do speech foundation models perceive speaker similarity as humans do?},
  author = {Kishi, Minoru and Yagi, Hayato and Takamichi, Shinnosuke and Saito, Yuki},
  year = {2026},
  month = sep
}