Six Presentations at ICASSP 2026
We will present the following papers at ICASSP 2026.
- SPATIAL-CLAP: LEARNING SPATIALLY-AWARE AUDIO–TEXT EMBEDDINGS FOR MULTI-SOURCE CONDITIONS
- TTSOPS: A CLOSED-LOOP CORPUS OPTIMIZATION FRAMEWORK FOR TRAINING MULTI-SPEAKER TTS MODELS FROM DARK DATA
- XACLE Challenge 2026: The first x-to-audio alignment challenge
- MANGAVOX: DATASET OF ACTED VOICES ALIGNED WITH MANGA IMAGES TOWARDS COMPUTER UNDERSTANDING OF AUDIO COMICS
- SS-JDSC: SINGLE-SPEAKER JAPANESE DYSARTHRIC SPEECH CORPUS
- THREE-STAGE BSRNN FOR UNIVERSAL SPEECH ENHANCEMENT AND DATA CURATION USING A LARGE PRE-TRAINED SPEECH RESTORATION MODEL
References
2026
- SPATIAL-CLAP: LEARNING SPATIALLY-AWARE AUDIO–TEXT EMBEDDINGS FOR MULTI-SOURCE CONDITIONSIn Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) , May 2026
- TTSOPS: A CLOSED-LOOP CORPUS OPTIMIZATION FRAMEWORK FOR TRAINING MULTI-SPEAKER TTS MODELS FROM DARK DATAIn Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) , May 2026
- XACLE Challenge 2026: The first x-to-audio alignment challengeIn Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) , May 2026
- MANGAVOX: DATASET OF ACTED VOICES ALIGNED WITH MANGA IMAGES TOWARDS COMPUTER UNDERSTANDING OF AUDIO COMICSIn Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) , May 2026
- SS-JDSC: SINGLE-SPEAKER JAPANESE DYSARTHRIC SPEECH CORPUSIn Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) , May 2026
- THREE-STAGE BSRNN FOR UNIVERSAL SPEECH ENHANCEMENT AND DATA CURATION USING A LARGE PRE-TRAINED SPEECH RESTORATION MODELIn Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) , May 2026