Publications
2025
-
Blended English as an International Language Learning Program Utilizing Text-to-Speech Technology: A Pilot Study. eLearn, Oct 2025
-
Drum-to-Vocal Percussion Sound Conversion and Its Evaluation Methodology. In Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), Oct 2025
-
Constructing an In-the-Wild Spoken Dialogue Dataset Based on YouTube Dialogue Videos. In Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), Oct 2025
-
Active Learning for Text-to-Speech Synthesis with Informative Sample Collection. In Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), Oct 2025
-
VitaEval: Open-source Human Evaluation Tool for Video-to-Text and Video-to-Audio Systems. In International Conference on Natural Language Generation, Mar 2025
-
Real-Time Drum-to-Vocal Percussion Sound Conversion System. In International Society for Music Information Retrieval Late-Breaking/Demo Session, Sep 2025
-
Analysing the Language of Neural Audio Codecs. In IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Dec 2025
-
Learning Marmoset Vocal Patterns with a Masked Autoencoder for Robust Call Segmentation, Classification, and Caller Identification. In IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Dec 2025
-
Do statistical patterns in neural audio codec tokens by synthesized speech reveal structure beyond speech quality? In Joint Meeting Acoustical Society of America and Acoustical Society of Japan, Dec 2025
-
Analysis of a Dataset for Evaluating Semantic Relevance Between Text and Audio. In Joint Meeting Acoustical Society of America and Acoustical Society of Japan, Dec 2025
-
Developing learners’ communicative competence in English as an international language using non-native English variations with AI speech synthesis technology. In EUROCALL, Aug 2025
2024
2023
-
Speaking Practice Using Text-to-speech Technology: Japanese EFL Learners’ Perceptions. In WorldCALL, Nov 2023
-
ChatGPT-EDSS: An Empathetic Dialogue Speech Synthesis Model Trained on ChatGPT-Derived Context Word Embeddings (in Japanese). In 音学シンポジウム, Jun 2023
-
Effects of text-to-speech synthesized speech on learners’ presentation anxiety and self-efficacy: A comparison of two models. In Proc. EUROCALL, Aug 2023
-
TimToShape: Supporting Practice of Musical Instruments by Visualizing Timbre with 2D Shapes based on Crossmodal Correspondences. In Proc. IUI, Mar 2023