Final intern talk: Distilling Self-Supervised-Learning-Based Speech Quality Assessment into Compact
Speaker: Benjamin Stahl
Host: Hannes Gamper
In this talk, we explore advancements in computational models for speech quality assessment. Self-supervised learning models have emerged as powerful front-ends, outperforming supervised-only models. However, their large size renders them…
TransVIP
Speech to Speech Translation System with Voice and Isochrony Preservation
We introduce a novel model framework, TransVIP, that leverages diverse datasets in a cascade fashion yet facilitates end-to-end inference through joint probability. Furthermore, we propose…
MSR Talk: Unsupervised Speech Reverberation Control with Diffusion Implicit Bridges
Speaker(s): Eloi Moliner
Host: Hannes Gamper
Speech reverberation control involves the manipulation of acoustic characteristics in speech recordings, including tasks like speech dereverberation or reverberation time reduction. Diffusion implicit bridges are a recently proposed domain translation…