An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
Updated Feb 14, 2025 - Python
Target speaker extraction and verification for multi-talker speech.
PyTorch implementation of WASE, described in our ICASSP 2021 paper "WASE: Learning When to Attend for Speaker Extraction in Cocktail Party Environments".
This is a demo for our paper 'Target Confusion in End-to-end Speaker Extraction: Analysis and Approaches'.
This is a demo for our paper 'Speaker-Aware Mixture of Mixtures Training for Weakly Supervised Speaker Extraction'.