This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) with a vocoder that works in real-time. Feel free to check my ...
Tensor pre-processing on low memory hardware(?) fails due to Qwen, VAE and audio separately loaded to CPU/GPU causing dtype mismatch in preprocess_to_tensors(). Occurs on consumer 3070 ti. Temporary ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results