Catalan Text to Speech
-
Updated
Dec 12, 2022 - Python
Catalan Text to Speech
MelGAN with catalyst framework
SE-MelGAN - Speaker Agnostic Rapid Speech Enhancement
A neural network (GAN) trained to apply metal screaming effects, turning vocals from songs, speeches or whispers into realistic screams and growls.
Unofficial implementation of Multi-band MelGAN
MelGAN Multi GPU Implementation.
🐸💬 Coqui TTS Double Decoder Consistency samples
Unofficial Pytorch Lightning Implementation of "Real-time Speech Frequency Bandwidth Extension"
Voice Conversion pipeline consisting of GE2E speaker encoder, AutoVC conversion model and MelGAN vocoder.
Ultrafast GAN based Vocoder for Text to Speech
MelGAN implementation with Multi-Band and Full Band supports...
Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech ✊
Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.
Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)
VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
Add a description, image, and links to the melgan topic page so that developers can more easily learn about it.
To associate your repository with the melgan topic, visit your repo's landing page and select "manage topics."