MelGAN Multi GPU Implementation.
-
Updated
Nov 19, 2019 - Python
MelGAN Multi GPU Implementation.
SE-MelGAN - Speaker Agnostic Rapid Speech Enhancement
MelGAN with catalyst framework
MelGAN implementation with Multi-Band and Full Band supports...
Voice Conversion pipeline consisting of GE2E speaker encoder, AutoVC conversion model and MelGAN vocoder.
Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.
🐸💬 Coqui TTS Double Decoder Consistency samples
Ultrafast GAN based Vocoder for Text to Speech
VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech ✊
Catalan Text to Speech
Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)
Unofficial Pytorch Lightning Implementation of "Real-time Speech Frequency Bandwidth Extension"
🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)
A neural network (GAN) trained to apply metal screaming effects, turning vocals from songs, speeches or whispers into realistic screams and growls.
Add a description, image, and links to the melgan topic page so that developers can more easily learn about it.
To associate your repository with the melgan topic, visit your repo's landing page and select "manage topics."