melgan

A fast Text-to-Speech (TTS) model. Work well for English, Mandarin/Chinese, Japanese, Korean, Russian and Tibetan (so far). 快速语音合成模型，适用于英语、普通话/中文、日语、韩语、俄语和藏语（当前已测试）。

japanese pytorch tts english chinese russian korean tibetan mandarin tacotron dctts fastspeech melgan

Updated Mar 25, 2023
Python

v-iashin / SpecVQGAN

Star

Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)

audio video pytorch transformer gan multi-modal evaluation-metrics video-understanding vas video-features vqvae bmvc melgan audio-generation vggsound

Updated Jun 6, 2023
Jupyter Notebook

zeroone-universe / RealTimeBWE

Star

Unofficial Pytorch Lightning Implementation of "Real-time Speech Frequency Bandwidth Extension"

pytorch-implementation pytorch-lightning melgan bandwidth-extension

Updated Oct 25, 2023
Python

mozilla / TTS

Star

🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

python text-to-speech deep-learning speech pytorch tts vocoder tacotron tensorflow2 tacotron2 melgan speaker-encoder dataset-analysis glow-tts multiband-melgan gantts

Updated Nov 9, 2023
Jupyter Notebook

TensorSpeech / TensorFlowTTS

Star

😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)

Updated Nov 27, 2023
Python

gaetano-signorelli / ScreamNet

Star

A neural network (GAN) trained to apply metal screaming effects, turning vocals from songs, speeches or whispers into realistic screams and growls.

music tensorflow generative-adversarial-network audio-processing scream melgan metalcore

Updated Nov 30, 2023
Python

Improve this page

Add a description, image, and links to the melgan topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the melgan topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

melgan

Here are 23 public repositories matching this topic...

diver-j / melgan-multi

systemcorp-ai / systemcorp-ai.github.io

elephantmipt / MelGAN

ZirumAndBigBro / MelGan-WavGan

rishikksh20 / melgan

Mixergi / MelGAN

himajin2045 / voice-conversion

xcmyz / FastVocoder

shun60s / Mel-GAN-clone

erogol / ddc-samples

rishikksh20 / iSTFT-Avocodo-pytorch

rishikksh20 / VocGAN

ga642381 / FastSpeech2

mehdihosseinimoghadam / Catalan-Text-to-Speech

atomicoo / FCH-TTS

v-iashin / SpecVQGAN

zeroone-universe / RealTimeBWE

mozilla / TTS

TensorSpeech / TensorFlowTTS

gaetano-signorelli / ScreamNet

Improve this page

Add this topic to your repo