HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
-
Updated
Jul 23, 2023 - Python
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Vietnamese Text to Speech library
PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs
A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS
PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech
A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate E2E-TTS
Avocodo: Generative Adversarial Network for Artifact-free Vocoder
TTS models for Arabic (Tacotron2, FastPitch)
Baseline Recipe for VoicePrivacy Challenge 2022: anonymization systems and evaluation software
PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.
Use FastSpeech2 and HiFi-GAN to easily perform end-to-end Korean speech synthesis.
This is the experimental description of MnTTS2.
Audio samples from "HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis"
포스코 청년 AI·Big Data 아카데미 - AI 프로젝트
TTS (FastPitch) for German
In this repo, I developed a step-by-step pipeline for a standard MultiSpeaker Text-to-Speech system 😄 In general, I used Portaspeech as an acoustic model and iSTFTNet as vocoder...
Add a description, image, and links to the hifi-gan topic page so that developers can more easily learn about it.
To associate your repository with the hifi-gan topic, visit your repo's landing page and select "manage topics."