Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch
-
Updated
May 3, 2024 - Python
Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
This repository contains a hand-curated resources for Prompt Engineering with a focus on Generative Pre-trained Transformer (GPT), ChatGPT, PaLM etc
Stable Diffusion, SDXL, LoRA Training, DreamBooth Training, Automatic1111 Web UI, DeepFake, Deep Fakes, TTS, Animation, Text To Video, Tutorials, Guides, Lectures, Courses, ComfyUI, Google Colab, RunPod, NoteBooks, ControlNet, TTS, Voice Cloning, AI, AI News, ML, ML News, News, Tech, Tech News, Kohya LoRA, Kandinsky 2, DeepFloyd IF, Midjourney
Diffusion model papers, survey, and taxonomy
Implementation of Make-A-Video, new SOTA text to video generator from Meta AI, in Pytorch
Text To Video Synthesis Colab
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
Official Pytorch Implementation for "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" presenting "TokenFlow" (ICLR 2024)
Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation - in Pytorch
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
CLIP + FFT/DWT/RGB = text to image/video
Finetune ModelScope's Text To Video model using Diffusers 🧨
Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high performance and flexibility.
ICASSP 2022: "Text2Video: text-driven talking-head video synthesis with phonetic dictionary".
Implementation of Phenaki Video, which uses Mask GIT to produce text guided videos of up to 2 minutes in length, in Pytorch
✨ Hotshot-XL: State-of-the-art AI text-to-GIF model trained to work alongside Stable Diffusion XL
[Arxiv] A Survey on Video Diffusion Models
Multimodal AI Story Teller, built with Stable Diffusion, GPT, and neural text-to-speech
Implementation of NÜWA, state of the art attention network for text to video synthesis, in Pytorch
Add a description, image, and links to the text-to-video topic page so that developers can more easily learn about it.
To associate your repository with the text-to-video topic, visit your repo's landing page and select "manage topics."