🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).
-
Updated
May 31, 2024 - HTML
🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).
A general purpose ComfyUI workflow for common use cases.
Github action workflows to generate awesome wallpapers with HuggingFace Inference API (serverless)
Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high performance and flexibility.
An AI image generation frontend focused on ease of use, versatility and capability for professional uses, packaged as a click-and-run executable.
A Text to Image generator website with the support of hugging face and stability ai. the project is under development
PyTorch implementation of PerCo (Towards Image Compression with Perfect Realism at Ultra-Low Bitrates, ICLR 2024)
A microframework on top of PyTorch with first-class citizen APIs for foundation model adaptation
Text-To-Image AI project in php and using @openai API
EditWorld: Simulating World Dynamics for Instruction-Following Image Editing
A curated list of Generative AI tools, works, models, and references
All code and data necessary to replicate experiments in the paper BAGM: A Backdoor Attack for Manipulating Text-to-Image Generative Models.
[IJCAI 2024, Official Code] for paper "AK4Prompts: Aesthetics-driven Automatically Keywords-Ranking for Prompts in Text-To-Image Models". Official Weights and Demos provided. 首个利用IAA做大模型提示词筛选的工作.
Stable Diffusion, SDXL, LoRA Training, DreamBooth Training, Automatic1111 Web UI, DeepFake, Deep Fakes, TTS, Animation, Text To Video, Tutorials, Guides, Lectures, Courses, ComfyUI, Google Colab, RunPod, NoteBooks, ControlNet, TTS, Voice Cloning, AI, AI News, ML, ML News, News, Tech, Tech News, Kohya LoRA, Kandinsky 2, DeepFloyd IF, Midjourney
All code related to the Try Before you Bias (TBYB) tool based on the paper: Quantifying Bias in Text-to-Image Generative Models. You can access a hosted TBYB web-service (and comprae evaluations to other users) via: https://huggingface.co/spaces/JVice/try-before-you-bias
[NeurIPS'23] "MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing".
High Quality Image Generation Model - Comes Under NGC Models @prithivmlmods
Stable Diffusion and LLMs offline on your own hardware
A collection of awesome text-to-image generation studies.
Add a description, image, and links to the text-to-image topic page so that developers can more easily learn about it.
To associate your repository with the text-to-image topic, visit your repo's landing page and select "manage topics."