A tab for sd-webui for replacing objects in pictures or videos using detection prompt
-
Updated
May 29, 2024 - Python
A tab for sd-webui for replacing objects in pictures or videos using detection prompt
API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series
OpenMMLab Detection Toolbox and Benchmark
An intuitive Python tool for annotating images with bounding boxes. Easily assign custom classes to objects and save annotations. Includes AI model integration for automated annotation. Perfect for streamlining computer vision projects. classes to these objects, and save annotations.
Images to inference with no labeling (use foundation models to train supervised models).
Pranav's code contribution during his internship at ARL-W. Includes gaze-based object detection GUI and screen tracking script.
Grounding DINO module for use with Autodistill.
A project to combine Grounding-DINO with Meta AI's Segment Anything Model (SAM) and Stable Diffusion for image manipulation using prompts. The plan is to integrate these techniques and deploy the model on Hugging Face with a Gradio interface for users to detect, segment regions and inpaint them in images.
GroundedSAM Base Model plugin for Autodistill
This is the front-end webpage for the AI image cutout generator, the BE for this repo can be found here: https://github.com/OriginalByteMe/AI_Image_cutout_maker
AI Image Cutout Maker is a project that uses artificial intelligence to automatically create cutouts from images. This project is designed to simplify the process of creating cutouts, which can be a time-consuming task if done manually. This project utilizes the power of Segment Anything and Grounding Dino AI models to detect subjects in an image
A minimalistic webapp to perform zero shot object detection based on textual prompts using GroundingDINO
👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]
Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥
SegMate: A Segmentation Toolkit
This project explores the intersection of NLP and CV, showcasing the potential of leveraging three powerful models – SAM, Stable Diffusion, and Grounding DINO – to edit manipulate images through textual commands.
Automatic data labeling tool, with flexible search patterns, using the power of GroundingDINO (detect anything with language).
Explore the cutting edge of computer vision with this comprehensive repository, showcasing a spectrum from classical machine learning to state-of-the-art transformer models.
Add a description, image, and links to the grounding-dino topic page so that developers can more easily learn about it.
To associate your repository with the grounding-dino topic, visit your repo's landing page and select "manage topics."