Website for downloading MP4 clips from kick.com URLs
-
Updated
Jun 10, 2024 - Vue
Website for downloading MP4 clips from kick.com URLs
Open-source evaluation toolkit of large vision-language models (LVLMs), support GPT-4v, Gemini, QwenVLPlus, 50+ HF models, 20+ benchmarks
Transformers 3rd Edition
Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai
end-to-end image search app
Mammo-CLIP: A Vision Language Foundation Model to Enhance Data Efficiency and Robustness in Mammography
Image Captioning Using CLIP & GPT Models
An AI-powered natural language & reverse Image Search Engine powered by CLIP & qdrant.
Semantic alignment of astronomical data with natural language using multi-modal models. (Jax) Code associated with https://arxiv.org/abs/2403.08851.
Fine-tuning code for CLIP models
Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high performance and flexibility.
PyTorch Implementation of the CLIP Algorithm
[ICML 2024] Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models
Code for the paper "Towards Concept-based Interpretability of Skin Lesion Diagnosis using Vision-Language Models", ISBI 2024 (Oral).
3rd Place, Visual Prompt Tuning Challenge @ CVPR 2023 HIT Workshop (2023)
Run zero-shot prediction models on your data
Add a description, image, and links to the clip topic page so that developers can more easily learn about it.
To associate your repository with the clip topic, visit your repo's landing page and select "manage topics."