Skip to content
@dvlab-research

DV Lab

Deep Vision Lab

Pinned

  1. LISA LISA Public

    Project Page for "LISA: Reasoning Segmentation via Large Language Model"

    Python 1.5k 99

  2. LongLoRA LongLoRA Public

    Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

    Python 2.5k 241

  3. MGM MGM Public

    Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

    Python 2.9k 263

  4. LLaMA-VID LLaMA-VID Public

    Official Implementation for LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models

    Python 579 37

  5. Video-P2P Video-P2P Public

    Video-P2P: Video Editing with Cross-attention Control

    Python 332 22

  6. LLMGA LLMGA Public

    This project is the official implementation of 'LLMGA: Multimodal Large Language Model based Generation Assistant'

    Python 252 17

Repositories

Showing 10 of 63 repositories
  • MR-GSM8K Public

    Challenge LLMs to Reason About Reasoning: A Benchmark to Unveil Cognitive Depth in LLMs

    Python 30 0 2 0 Updated Apr 25, 2024
  • MGM Public

    Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

    Python 2,928 Apache-2.0 263 37 2 Updated Apr 22, 2024
  • LISA Public

    Project Page for "LISA: Reasoning Segmentation via Large Language Model"

    Python 1,465 Apache-2.0 99 47 1 Updated Apr 8, 2024
  • GroupContrast Public

    [CVPR 2024] GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding

    31 MIT 1 1 0 Updated Mar 15, 2024
  • Video-P2P Public

    Video-P2P: Video Editing with Cross-attention Control

    Python 332 22 5 0 Updated Mar 12, 2024
  • Parametric-Contrastive-Learning Public

    Parametric Contrastive Learning (ICCV2021) & GPaCo (TPAMI 2023)

    Python 222 MIT 29 5 0 Updated Feb 29, 2024
  • LongLoRA Public

    Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

    Python 2,451 Apache-2.0 241 39 1 Updated Feb 11, 2024
  • Prompt-Highlighter Public

    [CVPR 2024] Prompt Highlighter: Interactive Control for Multi-Modal LLMs

    Python 101 MIT 2 2 0 Updated Jan 25, 2024
  • LLMGA Public

    This project is the official implementation of 'LLMGA: Multimodal Large Language Model based Generation Assistant'

    Python 252 Apache-2.0 17 3 0 Updated Jan 22, 2024
  • LLaMA-VID Public

    Official Implementation for LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models

    Python 579 Apache-2.0 37 23 0 Updated Jan 10, 2024

Top languages

Loading…

Most used topics

Loading…