Omni Geoguessr AI: A Vision Transformer AI integrated with Geoguessr for automated geographic location prediction and gameplay using streetview panoramas.
A curated list of foundation models for vision and language tasks
Research and Materials on Hardware implementation of Transformer Model
MLX-VLM is a package for running Vision LLMs locally on your Mac using MLX.
Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector DB, and RAG.
AiTLAS implements state-of-the-art AI methods for exploratory and predictive analysis of satellite images.
Final project for 6.8301 - Computer Vision (spring 2024)
OpenMMLab Detection Toolbox and Benchmark
InternLM-XComposer2 is a groundbreaking vision-language large model (VLLM) excelling in free-form text-image composition and comprehension.
Self-Supervised Vision Transformers for multiplexed imaging datasets
Transformers 3rd Edition
RadioCare: Fighting Inefficiencies in Medical Imaging
A Simplified PyTorch Implementation of Vision Transformer (ViT)
A lightweight and extensible toolbox for image classification
[Nature Biomedical Engineering 2023] Decoding surgical activity from videos with a vision transformer
A series of foundational computer vision projects that anyone diving into the field should tackle.
Official PyTorch implementation of the CVPR 2024 paper: State Space Models for Event Cameras (Spotlight).
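Several of the repositories above center on the Vision Transformer (ViT), whose first step is turning an image into a sequence of patch tokens. The sketch below illustrates that patch-embedding step in plain NumPy; the function name, dimensions, and random projection matrix are illustrative assumptions (a real ViT learns the projection), not taken from any of the listed projects.

```python
import numpy as np

def patch_embed(image, patch_size=16, embed_dim=8, seed=0):
    """Split an (H, W, C) image into non-overlapping patches and linearly
    project each flattened patch to embed_dim, yielding a token sequence.
    The projection matrix here is random for illustration; ViT learns it."""
    h, w, c = image.shape
    assert h % patch_size == 0 and w % patch_size == 0, "image must tile evenly"
    # Rearrange into (num_patches, patch_size * patch_size * c)
    patches = (
        image.reshape(h // patch_size, patch_size, w // patch_size, patch_size, c)
             .transpose(0, 2, 1, 3, 4)
             .reshape(-1, patch_size * patch_size * c)
    )
    proj = np.random.default_rng(seed).normal(size=(patches.shape[1], embed_dim))
    return patches @ proj  # shape: (num_patches, embed_dim)

tokens = patch_embed(np.zeros((64, 64, 3)))
print(tokens.shape)  # (16, 8): a 4x4 grid of patches, each an 8-dim token
```

After this step, a transformer encoder processes the token sequence exactly as it would a sentence of word embeddings, which is what the ViT implementations listed above build on.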