#

deepspeed

Here are 58 public repositories matching this topic...

damomineraleo / BittensorGUI

[T] ~ Nova Wallet ~ GUI wallet for windows on the bittensor network polkadot you can use this to store your TAO under a polkadot address [T]

ai neural-network tensorflow machine pytorch machinelearning neural-machine-translation deepspeed bittensor aicreative computerlearning

Updated Jan 13, 2023

YooSungHyun / Transformer-OCR

Transformer OCR by Torch Lightning

ocr deep-learning opticalcharacterrecognition deepspeed vision-ai torch-lightning transformer-ocr

Updated Feb 17, 2023
Python

DONGRYEOLLEE1 / LLM-FT

Efficient Fine-tuning for LLM

deepspeed llm-training

Updated Jun 5, 2024
Python

thevasudevgupta / quick

quick is the simple trainer built on the top of pytorch & deepspeed for making my deep learning model training more smoother & faster.

pytorch deepspeed

Updated May 9, 2021
Python

siahuat0727 / bert-benchmark

A framework for benchmarking various DNN inference engine.

pytorch tensorrt onnxruntime deepspeed nnfusion

Updated Nov 8, 2021
Python

cdw / deepspeed_in_aml

Create an environment within AzureML that supports Deepspeed training, execute some example training processes thereon.

azureml deepspeed

Updated Nov 8, 2021
Jupyter Notebook

surdarla / image_playground

cifar, imagenet

pytorch alexnet fishnet deepspeed

Updated Jul 4, 2022
Python

janelu9 / EasyLLM

Running Large Language Model easily.

pipeline llama zero deepspeed llm

Updated May 31, 2024
Python

jistiak / finetune-gpt-deepspeed

Sample codes and guidelines on how to finetune any opensource GPT models using #deepspeed and #huggingface

gpt hf finetuning deepspeed llm

Updated Mar 31, 2023

ztjhz / miniLM

Small Model Is All You Need - NTU SC4001 Neural Network & Deep Learning Project

nlp deep-learning neural-network llama ntu bert roberta gpt2 wandb deepspeed llm sc4001

Updated Nov 9, 2023
Python

abhilash1910 / Framework-Optimization

Framework, Model & Kernel Optimizations for Distributed Deep Learning - Data Hack Summit

pytorch triton codegen inductor ddp deepspeed fsdp tensorparallel pipelineparallel

Updated Aug 1, 2023
Python

bobo0810 / MiniGPT-4-DeepSpeed

MiniGPT-4基于DeepSpeed加速➕ 扩充模型规模 ➕ 实验分析

deepspeed llm minigpt4

Updated Oct 11, 2023

Tommy-s-Online-Courses / DeepSpeed

DeepSpeed系列课程资料

Updated Apr 15, 2024
Jupyter Notebook

SulRash / minLLMTrain

Minimal yet high performant code for pretraining llms. Attempts to implement some SOTA features. Implements training through: Deepspeed, Megatron-LM, and FSDP. WIP

huggingface pretraining deepspeed megatron-lm llm fsdp

Updated Feb 6, 2024
Python

YanSte / NLP-LLM-Fine-tuning-DeepSpeed

Natural Language Processing (NLP) and Large Language Models (LLM) with Fine-Tuning LLM and Trainer with DeepSpeed

nlp fine-tuning deepspeed llm

Updated Jan 5, 2024
Jupyter Notebook

Reason-Wang / InstructLLM

The official implementation of paper "Demystifying Instruction Mixing for Fine-tuning Large Language Models"

nlp transformers fine-tuning deepspeed llm instruction-tuning llama2

Updated Jan 5, 2024
Python

dyedd / deepspeed-diffusers

🚀 使用 Deepspeed 训练 Diffusers | Training Diffusers with Deepspeed 这是第一个完全使用 Deepspeed 的开源扩散模型数据并行/Zero 框架。 The first diffusion model data parallel/Zero framework in open source that uses Deepspeed exclusively.

model diffusion deepspeed diffusers

Updated Jun 6, 2024
Python

AndreSoble / PerformerDualEncoder

Train a Performer Dual Encoder to get Language Agnostic Sentence Embeddings like LABSE

encoder pytorch dual performer deepspeed dualencoder

Updated Jan 19, 2021
Python

taishan1994 / chinese_llm_pretrained

使用自己的tokenizer继续预训练大语言模型。

lora language-model zero pretrained deepspeed llm gpt2-chinese

Updated Jun 25, 2023
Python

DominikLindorfer / pyAlpaca

Instruction-following LLaMA Model Trained with Deepspeed to Output Python-Code from General Instructions

pytorch llama gpt alpaca finetuning gpt-3 deepspeed gpt4 llm stanford-alpaca pyalpaca

Updated May 5, 2023
Python

Improve this page

Add a description, image, and links to the deepspeed topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the deepspeed topic, visit your repo's landing page and select "manage topics."