Here are
58 public repositories
matching this topic...
[T] ~ Nova Wallet ~ GUI wallet for windows on the bittensor network polkadot you can use this to store your TAO under a polkadot address [T]
Transformer OCR by Torch Lightning
Updated
Feb 17, 2023
Python
Efficient Fine-tuning for LLM
Updated
Jun 5, 2024
Python
quick is the simple trainer built on the top of pytorch & deepspeed for making my deep learning model training more smoother & faster.
Updated
May 9, 2021
Python
A framework for benchmarking various DNN inference engine.
Updated
Nov 8, 2021
Python
Create an environment within AzureML that supports Deepspeed training, execute some example training processes thereon.
Updated
Nov 8, 2021
Jupyter Notebook
Updated
Jul 4, 2022
Python
Running Large Language Model easily.
Updated
May 31, 2024
Python
Sample codes and guidelines on how to finetune any opensource GPT models using #deepspeed and #huggingface
Small Model Is All You Need - NTU SC4001 Neural Network & Deep Learning Project
Updated
Nov 9, 2023
Python
Framework, Model & Kernel Optimizations for Distributed Deep Learning - Data Hack Summit
Updated
Aug 1, 2023
Python
MiniGPT-4基于DeepSpeed加速➕ 扩充模型规模 ➕ 实验分析
Updated
Apr 15, 2024
Jupyter Notebook
Minimal yet high performant code for pretraining llms. Attempts to implement some SOTA features. Implements training through: Deepspeed, Megatron-LM, and FSDP. WIP
Updated
Feb 6, 2024
Python
Natural Language Processing (NLP) and Large Language Models (LLM) with Fine-Tuning LLM and Trainer with DeepSpeed
Updated
Jan 5, 2024
Jupyter Notebook
The official implementation of paper "Demystifying Instruction Mixing for Fine-tuning Large Language Models"
Updated
Jan 5, 2024
Python
🚀 使用 Deepspeed 训练 Diffusers | Training Diffusers with Deepspeed 这是第一个完全使用 Deepspeed 的开源扩散模型数据并行/Zero 框架。 The first diffusion model data parallel/Zero framework in open source that uses Deepspeed exclusively.
Updated
Jun 6, 2024
Python
Train a Performer Dual Encoder to get Language Agnostic Sentence Embeddings like LABSE
Updated
Jan 19, 2021
Python
使用自己的tokenizer继续预训练大语言模型。
Updated
Jun 25, 2023
Python
Instruction-following LLaMA Model Trained with Deepspeed to Output Python-Code from General Instructions
Updated
May 5, 2023
Python
Improve this page
Add a description, image, and links to the
deepspeed
topic page so that developers can more easily learn about it.
Curate this topic
Add this topic to your repo
To associate your repository with the
deepspeed
topic, visit your repo's landing page and select "manage topics."
Learn more
You can’t perform that action at this time.