deepspeed

Personal Project: MPP-Qwen14B(Multimodal Pipeline Parallel-Qwen14B). Don't let the poverty limit your imagination! Train your own 14B LLaVA-like MLLM on RTX3090/4090 24GB.

fine-tuning pipeline-parallelism pretraining model-parallel deepspeed mllm multimodal-large-language-models qwen

Updated May 18, 2024
Jupyter Notebook

stanleylsx / llms_tool

Star

一个基于HuggingFace开发的大语言模型训练、测试工具。支持各模型的webui、终端预测，低参数量及全参数模型训练(预训练、SFT、RM、PPO、DPO)和融合、量化。

bloom pytorch falcon llama moss mistral aquila baichuan deepspeed chatglm chatglm2 internlm llama2 qwen xverse baichuan2 aquila2 chatglm3

Updated Dec 8, 2023
Python

git-cloner / llama2-lora-fine-tuning

Star

llama2 finetuning with deepspeed and lora

lora finetuning deepspeed llama2

Updated Jul 28, 2023
Python

HomebrewNLP / revlib

Star

Simple and efficient RevNet-Library for PyTorch with XLA and DeepSpeed support and parameter offload

deep-learning pytorch tpu revnet xla deepspeed momentumnet

Updated Aug 6, 2022
Python

jackaduma / ChatGLM-LoRA-RLHF-PyTorch

Star

A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the ChatGLM architecture. Basically ChatGPT but with ChatGLM

pytorch llama gpt lora finetune ppo peft deepspeed llm chatgpt rlhf reward-models chatglm chatglm-6b

Updated Apr 28, 2023
Python

CoinCheung / gdGPT

Star

Train llm (bloom, llama, baichuan2-7b, chatglm3-6b) with deepspeed pipeline mode. Faster than zero/zero++/fsdp.

nlp bloom pipeline pytorch deepspeed llm full-finetune model-parallization flash-attention llama2 baichuan2-7b chatglm3-6b mixtral-8x7b

Updated Feb 5, 2024
Python

saforem2 / l2hmc-qcd

Sponsor

Star

Application of the L2HMC algorithm to simulations in lattice QCD.

machine-learning deep-learning tensorflow monte-carlo pytorch hydra lattice mcmc hmc hamiltonian-monte-carlo lattice-qcd horovod gauge-theory deepspeed

Updated Feb 2, 2024
Jupyter Notebook

jackaduma / Alpaca-LoRA-RLHF-PyTorch

Star

A full pipeline to finetune Alpaca LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the Alpaca architecture. Basically ChatGPT but with Alpaca

pytorch llama gpt lora alpaca finetune ppo peft deepspeed llm chatgpt rlhf reward-models

Updated Apr 28, 2023
Python

bobo0810 / LearnDeepSpeed

Star

DeepSpeed教程 & 示例注释 & 学习笔记（大模型高效训练）

examples deepspeed large-language-models

Updated Sep 7, 2023
Python

xyjigsaw / LLM-Pretrain-SFT

Star

Scripts of LLM pre-training and fine-tuning (w/wo LoRA, DeepSpeed)

llama lora mistral deepspeed large-language-models baichuan2

Updated Jan 30, 2024
Python

Improve this page

Add a description, image, and links to the deepspeed topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the deepspeed topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

deepspeed

Here are 56 public repositories matching this topic...

InternLM / lmdeploy

OpenLLMAI / OpenRLHF

PKU-Alignment / safe-rlhf

zjunlp / KnowLM

alibaba / Megatron-LLaMA

Xirider / finetune-gpt2xl

shm007g / LLaMA-Cult-and-More

OpenMOSS / CoLLiE

intelligent-machine-learning / glake

sunzeyeah / RLHF

Coobiw / MiniGPT4Qwen

stanleylsx / llms_tool

git-cloner / llama2-lora-fine-tuning

HomebrewNLP / revlib

jackaduma / ChatGLM-LoRA-RLHF-PyTorch

CoinCheung / gdGPT

saforem2 / l2hmc-qcd

jackaduma / Alpaca-LoRA-RLHF-PyTorch

bobo0810 / LearnDeepSpeed

xyjigsaw / LLM-Pretrain-SFT

Improve this page

Add this topic to your repo