An easy-to-use, scalable, and high-performance RLHF framework (supports 70B+ full-parameter tuning, LoRA, Mixtral, and KTO).
Personal project: MPP-Qwen14B (Multimodal Pipeline Parallel Qwen14B). Don't let poverty limit your imagination! Train your own 14B LLaVA-like MLLM on an RTX 3090/4090 with 24 GB of VRAM.
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
All about large language models
Collaborative Training of Large Language Models in an Efficient Way
llm-inference is a platform for publishing and managing LLM inference, providing a wide range of out-of-the-box features for model deployment, such as a UI, a RESTful API, auto-scaling, computing-resource management, monitoring, and more.
A toy large model for recommender systems based on LLaMA2, SASRec, and Meta's generative recommenders. Also includes notes and experiments on the official implementation of Meta's generative recommenders.
An Open-source, Knowledgeable Large Language Model Framework.
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
Shaping Language Models with Cognitive Insights
Minimal yet high-performance code for pretraining LLMs, implementing some SOTA features. Supports training through DeepSpeed, Megatron-LM, and FSDP. WIP.
Train LLMs (BLOOM, LLaMA, Baichuan2-7B, ChatGLM3-6B) with DeepSpeed pipeline mode. Faster than ZeRO/ZeRO++/FSDP.
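To illustrate what pipeline mode buys over pure data parallelism, here is a toy sketch of the pipeline-parallel idea: the model is split into stages and the batch into micro-batches so different stages can work concurrently. This is a simplified, hypothetical illustration in plain Python, not the repo's implementation (real DeepSpeed pipeline parallelism places each stage on a separate device and overlaps forward/backward passes):

```python
# Toy sketch of pipeline-parallel execution with micro-batches.
# Hypothetical and simplified: a "stage" here is just a function standing in
# for a slice of the model's layers; there is no actual multi-device overlap.

def make_stage(scale):
    """Stand-in for one pipeline stage (a contiguous slice of layers)."""
    return lambda x: x * scale

stages = [make_stage(2), make_stage(3)]  # two pipeline stages

def pipeline_forward(batch, stages, num_micro_batches=4):
    # Split the batch into micro-batches; in a real pipeline this lets
    # stage 0 start on micro-batch 2 while stage 1 processes micro-batch 1.
    size = len(batch) // num_micro_batches
    micro_batches = [batch[i * size:(i + 1) * size]
                     for i in range(num_micro_batches)]
    outputs = []
    for mb in micro_batches:
        for stage in stages:  # each micro-batch flows through every stage
            mb = [stage(x) for x in mb]
        outputs.extend(mb)
    return outputs

print(pipeline_forward(list(range(8)), stages))  # each element scaled by 2*3
```

The key point is that the unit of scheduling is the micro-batch, which keeps all stages busy instead of idling while one large batch traverses the model end to end.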
Application of the L2HMC algorithm to simulations in lattice QCD.
The official implementation of paper "Demystifying Instruction Mixing for Fine-tuning Large Language Models"
Natural Language Processing (NLP) and Large Language Models (LLMs): fine-tuning LLMs with the Trainer API and DeepSpeed.
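For Trainer-based fine-tuning, DeepSpeed is typically wired in through a JSON config file passed to the training arguments. The fragment below is a minimal sketch using real DeepSpeed config keys (ZeRO stage 2 with communication overlap); the `"auto"` values are placeholders that the Hugging Face integration fills in from the training arguments, and the exact settings here are illustrative, not a recommendation:

```json
{
  "train_batch_size": "auto",
  "gradient_accumulation_steps": "auto",
  "fp16": { "enabled": "auto" },
  "zero_optimization": {
    "stage": 2,
    "overlap_comm": true,
    "contiguous_gradients": true
  }
}
```

Saved as, say, `ds_config.json`, such a file can be referenced from `TrainingArguments(deepspeed="ds_config.json")` so the Trainer launches the DeepSpeed engine instead of plain PyTorch DDP.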