
[Refactor & Feature] Refactor xtuner chat to support lmdeploy & vLLM #317

Draft · wants to merge 30 commits into main
Conversation

pppppM (Collaborator) commented Jan 15, 2024

Motivation

  • Hook xtuner chat up to inference engines for acceleration
  • Support direct deployment of models trained with xtuner
  • Make it easier to build gradio apps on top of xtuner
  • Guarantee the chat template is consistent between training and deployment
  • Simplify the deployment workflow

Usage

  1. xtuner chat launch commands
# HF 
python xtuner/tools/new_chat.py internlm/internlm-chat-7b 

# LMDeploy (w/o adapter)
python xtuner/tools/new_chat.py internlm/internlm-chat-7b --lmdeploy

# vLLM (w/o adapter)
python xtuner/tools/new_chat.py internlm/internlm-chat-7b --vllm

# HF Moss
python xtuner/tools/new_chat.py meta-llama/Llama-2-7b-hf --adapter xtuner/Llama-2-7b-qlora-moss-003-sft --bot-name Llama2 --prompt-template moss_sft --system-prompt moss_sft --with-plugins calculate solve search 

# LMDeploy Moss (w/o adapter)
python xtuner/tools/new_chat.py MOSS_MERGED --bot-name Llama2 --prompt-template moss_sft --system-prompt moss_sft --with-plugins calculate solve search  --lmdeploy

# Lagent (only support HF)
python xtuner/tools/new_chat.py internlm/internlm-7b --adapter xtuner/internlm-7b-qlora-msagent-react --lagent

# Llava (only support HF)
python xtuner/tools/new_chat.py internlm/internlm-chat-7b \
  --visual-encoder openai/clip-vit-large-patch14-336 \
  --llava xtuner/llava-internlm-7b \
  --prompt-template internlm_chat \
  --image $IMAGE_PATH
  2. ChatBot usage

from xtuner.chat import BaseChat, CHAT_TEMPLATE
template = CHAT_TEMPLATE['internlm2-chat']

################# HF inference #####################
from xtuner.chat import HFBot
bot = HFBot('internlm/internlm2-chat-7b')
hf_bot = BaseChat(bot, chat_template=template)

## Chat
print(hf_bot.chat('Who are you?'))

## Streaming output
streamer = hf_bot.create_streamer()
hf_bot.chat('Who are you?', streamer=streamer)

## Iterable streamer (for gradio)
streamer = hf_bot.create_streamer(iterable=True)

from threading import Thread
chat_kwargs = dict(text='Who are you?', streamer=streamer)
thread = Thread(target=hf_bot.chat, kwargs=chat_kwargs)
thread.start()

for new_text in streamer:
    print(new_text, flush=True, end='')

## Clear history
hf_bot.reset_history()

## Offline batch prediction
results = hf_bot.predict(['Who are you?', 'What is your name?'])
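
The same BaseChat front end is intended to sit on top of the accelerated backends this PR adds. Below is a minimal sketch of what that could look like, assuming a hypothetical LMDeployBot class that mirrors HFBot's constructor; the actual backend class names are not shown in this PR, so treat them as placeholders.

################# LMDeploy / vLLM inference (sketch) #####################
# NOTE: LMDeployBot is an assumed name mirroring HFBot above; check
# xtuner.chat for the backend classes this PR actually introduces.
from xtuner.chat import BaseChat, CHAT_TEMPLATE
from xtuner.chat import LMDeployBot  # hypothetical import

template = CHAT_TEMPLATE['internlm2-chat']
bot = LMDeployBot('internlm/internlm2-chat-7b')  # accelerated backend
lmdeploy_bot = BaseChat(bot, chat_template=template)

## The front-end API stays the same as in the HF example
print(lmdeploy_bot.chat('Who are you?'))

## Offline batch prediction also goes through the same interface
results = lmdeploy_bot.predict(['Who are you?', 'What is your name?'])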


################# HF Llava inference #####################
from xtuner.chat import HFLlavaBot, LlavaChat
bot = HFLlavaBot(
    'internlm/internlm2-chat-7b',
    'xtuner/llava-internlm2-7b',
    'openai/clip-vit-large-patch14-336')

image1 = 'https://llava.hliu.cc/file=/nobackup/haotian/code/LLaVA_dev/llava/serve/examples/extreme_ironing.jpg'
image2 = 'https://llava.hliu.cc/file=/nobackup/haotian/code/LLaVA_dev/llava/serve/examples/waterview.jpg'
llava_bot = LlavaChat(bot, image1, chat_template=template)

## Chat
print(llava_bot.chat('What is unusual about this image?'))

## Streaming output
streamer = llava_bot.create_streamer()
llava_bot.chat('What is unusual about this image?', streamer=streamer)

## Iterable streamer (for gradio)
streamer = llava_bot.create_streamer(iterable=True)

from threading import Thread
chat_kwargs = dict(text='What is unusual about this image?', streamer=streamer)
thread = Thread(target=llava_bot.chat, kwargs=chat_kwargs)
thread.start()

for new_text in streamer:
    print(new_text, flush=True, end='')

## Clear history
llava_bot.reset_history()

## Swap the image
llava_bot.reset_image(image2)
print(llava_bot.chat('What are the things I should be cautious about when I visit here?'))
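
Since one of the stated motivations is making gradio apps easier to build on xtuner, here is a minimal sketch of wiring the iterable streamer into gradio's ChatInterface, assuming the BaseChat API shown above; gradio itself is not part of this PR.

import gradio as gr
from threading import Thread

from xtuner.chat import BaseChat, CHAT_TEMPLATE, HFBot

template = CHAT_TEMPLATE['internlm2-chat']
gradio_bot = BaseChat(HFBot('internlm/internlm2-chat-7b'),
                      chat_template=template)

def respond(message, history):
    # Run generation in a background thread and yield the partial text,
    # which gradio renders as a streaming reply.
    streamer = gradio_bot.create_streamer(iterable=True)
    thread = Thread(target=gradio_bot.chat,
                    kwargs=dict(text=message, streamer=streamer))
    thread.start()
    response = ''
    for new_text in streamer:
        response += new_text
        yield response

gr.ChatInterface(respond).launch()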


TODO

  • Test HF Chat
  • Test LMDeploy Chat
  • Test vLLM Chat
  • Test HF Predict
  • Test LMDeploy Predict
  • Test vLLM Predict
  • Test HF Moss Chat
  • Test LMDeploy Moss Chat (w/o adapter)
  • Test HF Lagent Chat
  • Test HF Llava Chat

New Args

  1. repetition-penalty
  2. lmdeploy (LMDeploy)
  3. dynamic-ntk (LMDeploy)
  4. logn-attn (LMDeploy)
  5. rope_scaling_factor (LMDeploy)
  6. batch-size (LMDeploy)
  7. predict, the path of a file to run offline prediction on (see the example below)
  8. predict-repeat
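
An illustrative invocation combining several of the new flags; the file name questions.txt and the values shown are placeholders, not defaults confirmed by this PR.

# Illustrative only: offline prediction through the LMDeploy backend
python xtuner/tools/new_chat.py internlm/internlm-chat-7b --lmdeploy \
  --batch-size 8 \
  --repetition-penalty 1.1 \
  --predict questions.txt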

BC-Breakings

  1. Remove torch-dtype
  2. Remove offload-folder
  3. Remove no-streamer (non-streaming is now the only supported mode)

@pppppM pppppM marked this pull request as draft January 15, 2024 07:08
@pppppM pppppM changed the title [Refactor & Feature] Refactor xtuner chat to support lmdeploy [Refactor & Feature] Refactor xtuner chat to support lmdeploy &vLLM Jan 16, 2024
chynphh commented Mar 14, 2024

@pppppM Is this usable yet?
