
Refactor load weights #1603

Open

grimoire wants to merge 7 commits into base: main
Conversation

@grimoire (Collaborator) commented May 16, 2024

Optimize TP (tensor parallel) model loading.

requirement
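
Background, not part of the PR description above: in tensor-parallel (TP) loading, each rank materializes only its own slice of every weight rather than the full tensor. A minimal hypothetical sketch, with invented names:

```python
import torch


def shard_for_rank(full: torch.Tensor, rank: int, world_size: int,
                   dim: int = 0) -> torch.Tensor:
    """Return this tensor-parallel rank's slice of a full weight."""
    assert full.size(dim) % world_size == 0, 'weight must split evenly'
    return full.chunk(world_size, dim=dim)[rank].clone()


# Example: an (out_features=8, in_features=4) linear weight split over 2 ranks.
w = torch.randn(8, 4)
print(shard_for_rank(w, rank=0, world_size=2).shape)  # torch.Size([4, 4])
```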

@grimoire grimoire mentioned this pull request May 21, 2024
2 tasks
@grimoire grimoire marked this pull request as draft May 22, 2024 09:09
@grimoire grimoire marked this pull request as ready for review May 22, 2024 09:31
@lvhan028 lvhan028 requested a review from RunningLeon June 4, 2024 06:50
@RunningLeon (Collaborator):
Hi @zhulinJulia24, could you start a full-scope test of all PyTorch engine models using the daily_test CI? Thanks.

    rank=rank,
    world_size=world_size,
    prefix='query_key_value')
rowwise_parallelize_linear(self.dense,
Collaborator:
Much better than the previous version.
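
For context on this hunk: in the usual Megatron-style split, a column-parallel linear shards its weight along the output dimension and a row-parallel linear along the input dimension (its partial outputs then need an all-reduce). A rough sketch of the idea, with illustrative helpers rather than lmdeploy's actual rowwise_parallelize_linear implementation:

```python
import torch
from torch import nn


def colwise_shard_(linear: nn.Linear, rank: int, world_size: int) -> None:
    """Column parallel: split out_features, i.e. dim 0 of the (out, in) weight."""
    w = linear.weight.data.chunk(world_size, dim=0)[rank].clone()
    linear.weight = nn.Parameter(w)
    if linear.bias is not None:
        b = linear.bias.data.chunk(world_size, dim=0)[rank].clone()
        linear.bias = nn.Parameter(b)


def rowwise_shard_(linear: nn.Linear, rank: int, world_size: int) -> None:
    """Row parallel: split in_features (dim 1 of the weight); each rank's
    partial output is summed across ranks with an all-reduce afterwards."""
    w = linear.weight.data.chunk(world_size, dim=1)[rank].clone()
    linear.weight = nn.Parameter(w)
```

In the hunk above, self.dense is sharded row-wise, the standard choice for the attention output projection; the preceding query_key_value call is presumably the column-parallel counterpart.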

logger = get_logger('lmdeploy')


def _get_weight_type(model_path: str, use_safetensors: bool = None):
Collaborator:
use_safetensors can be {True, False, None}. Why not True or False?
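
One common reason for the tri-state, offered here as an assumption rather than as the author's recorded answer: None usually means "auto-detect from the files in the checkpoint directory", while True/False force a format. A hypothetical version of that pattern (not the actual lmdeploy code):

```python
import os


def _get_weight_type(model_path: str, use_safetensors: bool = None) -> str:
    """Hypothetical re-implementation of the tri-state pattern."""
    has_st = any(f.endswith('.safetensors') for f in os.listdir(model_path))
    if use_safetensors is None:        # auto-detect: prefer safetensors
        return 'safetensors' if has_st else 'pytorch'
    if use_safetensors and not has_st:
        raise FileNotFoundError(
            f'use_safetensors=True but no .safetensors files in {model_path}')
    return 'safetensors' if use_safetensors else 'pytorch'
```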

for name, param in mod.named_parameters(recurse=False):
    dtype = param.dtype
    if not loader.has(name):
        logger.debug(f'rank [{rank}]'
Collaborator:
How would this condition be triggered?

Collaborator (Author):
Some models share the token embedding weight (weight tying), so they do not save the redundant weight in the checkpoint.
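
Concretely, when an output head ties its weight to the token embedding, the checkpoint stores that tensor only once, so loader.has(name) can be False for a perfectly valid model. A small illustration with an invented module (not the PR's code):

```python
import torch
from torch import nn


class TinyLM(nn.Module):
    """Toy model with tied weights: the head reuses the embedding matrix."""

    def __init__(self, vocab: int = 100, dim: int = 16):
        super().__init__()
        self.embed = nn.Embedding(vocab, dim)
        self.lm_head = nn.Linear(dim, vocab, bias=False)
        self.lm_head.weight = self.embed.weight   # weight tying


model = TinyLM()
state = {'embed.weight': torch.randn(100, 16)}    # checkpoint stores it once

# 'lm_head.weight' is absent -- the "not loader.has(name)" case above --
# yet the model is complete once 'embed.weight' is loaded, because both
# names refer to the same tensor.
print([k for k in model.state_dict() if k not in state])   # ['lm_head.weight']
model.load_state_dict(state, strict=False)
```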

@@ -160,204 +157,3 @@ def sync_qparam_to_context(context: Any, layer_id: str, qparams: dict):
        context.set_output(layer_id, last_qparam)
    else:
        context.set_output(layer_id, qparams)


@torch.no_grad()
Collaborator:
Was this used anywhere before?

Collaborator (Author):
Almost never.
