
Model wrapper with local fine-tuning #169

Closed · wants to merge 23 commits

Conversation

zyzhang1130
Contributor

@zyzhang1130 zyzhang1130 commented Apr 19, 2024



Description

Added features to download models from the Hugging Face model hub, load local Hugging Face models, and fine-tune loaded models with Hugging Face datasets. Model loading and fine-tuning can happen both at the initialization stage and after the agent has been initialized (see the README in `agentscope/examples/conversation_with_agent_with_finetuned_model` for details). Major changes to the repo include the example script `conversation_with_agent_with_finetuned_model.py`, a new model wrapper `HuggingFaceWrapper` in `agentscope/examples/conversation_with_agent_with_finetuned_model/huggingface_model.py`, and a new agent type `Finetune_DialogAgent` in `agentscope/examples/conversation_with_agent_with_finetuned_model/finetune_dialogagent.py`. All changes are contained in the new example directory `agentscope/examples/conversation_with_agent_with_finetuned_model`.

To test, run `agentscope/examples/conversation_with_agent_with_finetuned_model/conversation_with_agent_with_finetuned_model.py`, following the instructions in the README in the same directory.
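The flow described above can be sketched with stand-ins for the real classes. The names `HuggingFaceWrapper`, `Finetune_DialogAgent`, and `fine_tune` follow the PR description; the bodies below are placeholders, not the PR's implementation:

```python
# Stand-ins for the PR's classes; bodies are illustrative placeholders.

class HuggingFaceWrapper:
    """Loads a model and can fine-tune it, at init time or later."""

    def __init__(self, model_id, fine_tune_config=None):
        self.model_id = model_id
        self.fine_tuned = False
        if fine_tune_config is not None:
            # fine-tune at the initialization stage
            self.fine_tune(fine_tune_config)

    def fine_tune(self, fine_tune_config):
        # A real implementation would run a Hugging Face Trainer here.
        self.fine_tuned = True


class Finetune_DialogAgent:
    """Agent that also exposes fine-tuning after it has been initialized."""

    def __init__(self, model):
        self.model = model

    def fine_tune(self, fine_tune_config):
        self.model.fine_tune(fine_tune_config)


agent = Finetune_DialogAgent(
    HuggingFaceWrapper("openlm-research/open_llama_3b_v2"),
)
agent.fine_tune({"num_train_epochs": 1})  # fine-tune after initialization
```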

Checklist

Please check the following items before the code is ready for review.

  • Code has passed all tests
  • Docstrings have been added/updated in Google Style
  • Documentation has been updated
  • Code is ready for review

…d local hugging face model and finetune loaded model with hugging face dataset

Added features to download models from the Hugging Face model hub, load local Hugging Face models, and fine-tune loaded models with Hugging Face datasets. Model loading and fine-tuning can happen both at the initialization stage and after the agent has been initialized (see the README in `agentscope/examples/load_finetune_huggingface_model` for details). Major changes to the repo include the example script `load_finetune_huggingface_model`, a new model wrapper `HuggingFaceWrapper`, and a new agent type `Finetune_DialogAgent`. All changes are contained in the new example directory `agentscope/examples/load_finetune_huggingface_model`.
Made customized hyperparameter specification available from `model_configs` for fine-tuning at initialization, or through `fine_tune_config` in `Finetune_DialogAgent`'s `fine_tune` method after initialization.
Collaborator

@ZiTao-Li ZiTao-Li left a comment


See the inline comments. Also, you can run `pre-commit run --all-files` to check whether your coding style satisfies our repo's requirements.

# Decode the generated tokens to a string
generated_text = self.tokenizer.decode(
    outputs[0][input_ids.shape[1]:],
    skip_special_tokens=True,
)

return ModelResponse(text=generated_text, raw={"model_id": self.model_id})
Collaborator


To be consistent with the other model wrappers, the 'raw' should contain the generated_text as well. Also, the
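A minimal sketch of the suggested change (the `ModelResponse` here is a stand-in dataclass, not the real agentscope class, and `build_response` is a hypothetical helper):

```python
from dataclasses import dataclass, field


@dataclass
class ModelResponse:  # stand-in for agentscope's ModelResponse
    text: str
    raw: dict = field(default_factory=dict)


def build_response(generated_text, model_id):
    # Put generated_text into `raw` too, matching the other wrappers.
    return ModelResponse(
        text=generated_text,
        raw={"model_id": model_id, "generated_text": generated_text},
    )
```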

Contributor Author


Updated. What was the second point?

@zyzhang1130
Contributor Author

Need to disable E203 for the pre-commit run. It automatically reformats `examples/load_finetune_huggingface_model/huggingface_model.py` line 147 by adding whitespace before `:` and then flags it as an error.
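E203 (whitespace before `:`) is a known conflict between auto-formatters' slice handling and pycodestyle. Assuming the lint hook is flake8, one way to disable it is via the flake8 config (the exact file, `.flake8` or a `[flake8]` section in `setup.cfg`, depends on the repo's setup):

```ini
[flake8]
extend-ignore = E203
```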

Collaborator

@ZiTao-Li ZiTao-Li left a comment


examples/load_finetune_huggingface_model/README.md: 3 inline comments (outdated, resolved)
Collaborator

@ZiTao-Li ZiTao-Li left a comment


As a better practice, please create new branches (in your forked repo) for the updates on reformatting example tasks (one branch per example). Please do not let the changes affect each other.

@ZiTao-Li ZiTao-Li changed the title Updated pull request (all changes made are now in agentscope/examples/load_finetune_huggingface_model) Model wrapper with local fine-tuning May 6, 2024
Collaborator

@ZiTao-Li ZiTao-Li left a comment


The description of this PR seems outdated; please update it accordingly.

@zyzhang1130
Contributor Author

The Description of this PR seems outdated, please update it accordingly.

Done.

Collaborator

@DavdGao DavdGao left a comment


Please see inline comments, and:

  1. It's not clear who is responsible for fine-tuning the model: the agent or the model wrapper? Since the new model configuration has all the required arguments for fine-tuning, why don't we implement the fine-tuning functionality entirely in the new model wrapper class (e.g. in the constructor)? That way, all other agents in AgentScope can reuse the model wrapper and fine-tune their local models without modification.

self.model = AutoModelForCausalLM.from_pretrained(
    model_id,
    token=self.huggingface_token,
    device_map="auto",
)
Collaborator


Since the model is loaded with device_map set to "auto", will there be any conflict with the attribute self.device (and the device argument in the constructor)?

Contributor Author


I presumed fine-tuning of LLMs happens on GPUs, since it is (as far as I'm aware) infeasible on CPU. self.device is meant for inference only.

Contributor Author


This part is updated: if the user didn't specify device in model_configs, device_map="auto" by default; otherwise device_map is set to the user-specified device (see 8685213).
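The described fallback is simple enough to sketch (`resolve_device_map` is a hypothetical helper; the real change is in commit 8685213):

```python
def resolve_device_map(model_config):
    """Return the user-specified device, or "auto" when none is given."""
    device = model_config.get("device")
    return "auto" if device is None else device

# The result would then be passed on, e.g.:
# AutoModelForCausalLM.from_pretrained(model_id, device_map=resolve_device_map(cfg))
```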

"config_name": "my_custom_model",
# Or another generative model of your choice.
# Needed for loading from Hugging Face.
"model_id": "openlm-research/open_llama_3b_v2",
Collaborator


How about using a single argument model_name_or_path, as the transformers library does in from_pretrained, rather than providing two arguments?
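A sketch of the suggested config shape (the key name is assumed to mirror `from_pretrained`):

```python
model_config = {
    "config_name": "my_custom_model",
    # A single key that accepts either a Hub ID or a local path,
    # as transformers' from_pretrained does.
    "model_name_or_path": "openlm-research/open_llama_3b_v2",
}
```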

Contributor Author


Changed according to suggestion.

],
)

# # alternatively can load `model_configs` from json file
Collaborator


Suggested removing lines 56-59.

Contributor Author


I wanted to give the user the flexibility to specify the needed arguments in a JSON file, but also to provide some explanation, which cannot be done in a JSON file. So there are two identical model_configs: one in `agentscope/examples/conversation_with_agent_with_finetuned_model/conversation_with_agent_with_finetuned_model.py` and the other in `agentscope/examples/conversation_with_agent_with_finetuned_model/configs/model_configs.json`.
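Loading the JSON variant might look like this (a sketch; `load_model_configs` is a hypothetical helper, and `agentscope.init` is the entry point used in the example script):

```python
import json


def load_model_configs(path):
    """Read model_configs from a JSON file instead of the inline dict."""
    with open(path, encoding="utf-8") as f:
        return json.load(f)

# model_configs = load_model_configs("configs/model_configs.json")
# agentscope.init(model_configs=model_configs)
```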

)

dialog_agent.load_model(
model_id="openlm-research/open_llama_3b_v2",
Collaborator


The design is a little strange, and the interface should be more concise.

  1. We have already specified the model configuration in agentscope.init and provided model_config_name in line 66, so why do we have to set model_id again in lines 70 and 74?
  2. Similar issue with the dataset name in lines 40 and 86.

Contributor Author


This is an optional step: the user can choose to load another model after the agent has been instantiated, depending on their use case. Added a comment to clarify its purpose.

time_string = now.strftime("%Y-%m-%d_%H-%M-%S")

# Specify the filename
log_name_temp = model.config.name_or_path.split("/")[-1]
Collaborator

@DavdGao DavdGao May 6, 2024


Maybe we should support a user-customized saving directory.

Contributor Author


Added a new argument `output_dir` for users to customize the saving directory. By default it will save to the same example directory if left unspecified.
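Combining the timestamped filename from the snippet above with an `output_dir` argument might look like this (a sketch; `build_save_path` and its default-directory logic are assumptions, not the PR's code):

```python
import os
from datetime import datetime


def build_save_path(name_or_path, output_dir=None):
    """Build a timestamped save path; default to the current directory."""
    time_string = datetime.now().strftime("%Y-%m-%d_%H-%M-%S")
    model_name = name_or_path.split("/")[-1]
    base = output_dir if output_dir is not None else "."
    return os.path.join(base, f"{model_name}_{time_string}")
```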

@zyzhang1130
Contributor Author

Please see inline comments, and:

1. It's not clear who is responsible to fine-tune the model, the agent or model wrapper? Since the new model configuration has all required arguments for fine-tuning, why don't we just implement the fine-tuning functionality all in the new model wrapper class (e.g. the constructor function)? So that all other agent in AgentScope can re-use the model wrapper and fine-tune their local models without modifications.

Reloading and fine-tuning models after an agent has been instantiated requires introducing a new method to the agent class. If this is not required, this model wrapper in principle supports loading and fine-tuning for other types of agents.

@DavdGao
Collaborator

DavdGao commented May 6, 2024

Please see inline comments, and:

1. It's not clear who is responsible to fine-tune the model, the agent or model wrapper? Since the new model configuration has all required arguments for fine-tuning, why don't we just implement the fine-tuning functionality all in the new model wrapper class (e.g. the constructor function)? So that all other agent in AgentScope can re-use the model wrapper and fine-tune their local models without modifications.

Reloading and fine-tuning models after an agent has been instantiated need to introduce a new method to the agent class. If this is not required, this model wrapper in principle supports loading and fine-tuning for other types of agents.

Yes, and my question is why we need to reload and fine-tune models within the agent. In my view, once we create a Hugging Face model wrapper, we have already decided to fine-tune the model, so the training can be finished automatically within the constructor of the model wrapper, as follows:

class HuggingFaceWrapper(ModelWrapperBase):
    def __init__(self, *args, **kwargs):
        # load model
        self.model = self.load_model(xxx)

        # fine tuning
        self.fine_tune_model()

    # ...

In this way, the agent doesn't need to do anything, and developers only need to set their model configuration.
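A runnable stand-in for this constructor-driven design (the base class and method bodies are placeholders, not agentscope code):

```python
class ModelWrapperBase:  # stand-in for agentscope's base class
    pass


class HuggingFaceWrapper(ModelWrapperBase):
    def __init__(self, model_name_or_path, fine_tune_config=None):
        # load model
        self.model = self.load_model(model_name_or_path)
        # fine-tune inside the constructor, driven purely by the config
        if fine_tune_config is not None:
            self.fine_tune_model(fine_tune_config)

    def load_model(self, model_name_or_path):
        return {"name_or_path": model_name_or_path}  # placeholder for from_pretrained

    def fine_tune_model(self, fine_tune_config):
        self.model["fine_tuned"] = True  # placeholder for a Trainer run
```

With this shape, agents never call fine-tuning code themselves; everything is decided by the model configuration passed to the wrapper.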

@zyzhang1130
Contributor Author

zyzhang1130 commented May 6, 2024

Please see inline comments, and:

1. It's not clear who is responsible to fine-tune the model, the agent or model wrapper? Since the new model configuration has all required arguments for fine-tuning, why don't we just implement the fine-tuning functionality all in the new model wrapper class (e.g. the constructor function)? So that all other agent in AgentScope can re-use the model wrapper and fine-tune their local models without modifications.

Reloading and fine-tuning models after an agent has been instantiated need to introduce a new method to the agent class. If this is not required, this model wrapper in principle supports loading and fine-tuning for other types of agents.

Yes, and my question is that why we need to reloading and fine-tuning models within the agent? In my view, once we create a huggingface model wrapper, we have already decided to fine-tune the model. So that the training can be finished automatically within the constructor function of the model wrapper as follows?

class HuggingFaceWrapper(ModelWrapperBase):
     def __init__(self, *args, **kwargs): 
         # load model
         self.model = self.load_model(xxx)

         # fine tuning
         self.fine_tune_model()

    # ...

In this way, the agent doesn't need to do anything, and developers only need to set their model configuration.

I have a long-term use case in mind, where new data might become available after the agent has been deployed (i.e., a continual learning setting). In this case, the agent might need to be fine-tuned (potentially multiple times) after deployment. One such example we touched on during the discussion is fine-tuning agents on their interaction traces in a multi-agent setup. Perhaps I can rename the agent class to better reflect this use case?

@DavdGao
Collaborator

DavdGao commented May 6, 2024

Please see inline comments, and:

1. It's not clear who is responsible to fine-tune the model, the agent or model wrapper? Since the new model configuration has all required arguments for fine-tuning, why don't we just implement the fine-tuning functionality all in the new model wrapper class (e.g. the constructor function)? So that all other agent in AgentScope can re-use the model wrapper and fine-tune their local models without modifications.

Reloading and fine-tuning models after an agent has been instantiated need to introduce a new method to the agent class. If this is not required, this model wrapper in principle supports loading and fine-tuning for other types of agents.

Yes, and my question is that why we need to reloading and fine-tuning models within the agent? In my view, once we create a huggingface model wrapper, we have already decided to fine-tune the model. So that the training can be finished automatically within the constructor function of the model wrapper as follows?

class HuggingFaceWrapper(ModelWrapperBase):
     def __init__(self, *args, **kwargs): 
         # load model
         self.model = self.load_model(xxx)

         # fine tuning
         self.fine_tune_model()

    # ...

In this way, the agent doesn't need to do anything, and developers only need to set their model configuration.

I have a long-term use case scenario in mind, where there might be new data becoming available after the agent has been deployed (e.g., continual learning setting). In this case, the agent might need to be fine-tuned (potentially multiple times) after deployment. Perhaps I can rename the agent class to better reflect this use case?

Okay, I understand your consideration. For me, agent.model.fine_tune() in a continual learning setting is also acceptable. In any case, please ensure the other agents can directly use this Hugging Face model configuration without modifying their code. Others, please see the inline comments.

@zyzhang1130 zyzhang1130 deleted the branch modelscope:main May 21, 2024 09:33
@zyzhang1130 zyzhang1130 deleted the main branch May 21, 2024 09:33
@zyzhang1130 zyzhang1130 restored the main branch May 21, 2024 09:34
@zyzhang1130
Contributor Author

Further updates to this pull request can be found in #240, as it was moved to a new branch.
