
Add LoRA+ implementation #1509

Open · wants to merge 15 commits into base: main
Conversation

@moghadas76 (Author)

@BenjaminBossan (Member) commented Feb 26, 2024:

Duplicate of #1504 :)

Sorry about closing (wrong button).

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@moghadas76 (Author):

What was the conclusion in that issue?

@BenjaminBossan (Member):

No conclusion yet, we want to wait and see if the performance gains are indeed robust. Regarding your code, it's basically just a giant string with the code, right? Was that the intent?

@moghadas76 (Author):

I'm waiting for you to say whether I should implement a new Trainer object or not.

@BenjaminBossan (Member):

Hey, after some discussion, I think we can proceed with this project. Let's add the create_loraplus_optimizer function but not the custom trainer class. We can put the function inside of peft/helpers.py.

Some considerations:

  • Add a reference to the original repo
  • Update the docs
  • Remove the logger code
  • If you feel up for the task, let's add some unit tests.
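For reference, a minimal usage sketch of the proposed helper, assuming the signature that appears later in this PR's diff (create_loraplus_optimizer(model, optimizer_cls, optimizer_kwargs, loraplus_lr_embedding=...)) and the peft.optimizers import path it eventually lands in; model name and hyperparameters are placeholders:

import torch
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model
from peft.optimizers import create_loraplus_optimizer  # final location per this PR

# Wrap a base model with LoRA adapters (placeholder model and config values).
base_model = AutoModelForCausalLM.from_pretrained("gpt2")
peft_model = get_peft_model(base_model, LoraConfig(r=8, lora_alpha=16, task_type="CAUSAL_LM"))

# LoRA+ gives the lora_B (and embedding) parameters a different learning rate
# than lora_A; "loraplus_lr_ratio" is popped from optimizer_kwargs by the helper.
optimizer = create_loraplus_optimizer(
    model=peft_model,
    optimizer_cls=torch.optim.AdamW,
    optimizer_kwargs={"lr": 5e-5, "weight_decay": 0.0, "loraplus_lr_ratio": 16},
    loraplus_lr_embedding=1e-6,
)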

@BenjaminBossan (Member):

@moghadas76 do you still plan on working on this?

@moghadas76 (Author) commented Mar 12, 2024 via email

@BenjaminBossan (Member):

Great, thanks. On top of what I mentioned, let's also move this to a new file. I'm thinking src/peft/optimizers/loraplus.py. The idea here is that we want to add more optimizer-related methods in the future, so it makes sense to choose a proper file structure right away.
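The resulting layout would roughly be as follows; the re-export line mirrors the src/peft/optimizers/__init__.py change visible in the diff further down this thread:

# src/peft/optimizers/__init__.py re-exports the helper:
#     from .loraplus import create_loraplus_optimizer
#
# src/peft/optimizers/loraplus.py holds the implementation, so callers can write:
from peft.optimizers import create_loraplus_optimizer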

@moghadas76 (Author):

Please review my code

@BenjaminBossan (Member) left a comment:

Thanks for working on this. It is a good start but there are a few issues, please check my comments. On top of that, could you please move the function out of helpers.py into a separate module, as I mentioned above?

> Great, thanks. On top of what I mentioned, let's also move this to a new file. I'm thinking src/peft/optimizers/loraplus.py. The idea here is that we want to add more optimizer-related methods in the future, so it makes sense to choose a proper file structure right away.

Moreover, it would be great to document this function in our PEFT docs, but it would be fine to do that in a follow-up PR.

Finally, please run make style on your changes.

Review threads: src/peft/tuners/lora/config.py (1), src/peft/helpers.py (6), tests/test_loraplus_helper.py (1)
@BenjaminBossan (Member):

@moghadas76 Do you still plan on working on this?

@moghadas76 (Author) commented Mar 25, 2024 via email

@BenjaminBossan (Member):

> Yes, I'll fix the comments tonight

Thanks. No need to rush, I just wanted to inquire if you're still on it :)

@BenjaminBossan (Member):

@moghadas76 LMK once you're finished with your changes and want me to do another review.

@BenjaminBossan (Member):

Gentle ping @moghadas76

@moghadas76 (Author):

Hi,
I fixed the comments. Please review again.

@moghadas76 (Author):

@BenjaminBossan

@BenjaminBossan (Member):

Sorry for the delay, I was at a conference, will review soon.

@BenjaminBossan (Member) left a comment:

Thanks for making the adjustments, this already looks quite good. I still found a few minor areas for improvement, which I commented on. Also, as mentioned in my earlier comment, could you please move the code to a different file?

Review threads: src/peft/utils/peft_types.py (1), src/peft/helpers.py (2)
    }
    optim = create_loraplus_optimizer(model=model, optimizer_cls=optimizer_cls, optimizer_kwargs=optim_config)
    assert optim is not None
    assert len(optim.param_groups) == 4
@BenjaminBossan (Member):

Can we do some more checks here?

@moghadas76 (Author):

Done... another test added.

@BenjaminBossan (Member):

Would it make sense to also check the specific learning rates we get for the different param groups?
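For illustration, such a check could look roughly like this. It is only a sketch meant to sit next to the existing tests in tests/test_loraplus_helper.py (reusing its SimpleNet model), and the expected values assume the groups receive lr, lr * loraplus_lr_ratio, and loraplus_lr_embedding respectively; that mapping is inferred from the docstring and diff below, not spelled out in the PR:

import bitsandbytes as bnb

from peft.optimizers import create_loraplus_optimizer


def test_lora_plus_param_group_learning_rates():
    model = SimpleNet()  # same toy model as the existing tests in this file
    optim_config = {
        "lr": 5e-5,
        "eps": 1e-6,
        "betas": (0.9, 0.999),
        "weight_decay": 0.0,
        "loraplus_lr_ratio": 0.2,
    }
    optim = create_loraplus_optimizer(
        model=model,
        optimizer_cls=bnb.optim.Adam8bit,
        optimizer_kwargs=optim_config,
        loraplus_lr_embedding=1e-6,
    )
    lrs = [group["lr"] for group in optim.param_groups]
    # Assumed mapping: base lr for the lora_A / "other" groups, lr scaled by
    # loraplus_lr_ratio for the lora_B groups, loraplus_lr_embedding for embeddings.
    assert 5e-5 in lrs
    assert 5e-5 * 0.2 in lrs
    assert 1e-6 in lrs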

@moghadas76 (Author):

@BenjaminBossan
May I ask for a final check? Best regards.

@BenjaminBossan (Member):

At the risk of repeating myself for the 5th time, could you please make the following change:

> Great, thanks. On top of what I mentioned, let's also move this to a new file. I'm thinking src/peft/optimizers/loraplus.py. The idea here is that we want to add more optimizer-related methods in the future, so it makes sense to choose a proper file structure right away.

@moghadas76 (Author):

Sorry... I missed that comment. I've fixed it

@BenjaminBossan (Member):

Hmm, code quality checks are still failing with:

tests/test_loraplus_helper.py:1:1: I001 [*] Import block is un-sorted or un-formatted

Is it possible that your local ruff version differs? CI uses v0.2.2.

@moghadas76 (Author):

You were right. My ruff version was old.

@BenjaminBossan (Member) left a comment:

Thanks for the updates. Our code style check still fails though, not sure what the reason is if you use the same ruff version. Here is the diff that I get when running ruff locally on your branch:

modified   src/peft/optimizers/__init__.py
@@ -17,4 +17,4 @@
 # See the License for the specific language governing permissions and
 # limitations under the License.
 
-from .loraplus import create_loraplus_optimizer
\ No newline at end of file
+from .loraplus import create_loraplus_optimizer
modified   src/peft/optimizers/loraplus.py
@@ -8,20 +8,24 @@ from transformers.trainer_pt_utils import get_parameter_names
 from ..peft_model import PeftModel
 
 
-def create_loraplus_optimizer(model: PeftModel, optimizer_cls: type[Optimizer], optimizer_kwargs: dict, loraplus_lr_embedding: float=1e-6) -> Optimizer:
+def create_loraplus_optimizer(
+    model: PeftModel, optimizer_cls: type[Optimizer], optimizer_kwargs: dict, loraplus_lr_embedding: float = 1e-6
+) -> Optimizer:
     """
-    Creates a LoraPlus optimizer.
-    Implementing LoRA+ https://arxiv.org/abs/2402.12354
-    Reference: https://github.com/nikhil-ghosh-berkeley/loraplus/
+    Creates a LoraPlus optimizer. Implementing LoRA+ https://arxiv.org/abs/2402.12354 Reference:
+    https://github.com/nikhil-ghosh-berkeley/loraplus/
 
     Args:
         model (`torch.nn.Module`): The model to be optimized.
         optimizer_cls (`torch.optim.Optimizer`): The optimizer class to be used.
         optimizer_kwargs (`dict`): Additional keyword arguments to be passed to the optimizer.
-            - **loraplus_lr_ratio** (`float`): The ratio of the learning rate to be used for the embedding layer. Defaults to loraplus_lr_ratio
-            - loraplus_lr_embedding (`float`): The learning rate to be used for the embedding layer. Defaults to loraplus_lr_embedding
+            - **loraplus_lr_ratio** (`float`): The ratio of the learning rate to be used for the embedding layer.
+              Defaults to loraplus_lr_ratio
+            - loraplus_lr_embedding (`float`): The learning rate to be used for the embedding layer. Defaults to
+              loraplus_lr_embedding
     """
     from ..tuners.lora.layer import Embedding
+
     loraplus_lr_ratio = optimizer_kwargs.pop("loraplus_lr_ratio")
 
     decay_parameters = get_parameter_names(model, ALL_LAYERNORM_LAYERS)
@@ -81,6 +85,7 @@ def create_loraplus_optimizer(model: PeftModel, optimizer_cls: type[Optimizer],
     optimizer = optimizer_cls(optimizer_grouped_parameters, **optimizer_kwargs)
     if optimizer_cls.__name__ == "Adam8bit":
         import bitsandbytes
+
         manager = bitsandbytes.optim.GlobalOptimManager.get_instance()
         for module in model.modules():
             if isinstance(module, nn.Embedding):
modified   tests/test_loraplus_helper.py
@@ -25,32 +25,37 @@ def test_lora_plus_helper_sucess():
     model = SimpleNet()
     optimizer_cls = bnb.optim.Adam8bit
     optim_config = {
-        'lr': 5e-5,
-        'eps': 1e-6,
-        'betas': (0.9, 0.999),
-        'weight_decay': 0.0,
+        "lr": 5e-5,
+        "eps": 1e-6,
+        "betas": (0.9, 0.999),
+        "weight_decay": 0.0,
         "loraplus_lr_ratio": 0.2,
     }
-    optim = create_loraplus_optimizer(model=model, optimizer_cls=optimizer_cls, optimizer_kwargs=optim_config, loraplus_lr_embedding=1e-6)
+    optim = create_loraplus_optimizer(
+        model=model, optimizer_cls=optimizer_cls, optimizer_kwargs=optim_config, loraplus_lr_embedding=1e-6
+    )
     assert optim is not None
     assert len(optim.param_groups) == 4
 
+
 def test_lora_plus_optimizer_sucess():
     optimizer_cls = bnb.optim.Adam8bit
     optim_config = {
-        'lr': 5e-5,
-        'eps': 1e-6,
-        'betas': (0.9, 0.999),
-        'weight_decay': 0.0,
+        "lr": 5e-5,
+        "eps": 1e-6,
+        "betas": (0.9, 0.999),
+        "weight_decay": 0.0,
         "loraplus_lr_ratio": 0.2,
     }
     model: SimpleNet = SimpleNet().cuda()
-    optim = create_loraplus_optimizer(model=model, optimizer_cls=optimizer_cls, optimizer_kwargs=optim_config, loraplus_lr_embedding=1e-6)
+    optim = create_loraplus_optimizer(
+        model=model, optimizer_cls=optimizer_cls, optimizer_kwargs=optim_config, loraplus_lr_embedding=1e-6
+    )
     loss = torch.nn.CrossEntropyLoss()
     bnb.optim.GlobalOptimManager.get_instance().register_parameters(model.parameters())
     x = torch.randint(100, (2, 4, 10)).cuda()
     output = model(x).permute(0, 3, 1, 2)
-    label = torch.randint(16, (2,4,10,)).cuda()
+    label = torch.randint(16, (2, 4, 10)).cuda()
     loss_value = loss(output, label)
     loss_value.backward()
     optim.step()

    assert len(optim.param_groups) == 4

def test_lora_plus_optimizer_sucess():
    optimizer_cls = bnb.optim.Adam8bit
@BenjaminBossan (Member):

Could you please add a short comment here of what is being tested?
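For example, a brief docstring along these lines would cover it (suggested wording only, not taken from the PR; the function name matches the test shown above):

def test_lora_plus_optimizer_sucess():
    """
    Check that the optimizer returned by create_loraplus_optimizer can run a
    full training step (forward, loss, backward, optimizer.step()) on GPU with
    the bitsandbytes Adam8bit optimizer.
    """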

