Initialization for LoRA weights A and B initialized #1728

sanaullah-06 · 2024-05-13T15:01:10Z

System Info

The comment and code are contradictory please anyone explain it to me.

Who can help?

@BenjaminBossan

Information

The official example scripts
My own modified scripts

Tasks

An officially supported task in the examples folder
My own task or dataset (give details below)

Reproduction

nn.init.zeros_(self.lora_embedding_B[adapter_name])
nn.init.normal_(self.lora_embedding_A[adapter_name])

Expected behavior

nil.

The text was updated successfully, but these errors were encountered:

BenjaminBossan · 2024-05-13T15:23:05Z

You're right, the comment doesn't match the code. After a quick glance at the LoRA paper, I don't see an explicit mention of how LoRA should be initialized for embedding layers. When checking the reference implementation by Microsoft, they, do, however, use the same scheme as we do:

https://github.com/microsoft/LoRA/blob/4c0333854cb905966f8cc4e9a74068c1e507c7b7/loralib/layers.py#L55-L60

Therefore, I think the code is correct.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Initialization for LoRA weights A and B initialized #1728

Initialization for LoRA weights A and B initialized #1728

sanaullah-06 commented May 13, 2024

BenjaminBossan commented May 13, 2024

Initialization for LoRA weights A and B initialized #1728

Initialization for LoRA weights A and B initialized #1728

Comments

sanaullah-06 commented May 13, 2024

System Info

Who can help?

Information

Tasks

Reproduction

Expected behavior

BenjaminBossan commented May 13, 2024