
Finetuning general LLM models from hugging face #521

Closed · bkhanal-11 opened this issue May 17, 2024 · 2 comments

@bkhanal-11

Thank you for your continuous support of the LLM open-source community.

I was wondering: if we use AutoModel, AutoConfig, and AutoTokenizer instead of LlamaForCausalLM, LlamaConfig, and LlamaTokenizer for general LLM fine-tuning, will the fine-tuning pipeline/recipe fail?
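In code, the swap the question describes looks roughly like this (a minimal sketch assuming the standard Hugging Face transformers API; the checkpoint name is illustrative and not from the repo):

```python
# Minimal sketch: loading a generic causal LM with the Auto* classes
# instead of the Llama-specific ones.
from transformers import AutoConfig, AutoModelForCausalLM, AutoTokenizer

model_name = "mistralai/Mistral-7B-v0.1"  # illustrative example checkpoint

config = AutoConfig.from_pretrained(model_name)
# Note: AutoModelForCausalLM (rather than plain AutoModel) is the generic
# counterpart of LlamaForCausalLM -- it attaches the language-modeling head
# needed for causal-LM fine-tuning.
model = AutoModelForCausalLM.from_pretrained(model_name)
tokenizer = AutoTokenizer.from_pretrained(model_name)
```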

@mreso (Contributor) commented May 20, 2024

Hi @bkhanal-11, for the single-GPU use case this might work, but for the advanced use cases with FSDP you'll need a different wrapping policy, and there might be other things to consider and adapt.
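For context, a hedged sketch of what a "different wrapping policy" could mean in practice: FSDP's transformer_auto_wrap_policy takes the model's transformer-block class, so a non-Llama model would need its own decoder-layer class passed in instead of LlamaDecoderLayer. The MistralDecoderLayer below is an illustrative assumption, not a statement of what the recipes support out of the box.

```python
# Sketch: swapping the FSDP auto-wrap policy's transformer layer class
# for a non-Llama model. The Mistral class is used purely for illustration.
import functools

from torch.distributed.fsdp import FullyShardedDataParallel as FSDP
from torch.distributed.fsdp.wrap import transformer_auto_wrap_policy
from transformers.models.llama.modeling_llama import LlamaDecoderLayer
from transformers.models.mistral.modeling_mistral import MistralDecoderLayer

# Llama-style policy: wrap each LlamaDecoderLayer as its own FSDP unit.
llama_policy = functools.partial(
    transformer_auto_wrap_policy,
    transformer_layer_cls={LlamaDecoderLayer},
)

# For a generic model, the layer class must match that model's architecture,
# e.g. MistralDecoderLayer here (illustrative assumption).
generic_policy = functools.partial(
    transformer_auto_wrap_policy,
    transformer_layer_cls={MistralDecoderLayer},
)

# Applied when sharding the model, e.g.:
# sharded_model = FSDP(model, auto_wrap_policy=generic_policy)
```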

@bkhanal-11 (Author)

That makes sense, thanks.
