oliverbob/phi2-fine

Fine-tune Phi-2

Code for Medium story "Fine-tune Microsoft’s Phi-2 with QLoRA and synthetic data":

  • Create a synthetic dataset from a seed of instructions
  • Fine-tune Phi-2 using QLoRA

Notebooks

  • nb_dataset.ipynb: Create a synthetic conversational dataset using a seed of riddles
  • nb_qlora.ipynb: Fine-tune Phi-2 using QLoRA
  • nb_qlora_colab.ipynb: A hacky version that enables gradient checkpointing and uses hyperparameters chosen for low VRAM usage, so it runs on a free Google Colab T4 instance

qlora.py: Train on multiple GPUs with accelerate

Overview

  1. Setup and Initialization: Import necessary libraries, set up Weights and Biases (wandb) for tracking, and initialize a unique run identifier.

  2. Configuration and Seeds: Set the seed for reproducibility and configure model and dataset paths, learning rate, batch sizes, epochs, and maximum token length.

  3. LoRA Configuration: Define the Low-Rank Adaptation (LoRA) configuration for efficient model fine-tuning (steps 3 to 5 are sketched in code after this list).

  4. Model Preparation: Load the Phi-2 model with quantization settings for 4-bit training, and resize token embeddings to accommodate new special tokens.

  5. Tokenizer Preparation: Load and configure the tokenizer, adding necessary special tokens for ChatML formatting.

  6. Dataset Loading and Preparation: Load the dataset from Hugging Face, split it into training and test sets, and apply ChatML formatting and tokenization.

  7. Data Collation: Define a collation function to transform individual data samples into batched data suitable for training.

  8. Training Configuration: Set up training arguments with specified hyperparameters like batch sizes, learning rate, and gradient accumulation steps (steps 6 to 10 are sketched in code after the parameter block below).

  9. Trainer Initialization: Initialize the Trainer object with the model, tokenizer, training arguments, and data collator.

  10. Training Execution: Launch the training process, optionally with Weights and Biases tracking for the main process.
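
The notebooks and qlora.py implement these steps. As a rough illustration of steps 3 to 5, a minimal sketch (not the repository's exact code) of loading Phi-2 in 4-bit, preparing the tokenizer for ChatML, and attaching LoRA adapters with transformers, bitsandbytes and peft could look like this; the LoRA hyperparameters and target module names are assumptions:

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

modelpath = "microsoft/phi-2"

# 4-bit quantization settings (NF4 quantization, bfloat16 compute) for QLoRA
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
    bnb_4bit_use_double_quant=True,
)

model = AutoModelForCausalLM.from_pretrained(
    modelpath,
    quantization_config=bnb_config,
    torch_dtype=torch.bfloat16,
    device_map="auto",
)
model = prepare_model_for_kbit_training(model)

# Tokenizer with the ChatML special tokens <|im_start|> and <|im_end|>
tokenizer = AutoTokenizer.from_pretrained(modelpath)
tokenizer.add_tokens(["<|im_start|>"])
tokenizer.add_special_tokens({"eos_token": "<|im_end|>", "pad_token": "<PAD>"})
model.resize_token_embeddings(len(tokenizer))

# LoRA configuration; target modules are a guess at Phi-2's attention/MLP layers
lora_config = LoraConfig(
    r=32,
    lora_alpha=32,
    lora_dropout=0.1,
    bias="none",
    task_type="CAUSAL_LM",
    target_modules=["q_proj", "k_proj", "v_proj", "dense", "fc1", "fc2"],
    modules_to_save=["embed_tokens", "lm_head"],  # embeddings were resized above
)
model = get_peft_model(model, lora_config)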

Set parameters

modelpath="microsoft/phi-2"
dataset_name="g-ronimo/riddles_evolved"
lr=0.00002		# low but works for this dataset
bs=1			# batch size for training
bs_eval=16		# batch size for evals
ga_steps=16		# gradient acc. steps
epochs=20		# dataset is small, many epochs needed
max_length=1024		# samples will be cut beyond this number of tokens
output_dir="out"
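
Combined with the model and tokenizer from the sketch above, these parameters feed into dataset preparation and the Trainer (overview steps 6 to 10) roughly as follows; the dataset field names and the ChatML formatting below are assumptions, not the repository's exact code:

import torch
from datasets import load_dataset
from transformers import Trainer, TrainingArguments

# Load the dataset from Hugging Face and split into train/test
dataset = load_dataset(dataset_name)["train"].train_test_split(test_size=0.1)

def format_and_tokenize(sample):
    # Wrap every turn in ChatML tags, then tokenize;
    # "conversation", "role" and "content" are assumed field names
    text = "".join(
        f"<|im_start|>{turn['role']}\n{turn['content']}<|im_end|>\n"
        for turn in sample["conversation"]
    )
    return tokenizer(text, truncation=True, max_length=max_length)

dataset_tokenized = dataset.map(
    format_and_tokenize, remove_columns=dataset["train"].column_names
)

def collate(batch):
    # Pad all samples in a batch to the same length; padding is masked out of the loss
    longest = max(len(x["input_ids"]) for x in batch)
    input_ids, labels, attention_masks = [], [], []
    for x in batch:
        pad = longest - len(x["input_ids"])
        input_ids.append(x["input_ids"] + [tokenizer.pad_token_id] * pad)
        labels.append(x["input_ids"] + [-100] * pad)
        attention_masks.append([1] * len(x["input_ids"]) + [0] * pad)
    return {
        "input_ids": torch.tensor(input_ids),
        "labels": torch.tensor(labels),
        "attention_mask": torch.tensor(attention_masks),
    }

args = TrainingArguments(
    output_dir=output_dir,
    per_device_train_batch_size=bs,
    per_device_eval_batch_size=bs_eval,
    gradient_accumulation_steps=ga_steps,
    learning_rate=lr,
    num_train_epochs=epochs,
    logging_steps=1,
    bf16=True,            # assumes an Ampere-class GPU; use fp16 on older cards
    report_to="wandb",    # optional Weights and Biases tracking
)

trainer = Trainer(
    model=model,
    tokenizer=tokenizer,
    args=args,
    data_collator=collate,
    train_dataset=dataset_tokenized["train"],
    eval_dataset=dataset_tokenized["test"],
)
trainer.train()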

To run qlora.py, please install all requirements with:

pip install -r requirements.txt

Run

accelerate launch qlora.py
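
On a machine with more than one GPU, accelerate picks up the configuration created with accelerate config; the number of processes can also be set explicitly on the command line, for example:

accelerate launch --num_processes 2 qlora.py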

About

An open repository on fine-tuning Phi-2.
