How to get the data seen by the model during training? #30886

jaydeepborkar · 2024-05-17T21:32:50Z

Hi! I haven't been able to find an answer to my question so opening an issue here. I'm fine-tuning the GPT-2 XL model using the trainer for 10 epochs and I'd like to save the data seen by the model during each epoch. More specifically, I want to save the data seen by the model every 242 steps. For instance, data seen from step 1 to step 242, step 243 to step 484, and so on until the end of the 10th epoch. I'm a bit confused about how to do this since the data is shuffled after each epoch. Is it possible to use TrainerCallback here?

These are my training args
training_args = TrainingArguments( f"models/XL", evaluation_strategy = "steps", learning_rate=2e-5, weight_decay=0.01, push_to_hub=False, num_train_epochs=10, per_device_train_batch_size=8, per_device_eval_batch_size=8, save_strategy="epoch", save_steps = 242, fp16=True, report_to="none", logging_strategy="steps", logging_steps=100, )

I'd appreciate any directions. Thanks :)

The text was updated successfully, but these errors were encountered:

amyeroberts · 2024-05-20T08:38:23Z

Hi @jaydeepborkar, thanks for raising an issue!

This is a question best placed in our forums. We try to reserve the github issues for feature requests and bug reports.

jaydeepborkar · 2024-05-20T17:26:28Z

Sure, thanks!

jaydeepborkar closed this as completed May 20, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to get the data seen by the model during training? #30886

How to get the data seen by the model during training? #30886

jaydeepborkar commented May 17, 2024

amyeroberts commented May 20, 2024

jaydeepborkar commented May 20, 2024

How to get the data seen by the model during training? #30886

How to get the data seen by the model during training? #30886

Comments

jaydeepborkar commented May 17, 2024

amyeroberts commented May 20, 2024

jaydeepborkar commented May 20, 2024