Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question about the training result on 4 A100 with batch size = 36 #14

Open
lizhyuxi opened this issue Aug 11, 2023 · 1 comment
Open

Comments

@lizhyuxi
Copy link

lizhyuxi commented Aug 11, 2023

I trained using four A100 GUP and the total batch size is 36.
After a total of 300,000 times of training, this is the result of the model:
image

which is quite different from the result given in your paper :
image

I did not change the code, what could be the cause?

@xiao-11
Copy link

xiao-11 commented Nov 30, 2023

How long do you train the model on 4 A100?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants