Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

baichuan2 mmlu结果复现的问题 #194

Open
zhanghan1992 opened this issue Nov 2, 2023 · 0 comments
Open

baichuan2 mmlu结果复现的问题 #194

zhanghan1992 opened this issue Nov 2, 2023 · 0 comments

Comments

@zhanghan1992
Copy link

zhanghan1992 commented Nov 2, 2023

评估使用的代码:https://github.com/baichuan-inc/Baichuan-7B/blob/main/evaluation/evaluate_mmlu.py

用bf16精度测试 llama2-13-hf 和 baichuan2-13b-base
llama2-13-hf: 0.550
baichuan2-13b-base: 0.564

改了一行代码,用fp32测试:
#model = AutoModelForCausalLM.from_pretrained(args.model, torch_dtype=torch.bfloat16, device_map="auto",trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(args.model, device_map="auto",trust_remote_code=True)
llama2-13-hf: 0.554
baichuan2-13b-base: 0.590

请教下,为啥baichuan2在bf16和fp32精度下结果差这么多?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant