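LLMs are heavily gaming the leaderboards on existing public benchmarks, so the scores no longer reflect real model performance. Do you have an evaluation method that is more robust and generalizes better?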
We provide the OpenCompass 2.0 leaderboard for LLMs, which is based on non-public data.
Feel free to re-open if needed.
tonysy