Evaluation methods that generalize better #1170

Closed
1 task
cobraheleah opened this issue May 17, 2024 · 2 comments

@cobraheleah

Describe the feature

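LLMs are heavily gaming the leaderboards on existing public benchmarks, so the scores no longer reflect real model performance. Is there a more robust evaluation method here that generalizes better?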

Would you like to implement this feature yourself?

  • I would like to implement this feature myself and contribute the code to OpenCompass!
@tonysy
Collaborator

tonysy commented May 20, 2024

We provide the OpenCompass 2.0 leaderboard for LLMs, which is built on non-public data.

@tonysy
Collaborator

tonysy commented May 23, 2024

Feel free to re-open if needed.

@tonysy closed this as completed on May 23, 2024