Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DeepSeek发布全球最强开源第二代MoE模型:DeepSeek-V2! #4221

Open
tqangxl opened this issue May 7, 2024 · 3 comments
Open

DeepSeek发布全球最强开源第二代MoE模型:DeepSeek-V2! #4221

tqangxl opened this issue May 7, 2024 · 3 comments
Labels
model request Model requests

Comments

@tqangxl
Copy link

tqangxl commented May 7, 2024

https://mp.weixin.qq.com/s/3AmJpYe1eLPHk7HJLYM24A
pls add DeepSeek-V2
模型&论文双开源

深度求索始终秉持着最开放的开源精神,以开源推动人类AGI事业的前行。这次的DeepSeek-V2模型和论文也将完全开源,免费商用,无需申请:

模型权重:

https://huggingface.co/deepseek-ai
技术报告:

https://github.com/deepseek-ai/DeepSeek-V2/blob/main/deepseek-v2-tech-report.pdf

@tqangxl tqangxl added the model request Model requests label May 7, 2024
@wwjCMP
Copy link

wwjCMP commented May 7, 2024

#4205

@taozhiyuai
Copy link

not much better performance than llama 3, but 3 times bigger in size than 70B. not recommended to run only most personal machine.

@xinge666
Copy link

not much better performance than llama 3, but 3 times bigger in size than 70B. not recommended to run only most personal machine.

this maybe a CPU friendly model

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
model request Model requests
Projects
None yet
Development

No branches or pull requests

4 participants