Repositories
Showing 10 of 321 repositories
- LOLA-Megatron-DeepSpeed (public, forked from microsoft/Megatron-DeepSpeed)
  Ongoing research training transformer language models at scale, including BERT and GPT-2.