返回
RCreddit.com
7
·开发者社区 · RSS

ascend-tribe/openPangu-2.0-Flash (They haven't uploaded it to Huggingface yet)

查看原文

https://ai.gitcode.com/ascend-tribe/openPangu-2.0-Flash

openPangu-2.0-Flash is an MoE model trained on Ascend. The model has 92B total parameters and 6B activated parameters. Its context length is 512k. The total pretraining data contains 34T tokens. During Post-training, openPangu-2.0-Flash is trained through unified SFT with slow and fast thinking capability, multiple specialist RL traning, on-policy distillation combining multiple RL specialists.

主题标签Hugging Face
原始关键词#huggingface#openpangu#uploaded#ascend#flash#haven
查看原文reddit.com
单一来源,暂无交叉验证
ascend-tribe/openPangu-2.0-Flash (They haven't uploaded it to Huggingface yet) · BuzzRadr