Ring-mini-linear-2.0 🔥 a hybrid-attention MoE model released by Ant Group
inclusionAI/Ring-mini-linear-2.0
✨ Hybrid linear + standard attention
✨ 16.4B total, only 1.6B activated
✨ 512k context window via YaRN
✨ Faster than same-size MoE models
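
For a rough sense of the sparsity, the parameter counts quoted above work out to roughly a tenth of the weights being active per token. A minimal Python sketch of that arithmetic follows; the helper name `activated_fraction` is purely illustrative and not part of any library or of the model's code.

```python
def activated_fraction(total_b: float, active_b: float) -> float:
    """Share of parameters active per token, given counts in billions."""
    if total_b <= 0 or not 0 <= active_b <= total_b:
        raise ValueError("expected total_b > 0 and 0 <= active_b <= total_b")
    return active_b / total_b


# Figures quoted in the post: 16.4B total parameters, 1.6B activated.
frac = activated_fraction(16.4, 1.6)
print(f"{frac:.1%} of parameters active per token")
```

This only restates the two numbers from the post as a ratio; it says nothing about actual speed, which also depends on the attention design and hardware.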