michaelbenayoun/qwen3-tiny-4kv-heads-8layers-random • Text Generation • 6.61M • Updated 9 days ago • 28
michaelbenayoun/qwen3-tiny-4kv-heads-4layers-random • Text Generation • 5.47M • Updated 9 days ago • 27.7k
michaelbenayoun/deepseekv3-tiny-4kv-heads-4-layers-random • Text Generation • 5.27M • Updated Jul 24 • 1
michaelbenayoun/llama-2-tiny-4kv-heads-2layers-random • Feature Extraction • 2.08M • Updated May 7, 2024 • 3
michaelbenayoun/llama-2-tiny-4kv-heads-8layers-random • Feature Extraction • 2.17M • Updated May 3, 2024 • 1
michaelbenayoun/llama-2-tiny-16layers-32kv-heads-random • Feature Extraction • 1.14M • Updated Jan 4, 2024 • 5