michaelbenayoun/qwen3-tiny-4kv-heads-8layers-random Text Generation • 6.61M • Updated Oct 30 • 607
michaelbenayoun/qwen3-tiny-4kv-heads-4layers-random Text Generation • 5.47M • Updated Oct 30 • 26.6k
michaelbenayoun/deepseekv3-tiny-4kv-heads-4-layers-random Text Generation • 5.27M • Updated Jul 24 • 5
michaelbenayoun/granite-tiny-4kv-heads-4layers-random Text Generation • 4.2M • Updated Jun 18 • 737
michaelbenayoun/llama-2-tiny-4kv-heads-4layers-random Text Generation • 8.54M • Updated Jun 2 • 113k
michaelbenayoun/llama-2-tiny-4kv-heads-16layers-random Text Generation • 8.98M • Updated May 27 • 20
michaelbenayoun/llama-2-tiny-4kv-heads-2layers-random Feature Extraction • 2.08M • Updated May 7, 2024 • 8
michaelbenayoun/llama-2-tiny-4kv-heads-8layers-random Feature Extraction • 2.17M • Updated May 3, 2024 • 4
michaelbenayoun/llama-2-tiny-16layers-random Feature Extraction • 1.14M • Updated Jan 9, 2024 • 7
michaelbenayoun/llama-2-tiny-16layers-32kv-heads-random Feature Extraction • 1.14M • Updated Jan 4, 2024 • 9
michaelbenayoun/gpt-neox-tiny-4layers-random Feature Extraction • 59.7k • Updated Jan 4, 2024 • 7
michaelbenayoun/mistral-tiny-4layers-8kv-heads-random Text Generation • 2.08M • Updated Nov 9, 2023 • 7