trl-internal-testing/tiny-DeepseekV3ForCausalLM Text Generation • 5.52M • Updated 12 days ago • 517 • 3
unsloth/Qwen3-Coder-480B-A35B-Instruct-GGUF Text Generation • 480B • Updated Jul 31 • 5.14k • 163