ValueFX9507/Tifa-DeepsexV2-7b-MGRPO-GGUF-Q8 Reinforcement Learning • 8B • Updated Mar 28 • 5.09k • 187
emiliodavola/french-solitaire-dqn-single-solution Reinforcement Learning • Updated 28 days ago • 8 • 2
0xgr3y/Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-tall_tame_panther Text Generation • 0.5B • Updated 21 days ago • 4.06k • 1
AXONVERTEX-AI-RESEARCH/Orchestrator-8B-Q8_0-GGUF Reinforcement Learning • 8B • Updated 11 days ago • 498 • 7