deepseek-ai/DeepSeek-V4-Pro Text Generation β’ 862B β’ Updated about 3 hours ago β’ 123k β’ β’ 2.9k
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-FP8 Text Generation β’ 32B β’ Updated Mar 15 β’ 769k β’ β’ 336
Cerebras REAP Collection Sparse MoE models compressed using REAP (Router-weighted Expert Activation Pruning) method β’ 30 items β’ Updated Feb 25 β’ 138