DeepSeek V4 Replicas
Small-scale faithful replicas of the DeepSeek-V4 architecture for ablation and weight-transfer research.
kshitijthakkar/deepseek-v4-mini-300M-init Text Generation • 0.3B • Updated Apr 29 • 15
kshitijthakkar/deepseek-v4-mini-1B-init Text Generation • 1B • Updated Apr 29 • 9
kshitijthakkar/deepseek-v4-mini-3B-init Text Generation • 3B • Updated Apr 29 • 9 • 1
kshitijthakkar/deepseek-v4-mini-6B-init Text Generation • 8B • Updated Apr 30 • 11 • 4
mcp-server-bench
A collection of benchmarking results comparing Gradio and FastMCP.
kshitijthakkar/mcp-server-bench Viewer • Updated Feb 27 • 360 • 61
kshitijthakkar/mcp-server-bench-gradio-optimized Viewer • Updated Mar 2 • 48 • 47
kshitijthakkar/mcp-server-bench-gradio Viewer • Updated Mar 4 • 12 • 45
kshitijthakkar/mcp-server-bench-gradio-optimized-full-bench Viewer • Updated Mar 2 • 337 • 49
Running Racing for Chiku — a chiku-inu field report 🐾 chiku-inu's Gemma-challenge contributions & lessons
Runtime error Agents 1 E-Commerce Product Content Generator 🛒 Generate product photos and marketing copy for e‑commerce
kshitijthakkar/deepseek-v4-mini-300M-recovered Text Generation • 0.3B • Updated about 1 month ago • 16 • 1
kshitijthakkar/deepseek-v4-mini-300M-recovered-h100 Text Generation • 0.3B • Updated about 1 month ago • 3
kshitijthakkar/deepseek-v4-mini-300M-recovered-wip Text Generation • 0.3B • Updated about 1 month ago • 5