Collection of Quantized Models for MoE
Krishna Teja Chitty-Venkata
AI & ML interests
LLM Optimization, Neural Architecture Search, Quantization, Pruning
Recent Activity
updated a model about 16 hours ago
inference-optimization/Qwen3-30B-A3B-Instruct-2507-NVFP4
published a model about 16 hours ago
inference-optimization/Qwen3-30B-A3B-Instruct-2507-NVFP4
updated a model about 18 hours ago
inference-optimization/Qwen3-30B-A3B-Instruct-2507-FP8-Dynamic