nm-testing/whisper-large-v3.w4a16 Automatic Speech Recognition • 0.3B • Updated Feb 14, 2025 • 16 • 2
nm-testing/llama2.c-stories42M-gsm8k-quantized-only-uncompressed 58.2M • Updated Feb 12, 2025 • 2.44k
nm-testing/DeepSeek-R1-Distill-Llama-70B-FP8-dynamic Text Generation • 71B • Updated Feb 1, 2025 • 5 • 3
nm-testing/TinyLlama-1.1B-Chat-v1.0-gsm8k-partial-24-remaining-fp8-compressed 1B • Updated Jan 29, 2025 • 7
nm-testing/TinyLlama-1.1B-Chat-v1.0-gsm8k-partial-24-entire-fp8-compressed 1B • Updated Jan 29, 2025 • 5
nm-testing/TinyLlama-1.1B-Chat-v1.0-gsm8k-sparse24-0-5-remaining-fp8-compressed 1B • Updated Jan 28, 2025 • 6
nm-testing/TinyLlama-1.1B-Chat-v1.0-gsm8k-sparse24-layer-0-5-fp8-compressed 1B • Updated Jan 28, 2025 • 9
nm-testing/TinyLlama-1.1B-Chat-v1.0-gsm8k-sparse24-layer-0-fp8-compressed 1B • Updated Jan 28, 2025 • 6
nm-testing/granite-8b-code-instruct-128k2of4-W8A8-FP8-Dynamic-Per-Token 8B • Updated Jan 26, 2025 • 6