-
-
-
-
-
-
Inference Providers
Active filters:
4-bit
Disty0/Z-Image-Turbo-SDNQ-uint4-svd-r32
Updated
•
16.8k
•
42
huihui-ai/Huihui-GLM-4.6-abliterated-mlx-4bit
Text Generation
•
353B
•
Updated
•
395
•
13
nightmedia/gpt-oss-120b-heretic-v2-mxfp4-q8-hi-mlx
Text Generation
•
117B
•
Updated
•
659
•
6
QuantTrio/DeepSeek-V3.2-Speciale-AWQ
Text Generation
•
685B
•
Updated
•
46
•
4
mlx-community/Llama-3.2-3B-Instruct-4bit
Text Generation
•
0.5B
•
Updated
•
12.5k
•
37
0xSero/GLM-4.6-REAP-218B-A32B-W4A16-AutoRound
Text Generation
•
2B
•
Updated
•
417
•
3
MaziyarPanahi/Ministral-3-3B-Reasoning-2512-GGUF
Text Generation
•
3B
•
Updated
•
12.1k
•
3
hugging-quants/Meta-Llama-3.1-8B-Instruct-AWQ-INT4
Text Generation
•
2B
•
Updated
•
210k
•
80
Qwen/Qwen3-32B-AWQ
Text Generation
•
6B
•
Updated
•
128k
•
116
Qwen/Qwen3-30B-A3B-GPTQ-Int4
Text Generation
•
5B
•
Updated
•
553k
•
40
Intel/DeepSeek-R1-0528-Qwen3-8B-int4-AutoRound
Text Generation
•
2B
•
Updated
•
525
•
6
lmstudio-community/Qwen3-Coder-30B-A3B-Instruct-MLX-4bit
Text Generation
•
31B
•
Updated
•
142k
•
11
QuantTrio/DeepSeek-V3.1-AWQ-Lite
Text Generation
•
Updated
•
37
•
3
Intel/DeepSeek-V3.1-Terminus-int4-mixed-AutoRound
Text Generation
•
Updated
•
77
•
4
QuantTrio/Qwen3-VL-32B-Instruct-AWQ
Image-Text-to-Text
•
33B
•
Updated
•
11.7k
•
8
nhe-ai/maya1-mlx-4Bit
Text-to-Speech
•
0.5B
•
Updated
•
177
•
3
mlx-community/VibeThinker-1.5B-mlx-4bit
Text Generation
•
0.2B
•
Updated
•
1.06k
•
24
VibeStudio/MiniMax-M2-THRIFT-55-MLX-4bit
106B
•
Updated
•
109
•
2
unsloth/Ministral-3-14B-Base-2512-bnb-4bit
14B
•
Updated
•
135
•
2
QuantTrio/DeepSeek-V3.2-AWQ
Text Generation
•
685B
•
Updated
•
505
•
2
TheBloke/Wizard-Vicuna-30B-Uncensored-GPTQ
Text Generation
•
4B
•
Updated
•
40.3k
•
590
TheBloke/vicuna-7B-v1.5-GPTQ
Text Generation
•
1B
•
Updated
•
80
•
17
TheBloke/dolphin-2.2.1-mistral-7B-AWQ
Text Generation
•
1B
•
Updated
•
111
•
16
TheBloke/deepseek-coder-1.3b-instruct-AWQ
Text Generation
•
0.3B
•
Updated
•
102
•
4
TheBloke/saiga_mistral_7b-AWQ
Text Generation
•
1B
•
Updated
•
139
•
4
TheBloke/Mixtral-8x7B-Instruct-v0.1-GPTQ
Text Generation
•
6B
•
Updated
•
297k
•
140
unsloth/llama-3-70b-bnb-4bit
Text Generation
•
37B
•
Updated
•
1.04k
•
47
lllyasviel/omost-llama-3-8b-4bits
Text Generation
•
5B
•
Updated
•
274
•
24
unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit
Text Generation
•
5B
•
Updated
•
278k
•
89
unsloth/Meta-Llama-3.1-70B-Instruct-bnb-4bit
Text Generation
•
37B
•
Updated
•
6.64k
•
32