-
-
-
-
-
-
Inference Providers
Active filters:
4-bit
Qwen/Qwen2.5-32B-Instruct-AWQ
Text Generation
•
Updated
•
32.2k
•
56
unsloth/Qwen2.5-14B-Instruct-bnb-4bit
Text Generation
•
Updated
•
29.9k
•
7
unsloth/Llama-3.2-1B-bnb-4bit
Text Generation
•
Updated
•
31.2k
•
11
unsloth/Llama-3.3-70B-Instruct-bnb-4bit
Text Generation
•
Updated
•
269k
•
32
ibnzterrell/Meta-Llama-3.3-70B-Instruct-AWQ-INT4
Text Generation
•
Updated
•
22.8k
•
18
OPEA/deepseek-vl2-int4-sym-gptq-inc
Wiseyak/OpenWiseyak-0.1-Base-4bit
Text Generation
•
Updated
•
75
•
2
unsloth/DeepSeek-R1-Distill-Qwen-1.5B-unsloth-bnb-4bit
Text Generation
•
Updated
•
30.9k
•
10
mlx-community/DeepSeek-R1-Distill-Qwen-32B-4bit
Text Generation
•
Updated
•
351k
•
28
mlx-community/DeepSeek-R1-Distill-Llama-70B-4bit
Text Generation
•
Updated
•
42.6k
•
2
ReadyArt/L3.3-Nevoria-R1-70b_EXL2_4.0bpw_H8
Text Generation
•
Updated
•
152
•
4
unsloth/DeepSeek-R1-Distill-Llama-70B-unsloth-bnb-4bit
Text Generation
•
Updated
•
1.35k
•
2
MaziyarPanahi/Mistral-Small-24B-Instruct-2501-GGUF
Text Generation
•
Updated
•
132k
•
2
unsloth/Mistral-Small-24B-Instruct-2501-bnb-4bit
Text Generation
•
Updated
•
822
•
2
unsloth/Mistral-Small-24B-Base-2501-unsloth-bnb-4bit
Text Generation
•
Updated
•
1.67k
•
2
unsloth/Qwen2.5-VL-7B-Instruct-bnb-4bit
Image-Text-to-Text
•
Updated
•
1.08k
•
3
FINGU-AI/Chocolatine-Fusion-14B
Text Generation
•
Updated
•
135
•
3
Vikhrmodels/QVikhr-2.5-1.5B-Instruct-SMPO_MLX-4bit
Text Generation
•
Updated
•
102
•
2
unsloth/Qwen2.5-3B-Instruct-unsloth-bnb-4bit
Text Generation
•
Updated
•
29.5k
•
2
unsloth/Qwen2.5-0.5B-unsloth-bnb-4bit
Text Generation
•
Updated
•
1.65k
•
2
moot20/s1-32B-MLX-4bits
Text Generation
•
Updated
•
43
•
2
GuilhermeNaturaUmana/Nature-Reason-1-AGI-LORA
Updated
•
4
•
2
PointerHQ/Qwen2.5-VL-72B-Instruct-Pointer-AWQ
Image-Text-to-Text
•
Updated
•
319
•
2
TheBloke/stable-vicuna-13B-GPTQ
Text Generation
•
Updated
•
63
•
219
TheBloke/Wizard-Vicuna-13B-Uncensored-GPTQ
Text Generation
•
Updated
•
4.01k
•
318
TheBloke/Wizard-Vicuna-7B-Uncensored-GPTQ
Text Generation
•
Updated
•
3.86k
•
163
TheBloke/Wizard-Vicuna-30B-Uncensored-GPTQ
Text Generation
•
Updated
•
779
•
573
TheBloke/Karen_theEditor_13B-GPTQ
Text Generation
•
Updated
•
31
•
13
TheBloke/Nous-Hermes-13B-GPTQ
Text Generation
•
Updated
•
205
•
176
TheBloke/WizardLM-13B-V1-0-Uncensored-SuperHOT-8K-GPTQ
Text Generation
•
Updated
•
90
•
46