Edit Models filters

Inference Providers

HF Inference API

Misc

4-bit precision

AutoTrain Compatible

text-generation-inference

Inference Endpoints

8-bit precision

Mixture of Experts

text-embeddings-inference

Carbon Emissions

Models

24,225

Full-text search

Active filters: 4-bit

Qwen/Qwen2.5-32B-Instruct-AWQ

Text Generation • Updated Oct 9, 2024 • 32.2k • 56

unsloth/Qwen2.5-14B-Instruct-bnb-4bit

Text Generation • Updated 6 days ago • 29.9k • 7

unsloth/Llama-3.2-1B-bnb-4bit

Text Generation • Updated 20 days ago • 31.2k • 11

unsloth/Llama-3.3-70B-Instruct-bnb-4bit

Text Generation • Updated Jan 7 • 269k • 32

ibnzterrell/Meta-Llama-3.3-70B-Instruct-AWQ-INT4

Text Generation • Updated Dec 7, 2024 • 22.8k • 18

OPEA/deepseek-vl2-int4-sym-gptq-inc

Updated Jan 6 • 237 • 3

Wiseyak/OpenWiseyak-0.1-Base-4bit

Text Generation • Updated Dec 31, 2024 • 75 • 2

unsloth/DeepSeek-R1-Distill-Qwen-1.5B-unsloth-bnb-4bit

Text Generation • Updated 10 days ago • 30.9k • 10

mlx-community/DeepSeek-R1-Distill-Qwen-32B-4bit

Text Generation • Updated 22 days ago • 351k • 28

mlx-community/DeepSeek-R1-Distill-Llama-70B-4bit

Text Generation • Updated 21 days ago • 42.6k • 2

ReadyArt/L3.3-Nevoria-R1-70b_EXL2_4.0bpw_H8

Text Generation • Updated 17 days ago • 152 • 4

unsloth/DeepSeek-R1-Distill-Llama-70B-unsloth-bnb-4bit

Text Generation • Updated 10 days ago • 1.35k • 2

MaziyarPanahi/Mistral-Small-24B-Instruct-2501-GGUF

Text Generation • Updated 12 days ago • 132k • 2

unsloth/Mistral-Small-24B-Instruct-2501-bnb-4bit

Text Generation • Updated 10 days ago • 822 • 2

unsloth/Mistral-Small-24B-Base-2501-unsloth-bnb-4bit

Text Generation • Updated 10 days ago • 1.67k • 2

unsloth/Qwen2.5-VL-7B-Instruct-bnb-4bit

Image-Text-to-Text • Updated 11 days ago • 1.08k • 3

FINGU-AI/Chocolatine-Fusion-14B

Text Generation • Updated 9 days ago • 135 • 3

Vikhrmodels/QVikhr-2.5-1.5B-Instruct-SMPO_MLX-4bit

Text Generation • Updated 8 days ago • 102 • 2

unsloth/Qwen2.5-3B-Instruct-unsloth-bnb-4bit

Text Generation • Updated 6 days ago • 29.5k • 2

unsloth/Qwen2.5-0.5B-unsloth-bnb-4bit

Text Generation • Updated 6 days ago • 1.65k • 2

moot20/s1-32B-MLX-4bits

Text Generation • Updated 5 days ago • 43 • 2

GuilhermeNaturaUmana/Nature-Reason-1-AGI-LORA

Updated 3 days ago • 4 • 2

PointerHQ/Qwen2.5-VL-72B-Instruct-Pointer-AWQ

Image-Text-to-Text • Updated 2 days ago • 319 • 2

TheBloke/stable-vicuna-13B-GPTQ

Text Generation • Updated Aug 21, 2023 • 63 • 219

TheBloke/Wizard-Vicuna-13B-Uncensored-GPTQ

Text Generation • Updated Sep 27, 2023 • 4.01k • 318

TheBloke/Wizard-Vicuna-7B-Uncensored-GPTQ

Text Generation • Updated Sep 27, 2023 • 3.86k • 163

TheBloke/Wizard-Vicuna-30B-Uncensored-GPTQ

Text Generation • Updated Sep 27, 2023 • 779 • 573

TheBloke/Karen_theEditor_13B-GPTQ

Text Generation • Updated Sep 27, 2023 • 31 • 13

TheBloke/Nous-Hermes-13B-GPTQ

Text Generation • Updated Aug 21, 2023 • 205 • 176

TheBloke/WizardLM-13B-V1-0-Uncensored-SuperHOT-8K-GPTQ

Text Generation • Updated Aug 21, 2023 • 90 • 46