Nemotron models that have been converted and/or quantized to work well in vLLM
Michael Goin PRO
mgoin
AI & ML interests
LLM inference optimization, compression, quantization, pruning, distillation
Recent Activity
upvoted a paper 1 day ago
QuEST: Stable Training of LLMs with 1-Bit Weights and Activations
updated a model 5 days ago
nm-testing/pixtral-12b-FP8-dynamic-all
updated a model 5 days ago
neuralmagic/pixtral-12b-FP8-dynamic
Organizations
Collections 1
Spaces 4
Models 94
mgoin/pixtral-12b • Image-Text-to-Text • Updated • 314 • 1
mgoin/Llama-3.2-1B-Instruct-FP8-ATTN • Updated • 9
mgoin/Llama-3.2-1B-Instruct-FP8-dynamic-ATTN • Updated • 5
mgoin/Pixtral-Large-Instruct-2411 • Updated
mgoin/Qwen2.5-Coder-32B-Instruct-fp8 • Updated
mgoin/nemotron-3-8b-chat-4k-sft-hf • Text Generation • Updated • 15
mgoin/llava-onevision-qwen2-7b-ov-hf-bnb-full-4bit • Image-Text-to-Text • Updated • 65
mgoin/MiniCPM-Llama3-V-2_5-int4 • Visual Question Answering • Updated • 14
mgoin/DeepSeek-Coder-V2-Lite-Instruct-FP8 • Updated • 2.93k
mgoin/Mixtral-8x7B-Instruct-v0.1-FP8 • Updated • 6