Compatible with transformers APIs?
1
#18 opened almost 2 years ago
by
qiz
How to get this running on Oobabooga with RTX 4080 16GB?
8
#17 opened almost 2 years ago
by
Goldenblood56
I'm getting 0.4 tokens/s on a 4090
2
#16 opened almost 2 years ago
by
androtester
.pt version uses 2gb less VRAM for me than the non-groupsized .safetensors
3
#10 opened almost 2 years ago
by
Monero
![](https://cdn-avatars.huggingface.co/v1/production/uploads/63d625232e397d9f8e1eccac/AOZv_jnPhcj9t6thSs11d.png)