微调DeepSeek-R1打造SQL语言转自然语言视频教程
#16 opened 1 day ago
by
leo009
![](https://cdn-avatars.huggingface.co/v1/production/uploads/646334477e9025b09bd57d75/MDysdEWFWeJQDeDOLmdvI.png)
One more "0" in model-00001-of-000002.safetensors?
#15 opened 1 day ago
by
PPrimo
Excellent models !!! - Plans for Mistral Nemo and/or Gemma 2 Distills ?
#14 opened 5 days ago
by
DavidAU
![](https://cdn-avatars.huggingface.co/v1/production/uploads/65ea44635b64331c067d3751/yCim-7c3tm67o5wWP_6cE.jpeg)
Adding Evaluation Results
#12 opened 11 days ago
by
Mikhil-jivus
Missing multilanguage capabilities
5
#11 opened 12 days ago
by
h4rz3rk4s3
run in colab t4
#9 opened 16 days ago
by
rakmik
Adding Evaluation Results
#8 opened 16 days ago
by
T145
![](https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/rv3XTyO6TSLNmebutG9wy.png)
Add pipeline tag, link to paper
#7 opened 19 days ago
by
nielsr
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1608042047613-5f1158120c833276f61f1a84.jpeg)
Do the distilled models also have 128K context?
1
#4 opened 22 days ago
by
Troyanovsky
How was this quantized?
1
#3 opened 22 days ago
by
imq
missing special_tokens_map.json file
#2 opened 22 days ago
by
vince62s
![](https://cdn-avatars.huggingface.co/v1/production/uploads/6495b47a74ce69cc4eab61f0/2eg17fMXjshpfQfSq5jyP.png)