Update README.md
#81 opened about 7 hours ago
by
FYSIOBASEN
Update README.md
#80 opened about 13 hours ago
by
zhup
Update README.md
#79 opened about 13 hours ago
by
zhup
chat
#77 opened 4 days ago
by
rojithonline
DeepSeek-V3-lite naming conventions?
5
#76 opened 5 days ago
by
AlphaGaO
torch.distributed.DistNetworkError
#75 opened 9 days ago
by
yu19920006607
![](https://cdn-avatars.huggingface.co/v1/production/uploads/no-auth/Q52A8ppy7qo_t1CByZpQ0.png)
remove reference to deprecated transformers code
2
#74 opened 13 days ago
by
winglian
![](https://cdn-avatars.huggingface.co/v1/production/uploads/641dfddf3bae5a77636817c5/2IwNwh9kK98eCHUmOGoWD.png)
Update README.md
#73 opened 14 days ago
by
SamimSaikia
DeepSeek R1 answer ChatGPT ??
4
#72 opened 14 days ago
by
valerebron
![](https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/PljiIkj8R4NbDOM5BOtIi.jpeg)
ValueError: Unrecognized configuration class <class 'transformers_modules.configuration_deepseek.DeepseekV3Config'> to build an AutoTokenizer.
6
#69 opened 15 days ago
by
ajtakto
Paralelized script
#67 opened 15 days ago
by
ajtakto
I am getting an error message while executing pip install - r requirements. txt
5
#64 opened 20 days ago
by
yu19920006607
![](https://cdn-avatars.huggingface.co/v1/production/uploads/no-auth/Q52A8ppy7qo_t1CByZpQ0.png)
Does deepseek allow adding new data?
#63 opened 23 days ago
by
JoshuaBontor
`aux_loss_alpha` should be 1e-4 instead of 1e-3?
#61 opened 28 days ago
by
cuichenx
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1647892352892-6237746c4f73a51ab018f994.png)
captcha not loading on edge
#60 opened 30 days ago
by
leo-smi
Upload shreya.zip
#59 opened about 1 month ago
by
Msdthala
Upload IMG_20250111_184317.jpg
#58 opened about 1 month ago
by
Sajalhero
![](https://cdn-avatars.huggingface.co/v1/production/uploads/no-auth/SlyuBC5uptLD624QM0ocA.png)
无辅助损失的专家路由
1
#56 opened about 1 month ago
by
qing9
AI Games
#55 opened about 1 month ago
by
ChickenUJHAYIUSGU
![](https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/cjDH051VafiXK_VU32yHN.jpeg)
Upload IMG_0509 4.HEIC
#54 opened about 1 month ago
by
borhanrabbany
![](https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/laHYZPLqz0TSNlt6ZtLZx.png)
how to inference with mtp?
#53 opened about 1 month ago
by
duanyu
Does it support ollama
2
#52 opened about 1 month ago
by
sminbb
Create gngn
#49 opened about 1 month ago
by
axingd
Missing tool call in system prompt
1
#48 opened about 1 month ago
by
bchenfireworks
Update config.json
#47 opened about 1 month ago
by
STATIKwitak
![](https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/t7n1gwpIIkuhXSNg1eBpg.png)
Rename figures/benchmark.png to figures/𓇋𓀀𓍿.png
#46 opened about 1 month ago
by
STATIKwitak
![](https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/t7n1gwpIIkuhXSNg1eBpg.png)
Rename figures/benchmark.png to figures/𓇋𓀀𓍿.png
#45 opened about 1 month ago
by
STATIKwitak
![](https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/t7n1gwpIIkuhXSNg1eBpg.png)
Upload IMG_0295.HEIC
#42 opened about 1 month ago
by
Umarkhan499
![](https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/7w_6MhxlSNrKoa8On0fSf.jpeg)
vLLM on A100s
6
#41 opened about 1 month ago
by
fsaudm
When do you plan to integrate Huggingface Transformer?
#40 opened about 1 month ago
by
echooooooooo
Deciphering messages
1
#39 opened about 1 month ago
by
DoctorDonald
Update README.md
#38 opened about 1 month ago
by
chaitanyayerroju
Update README.md
1
#37 opened about 1 month ago
by
TomGrc
Training problem
3
#29 opened about 1 month ago
by
DonGan13
Update README.md
1
#28 opened about 1 month ago
by
Wisnet
![](https://cdn-avatars.huggingface.co/v1/production/uploads/no-auth/TKH2H3sVleJFCPYKPk1Yz.png)
Update README.md
2
#27 opened about 1 month ago
by
Aikun7777777
Failed to run the model with 4 nodes of 8 4090
17
#25 opened about 1 month ago
by
aisensiy
kill openai,come on
#24 opened about 1 month ago
by
chaochaoli
Update modeling_deepseek.py
1
#23 opened about 1 month ago
by
erichartford
![](https://cdn-avatars.huggingface.co/v1/production/uploads/no-auth/j-sg53QbeCkHiLl-_1Tp1.png)
is_torch_greater_or_equal_than_1_13 deprecated
#22 opened about 1 month ago
by
erichartford
![](https://cdn-avatars.huggingface.co/v1/production/uploads/no-auth/j-sg53QbeCkHiLl-_1Tp1.png)
Request: DOI
#21 opened about 1 month ago
by
TheDandyMan
Has anyone tried running this model on Ollama?
6
#20 opened about 1 month ago
by
Yuxin362
vLLM on A100s
4
#19 opened about 1 month ago
by
fsaudm
Fine-tuning roadmap
4
#18 opened about 1 month ago
by
RonanMcGovern
![](https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/-6Yq7oM_Ju6Zi2GEvobvb.jpeg)
CUDA out of memory error during fp8 to bf16 model conversion + fix
1
#17 opened about 1 month ago
by
sszymczyk
when llm leaderboard?
3
#14 opened about 1 month ago
by
blazespinnaker
Update README.md
#13 opened about 1 month ago
by
BANblongz
![](https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/8YiJavTzv3xD_JpMXxSKK.png)
Please make V3-lite
3
#12 opened about 2 months ago
by
rombodawg
![](https://cdn-avatars.huggingface.co/v1/production/uploads/642cc1c253e76b4c2286c58e/fGtQ_QeTjUgBhIT89dpUt.jpeg)
minimum vram?
11
#9 opened about 2 months ago
by
CHNtentes