Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
deepseek-ai
/
DeepSeek-R1
like
8.3k
Follow
DeepSeek
31.7k
Text Generation
Transformers
Safetensors
deepseek_v3
conversational
custom_code
fp8
arxiv:
2501.12948
License:
mit
Model card
Files
Files and versions
Community
131
Train
Deploy
Use this model
Update model_max_length in tokenizer_config.json
#139
by
kkokkie2360
- opened
1 day ago
base:
refs/heads/main
←
from:
refs/pr/139
Discussion
Files changed
+1
-1
kkokkie2360
1 day ago
The model_max_length should be the same as model's context length
See translation
Update model_max_length in tokenizer_config.json
2ea4a0b8
Edit
Preview
Upload images, audio, and videos by dragging in the text input, pasting, or
clicking here
.
Tap or paste here to upload images
Ready to merge
This branch is ready to get merged automatically.
Comment
·
Sign up
or
log in
to comment