Chen Cui
cuichenx
AI & ML interests
None yet
Recent Activity
updated a model 28 days ago: deepseek-ai/DeepSeek-V3
new activity 28 days ago: deepseek-ai/DeepSeek-V3: `aux_loss_alpha` should be 1e-4 instead of 1e-3?
updated a model 28 days ago: deepseek-ai/DeepSeek-V3-Base
Organizations
cuichenx's activity
`aux_loss_alpha` should be 1e-4 instead of 1e-3?
#61 opened 28 days ago by cuichenx
`aux_loss_alpha` should be 1e-4 instead of 1e-3?
#60 opened 28 days ago by cuichenx