noaux_tc not supported for training

#12
by chocoded - opened

Hi,
I'm traing the deepseek-ai/deepseek-vl2 model and find that the default top_k method is noaux_tc. However, line 468 in modeling_deepseek.py shows that noaux_tc is not supported for traing. I wonder why.

截屏2025-02-06 20.31.27.png

Sign up or log in to comment