An Yang
yangapku
AI & ML interests
NLP and Deep Learning
Recent Activity
upvoted a paper about 1 month ago
CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings
upvoted a paper about 2 months ago
Qwen2.5 Technical Report
upvoted a paper 2 months ago
ProcessBench: Identifying Process Errors in Mathematical Reasoning
Organizations
yangapku's activity
Create README.md
#1 opened 5 months ago
by
Zhenru
Update README of branch dev_triton.
2
#11 opened about 1 year ago
by
Cheshire94
Does Qwen support 16k context, what is the best config for max_new_tokens?
2
#22 opened over 1 year ago
by
Cheshire94
RuntimeError: The size of tensor a (8192) must match the size of tensor b (11581) at non-singleton dimension 3
1
#32 opened over 1 year ago
by
wujiekd
Fix typo
#29 opened over 1 year ago
by
IlysvlVEizbr
Load tokenizer and model in no internet kernel?
1
#33 opened over 1 year ago
by
nikjohn7
FlashAttention still needs to be disabled during inference; with it enabled, the output is currently garbled
1
#27 opened over 1 year ago
by
Trangle
I see the model has been updated; is there any explanation of the changes?
2
#21 opened over 1 year ago
by
Weiguo
The _convert_id_to_token method is not implemented
2
#1 opened over 1 year ago
by
YeungNLP
does it support Chinese and English mixed input?
5
#1 opened about 2 years ago
by
Baicai003
How can I add context with text input along with the image and the labels?
3
#5 opened about 2 years ago
by
micole66
remove styling to fix spacing
#4 opened about 2 years ago
by
akhaliq
Minor nit
1
#3 opened about 2 years ago
by
osanseviero