ZhangJin
Benjamin0
·
AI & ML interests
None yet
Recent Activity
upvoted an article about 14 hours ago
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge
upvoted an article 4 days ago
Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference
liked a dataset 7 months ago
HuggingFaceM4/the_cauldron
Organizations
None yet
Models
None public yet
Datasets
None public yet