Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
4
1
Yuandong Tian
tydsh
Follow
lispczz's profile picture
athoye's profile picture
vikaswebdev's profile picture
10 followers
ยท
2 following
https://yuandong-tian.com/
tydsh
yuandong-tian
AI & ML interests
Reinforcement Learning, Optimization, Representation Learning
Recent Activity
authored
a paper
5 days ago
Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning
authored
a paper
14 days ago
Towards General-Purpose Model-Free Reinforcement Learning
authored
a paper
19 days ago
Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback
View all activity
Organizations
None yet
Papers
21
arxiv:
2502.03275
arxiv:
2501.16142
arxiv:
2501.10799
arxiv:
2412.06769
Expand 21 papers
models
None public yet
datasets
None public yet