yuzhe gu's picture

3 7 3

yuzhe gu

vanilla1116

·

https://guyuzhe.site/

Liqu1d-G

AI & ML interests

LLM; Hallucination; Self-Improvement

Recent Activity

upvoted a paper about 16 hours ago

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

commented on a paper about 16 hours ago

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

upvoted a paper 21 days ago

Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training

View all activity

Organizations

Papers 3

arxiv:2407.04693

arxiv:2405.20315

arxiv:2403.17297

models

None public yet

datasets

None public yet