yuzhe gu

vanilla1116

https://guyuzhe.site/

Liqu1d-G

AI & ML interests

LLM; Hallucination; Self-Improvement

Recent Activity

upvoted a paper about 20 hours ago

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

commented on a paper about 20 hours ago

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

upvoted a paper 21 days ago

Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training

vanilla1116's activity

commented on a paper about 20 hours ago

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

Paper • 2502.06781 • Published 1 day ago • 36

New activity in opencompass/anah 7 months ago

[bot] Conversion to Parquet

#1 opened 7 months ago by

parquet-converter

commented on 2 papers 7 months ago

ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models

Paper • 2407.04693 • Published Jul 5, 2024 • 1