3 14 11

Xiaoye Qu

Xiaoye08

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

UltraIF: Advancing Instruction Following from the Wild

authored a paper 20 days ago

Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback

upvoted a paper 20 days ago

Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback

View all activity

Organizations

Xiaoye08's activity

upvoted a paper 5 days ago

UltraIF: Advancing Instruction Following from the Wild

Paper • 2502.04153 • Published 5 days ago • 20

authored a paper 20 days ago

Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback

Paper • 2501.12895 • Published 20 days ago • 55

upvoted a paper 20 days ago

Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback

Paper • 2501.12895 • Published 20 days ago • 55

upvoted 2 papers about 1 month ago

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published Jan 4 • 90

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 255

liked a dataset about 1 month ago

hitsmy/PRMBench_Preview

Viewer • Updated Jan 7 • 6.22k • 150 • 4

upvoted a paper about 1 month ago

PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models

Paper • 2501.03124 • Published Jan 6 • 14

liked a Space about 2 months ago

505

Scaling test-time compute

📈

Enhance math problem solving by scaling test-time compute

upvoted 2 papers about 2 months ago

ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing

Paper • 2412.14711 • Published Dec 19, 2024 • 16

Diving into Self-Evolving Training for Multimodal Reasoning

Paper • 2412.17451 • Published Dec 23, 2024 • 43

upvoted a paper 2 months ago

X-Prompt: Towards Universal In-Context Image Generation in Auto-Regressive Vision Language Foundation Models

Paper • 2412.01824 • Published Dec 2, 2024 • 65

liked 2 models 2 months ago

llama-moe/LLaMA-MoE-v2-3_8B-residual-sft

Updated Dec 3, 2024 • 62 • 2

llama-moe/LLaMA-MoE-v2-3_8B-2_8-sft

Updated Dec 3, 2024 • 35 • 3

commented a paper 4 months ago

CLIP-MoE: Towards Building Mixture of Experts for CLIP with Diversified Multiplet Upcycling

Paper • 2409.19291 • Published Sep 28, 2024 • 19 •

authored 6 papers 4 months ago

ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLM

Paper • 2408.12076 • Published Aug 22, 2024 • 12