1 10 5

Yijie Chen

pppa

pppa2019

AI & ML interests

None yet

Recent Activity

upvoted a paper 17 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

upvoted a paper 18 days ago

Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models

upvoted a paper 27 days ago

MiniMax-01: Scaling Foundation Models with Lightning Attention

View all activity

Organizations

pppa's activity

upvoted a paper 17 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 20 days ago • 315

upvoted a paper 18 days ago

Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models

Paper • 2501.13629 • Published 20 days ago • 43

upvoted a paper 27 days ago

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published 28 days ago • 273

upvoted a paper about 1 month ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 255

updated a collection about 1 month ago

Code&Math&Reasoning

Collection

5 items • Updated Jan 3 • 1

upvoted 2 papers about 1 month ago

Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs

Paper • 2412.21187 • Published Dec 30, 2024 • 37

HUNYUANPROVER: A Scalable Data Synthesis Framework and Guided Tree Search for Automated Theorem Proving

Paper • 2412.20735 • Published Dec 30, 2024 • 11

updated 2 collections about 2 months ago

Code&Math&Reasoning

Collection

5 items • Updated Jan 3 • 1

WorldModeling

Collection

3 items • Updated Dec 16, 2024

upvoted a paper about 2 months ago

GenEx: Generating an Explorable World

Paper • 2412.09624 • Published Dec 12, 2024 • 90

New activity in DavidLanz/chinese-dolly-input-output-15k 2 months ago

是否方便把原文给对应上去呢

#1 opened 2 months ago by

pppa

liked a model 4 months ago

openbmb/MiniCPM-2B-dpo-fp16

Text Generation • Updated Sep 7, 2024 • 341 • 34

liked a model 8 months ago

Qwen/Qwen-1_8B-Chat-Int4

Text Generation • Updated Dec 13, 2023 • 516 • 33

liked a model 9 months ago

princeton-nlp/Sheared-LLaMA-1.3B

Text Generation • Updated Jan 23, 2024 • 27.5k • 93

liked a model 11 months ago

ai21labs/Jamba-v0.1

Text Generation • Updated Sep 11, 2024 • 10.3k • 1.18k

upvoted 2 papers 12 months ago

LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens

Paper • 2402.13753 • Published Feb 21, 2024 • 115

Generative Representational Instruction Tuning

Paper • 2402.09906 • Published Feb 15, 2024 • 54