Abdullah Abdelrhim's picture

Abdullah Abdelrhim

abdullah

·

abodacs

AI & ML interests

None yet

Recent Activity

liked a dataset about 9 hours ago

internlm/OREAL-RL-Prompts

upvoted a collection about 13 hours ago

liked a model about 15 hours ago

agentica-org/DeepScaleR-1.5B-Preview

View all activity

Organizations

abdullah's activity

upvoted a collection about 13 hours ago

OREAL

7 items • Updated about 19 hours ago • 5

upvoted an article 2 days ago

Article

What is test-time compute and how to scale it?

By

and 1 other •

5 days ago

• 16

upvoted a paper 8 days ago

Improving Transformer World Models for Data-Efficient RL

Paper • 2502.01591 • Published 8 days ago • 9

upvoted an article 9 days ago

Article

Open-R1: Update #1

By

and 7 others •

10 days ago

• 268

upvoted a collection 9 days ago

Reasoning Datasets

Distilled synthetic Reasoning datasets • 7 items • Updated 10 days ago • 50

upvoted an article 20 days ago

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

20 days ago

• 124

upvoted a paper 20 days ago

Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments

Paper • 2501.10893 • Published 24 days ago • 23

upvoted 3 papers about 1 month ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 255

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published Jan 4 • 90

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Paper • 2402.03300 • Published Feb 5, 2024 • 92

upvoted an article about 1 month ago

Article

Fine-tune ModernBERT for text classification using synthetic data

By

•

Dec 30, 2024

• 32

upvoted 2 papers about 2 months ago

Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search

Paper • 2412.18319 • Published Dec 24, 2024 • 37

LearnLM: Improving Gemini for Learning

Paper • 2412.16429 • Published Dec 21, 2024 • 22

upvoted a collection 2 months ago

DeTikZify

Synthesizing Graphics Programs for Scientific Figures and Sketches with TikZ • 11 items • Updated Dec 4, 2024 • 7

upvoted a paper 2 months ago

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 79

upvoted an article 2 months ago

Article

Rethinking Backpropagation: Thoughts on What's Wrong with Backpropagation

By

•

Dec 2, 2024

• 5

upvoted a paper 2 months ago

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published Dec 6, 2024 • 130

upvoted an article 2 months ago

Article

🐺🐦‍⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs

By

•

Dec 4, 2024

• 76

upvoted a collection 2 months ago

LLaMA-O1-1129 Datasets, Models, Codes and Papers

8 items • Updated Dec 3, 2024 • 18

upvoted a paper 3 months ago

Cut Your Losses in Large-Vocabulary Language Models

Paper • 2411.09009 • Published Nov 13, 2024 • 45