Loubna Ben Allal

loubnabnl

https://loubnabnl.github.io/

AI & ML interests

SmolLMs, ML for code, data

Recent Activity

new activity about 5 hours ago

open-r1/OpenR1-Math-220k:Update README.md

new activity about 5 hours ago

open-r1/OpenR1-Math-220k:mismatch between the schema of the data

commented on an article about 6 hours ago

Open R1: Update #2

loubnabnl's activity

New activity in open-r1/OpenR1-Math-220k about 5 hours ago

Update README.md

#3 opened about 6 hours ago by davidberenstein1957

mismatch between the schema of the data

#2 opened about 12 hours ago by

commented on Open R1: Update #2 about 6 hours ago

The model actually fits on two 8xH100 nodes: https://huggingface.co/blog/open-r1/update-1#synthetic-data-generation
The figure of 15 generations per hour per H100 is the throughput on four nodes divided by 32 GPUs (we used four nodes to avoid the cache filling up).
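
As a rough check of the arithmetic in the comment above, the sketch below recovers the aggregate throughput implied by the per-GPU figure. It assumes each node has 8 GPUs (read off "8xH100"); the function and constant names are hypothetical and used only for illustration.

```python
# Assumptions taken from the comment: 4 nodes, 8 H100s per node,
# 15 generations per hour per H100. Names here are illustrative only.
GPUS_PER_NODE = 8
NODES = 4
PER_GPU_PER_HOUR = 15


def cluster_throughput(per_gpu_per_hour: float, nodes: int, gpus_per_node: int) -> float:
    """Aggregate generations per hour across all GPUs in the cluster."""
    return per_gpu_per_hour * nodes * gpus_per_node


total_gpus = NODES * GPUS_PER_NODE
total_per_hour = cluster_throughput(PER_GPU_PER_HOUR, NODES, GPUS_PER_NODE)

print(f"GPUs: {total_gpus}, generations/hour across the cluster: {total_per_hour}")
```

Under these assumptions the four nodes together produce about 480 generations per hour, which is the quantity that was divided by 32 to give the per-H100 number.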

commented on Open R1: Update #2 about 6 hours ago

We only applied Llama verification to the default subset; samples from the extended subset that were rejected by Math Verify didn't go through a second verification step. We can release the unfiltered data with 400k problems if the community wants to apply different filtering.

updated a dataset about 23 hours ago

open-r1/OpenR1-Math-220k

Viewer • Updated about 5 hours ago • 225k • 260 • 121

liked a dataset about 23 hours ago

AI-MO/NuminaMath-1.5

Viewer • Updated 1 day ago • 896k • 146 • 63

upvoted an article about 23 hours ago

Article

Open R1: Update #2

By

and 6 others •

about 24 hours ago

• 107

published an article about 24 hours ago

Article

Open R1: Update #2

By

and 6 others •

about 24 hours ago

• 107

published a model about 24 hours ago

open-r1/OpenR1-Qwen-7B

Text Generation • Updated about 8 hours ago • 315 • 13

published a dataset about 24 hours ago

open-r1/OpenR1-Math-220k

Viewer • Updated about 5 hours ago • 225k • 260 • 121

updated a model 1 day ago

open-r1/OpenR1-Qwen-7B

Text Generation • Updated about 8 hours ago • 315 • 13

updated a collection 1 day ago

🧠 Reasoning datasets

Datasets with reasoning traces for math and code released by the community • 11 items • Updated about 7 hours ago • 48

liked a model 1 day ago

GAIR/LIMO

Updated 5 days ago • 382 • 23

published a dataset 2 days ago

loubnabnl/135M-examples

Updated 2 days ago • 4

updated a dataset 2 days ago

loubnabnl/135M-examples

Updated 2 days ago • 4

authored a paper 5 days ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 7 days ago • 153

updated a dataset 5 days ago

HuggingFaceTB/smol-smoltalk

Viewer • Updated 5 days ago • 485k • 712 • 28

updated 2 models 5 days ago

HuggingFaceTB/SmolLM2-135M

Text Generation • Updated 5 days ago • 200k • 55

HuggingFaceTB/SmolLM2-360M

Text Generation • Updated 5 days ago • 14.4k • 31