The model actually fits on two 8xH100 https://huggingface.co/blog/open-r1/update-1#synthetic-data-generation
And the 15 generations per hour per H100 is the throughput on four nodes divided by 32 GPUs (4 to avoid the cache filling up)
Loubna Ben Allal
loubnabnl
AI & ML interests
SmolLMs, ML for code, data
Recent Activity
new activity
about 5 hours ago
open-r1/OpenR1-Math-220k:Update README.md
new activity
about 5 hours ago
open-r1/OpenR1-Math-220k:mismatch between the schema of the data
commented on
an
article
about 6 hours ago
Open R1: Update #2
Organizations
loubnabnl's activity
Update README.md
1
#3 opened about 6 hours ago
by
davidberenstein1957
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1677141720071-634ff41ff32062e9eb7b06a3.jpeg)
mismatch between the schema of the data
2
#2 opened about 12 hours ago
by
ChengyiDu
![](https://cdn-avatars.huggingface.co/v1/production/uploads/61c141342aac764ce1654e43/81AwoT5IQ_Xdw0OVw7TKu.jpeg)
commented on
Open R1: Update #2
about 6 hours ago
![](https://cdn-avatars.huggingface.co/v1/production/uploads/61c141342aac764ce1654e43/81AwoT5IQ_Xdw0OVw7TKu.jpeg)
commented on
Open R1: Update #2
about 6 hours ago
We only applied Llama verification to the default
subset, those rejected by Math Verify from the extended
subset didn't go through a second verification step. We can release the unfiltered data with 400k problems if the community wants to do different filtering.
![](https://cdn-avatars.huggingface.co/v1/production/uploads/61c141342aac764ce1654e43/81AwoT5IQ_Xdw0OVw7TKu.jpeg)
upvoted
an
article
about 23 hours ago
Article
Open R1: Update #2
By
and 6 others
•
•
107![](https://cdn-avatars.huggingface.co/v1/production/uploads/61c141342aac764ce1654e43/81AwoT5IQ_Xdw0OVw7TKu.jpeg)
published
an
article
about 24 hours ago
Article
Open R1: Update #2
By
and 6 others
•
•
107