Collection of Distills using Open R1
asdf
ewre324
AI & ML interests
None yet
Recent Activity
liked
a Space
3 days ago
open-r1/open-r1-eval-leaderboard
upvoted
an
article
11 days ago
Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial
updated
a model
12 days ago
ewre324/ewre324-R1-Minueza-32M-Distill
Organizations
Collections
3
These models have been finetuned to perform reasoning, chain of thought.
-
ewre324/ewre324-Thinker-Llama-3.2-3B-Instruct-Reasoning
Updated • 267 -
ewre324/ewre324-Thinker-Qwen2.5-0.5B-Instruct-Reasoning
Updated • 24 -
ewre324/ewre324-Thinker-SmolLM2-135M-Instruct-Reasoning
Text Generation • Updated • 32 -
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Paper • 2201.11903 • Published • 10
models
8
ewre324/ewre324-R1-Minueza-32M-Distill
Updated
ewre324/ewre324-R1-SmolLM2-135M-Distill
Updated
•
19
ewre324/moondream2
Image-Text-to-Text
•
Updated
•
490
ewre324/ewre324-QwQ-0.5B-Distilled-SFT-Reason
Updated
•
10
ewre324/ewre324-Thinker-Llama-3.2-1B-Instruct-Reason
Updated
•
8
ewre324/ewre324-Thinker-Llama-3.2-3B-Instruct-Reasoning
Updated
•
267
ewre324/ewre324-Thinker-Qwen2.5-0.5B-Instruct-Reasoning
Updated
•
24
ewre324/ewre324-Thinker-SmolLM2-135M-Instruct-Reasoning
Text Generation
•
Updated
•
32