Elie Bakouch

eliebak

AI & ML interests

Training LLMs @ 🤗

Recent Activity

upvoted an article about 5 hours ago

From Llasa to Llasagna 🍕: Finetuning LLaSA to generates Italian speech and other languages

liked a dataset about 23 hours ago

open-r1/OpenR1-Math-220k

upvoted an article about 23 hours ago

Open R1: Update #2

eliebak's activity

upvoted an article about 5 hours ago

Article

From Llasa to Llasagna 🍕: Finetuning LLaSA to generates Italian speech and other languages

and 1 other •

about 9 hours ago

• 15

upvoted an article about 23 hours ago

Article

Open R1: Update #2

and 6 others •

1 day ago

• 115

upvoted a paper 2 days ago

On Teacher Hacking in Language Model Distillation

Paper • 2502.02671 • Published 7 days ago • 14

upvoted a paper 5 days ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 7 days ago • 153

upvoted a paper 7 days ago

The Surprising Agreement Between Convex Optimization Theory and Learning-Rate Scheduling for Large Model Training

Paper • 2501.18965 • Published 11 days ago • 6

upvoted 2 articles 7 days ago

Article

Open-source DeepResearch – Freeing our search agents

8 days ago

• 911

Article

DABStep: Data Agent Benchmark for Multi-step Reasoning

8 days ago

• 41

upvoted an article 9 days ago

Article

Open-R1: Update #1

and 7 others •

10 days ago

• 268

upvoted an article 11 days ago

Article

Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial

11 days ago

• 33

upvoted 2 articles 13 days ago

Article

Mastering Long Contexts in LLMs with KVPress

and 1 other •

19 days ago

• 62

Article

How biased is Whisper ? Evaluating Whisper Models for Robustness to Diverse English Accents

13 days ago

• 16

upvoted a paper 13 days ago

Exploring the sustainable scaling of AI dilemma: A projective study of corporations' AI environmental impacts

Paper • 2501.14334 • Published 18 days ago • 17

upvoted a paper 14 days ago

MinMo: A Multimodal Large Language Model for Seamless Voice Interaction

Paper • 2501.06282 • Published Jan 10 • 43

upvoted an article 14 days ago

Article

Welcome to Inference Providers on the Hub 🔥

15 days ago

• 321

upvoted an article 15 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

15 days ago

• 706

upvoted an article 22 days ago

Article

Yay! Organizations can now publish blog Articles

and 3 others •

22 days ago

• 33

upvoted an article 27 days ago

Article

MiniMax-01 is Now Open-Source: Scaling Lightning Attention for the AI Agent Era

27 days ago

• 40

upvoted a collection 29 days ago

SmolLM2

Collection

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated 5 days ago • 227

upvoted a paper about 1 month ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 255

upvoted a collection about 1 month ago

DolphinLabeled Datasets

Collection

Eric Hartford has added labels to help you filter datasets, for your pleasure. • 5 items • Updated Jan 6 • 12