Elie Bakouch

eliebak

AI & ML interests

Training LLMs @ 🤗

Recent Activity

upvoted an article about 5 hours ago

From Llasa to Llasagna 🍕: Finetuning LLaSA to generates Italian speech and other languages

liked a dataset about 24 hours ago

open-r1/OpenR1-Math-220k

upvoted an article about 24 hours ago

Open R1: Update #2

eliebak's activity

commented a paper 7 days ago

The Surprising Agreement Between Convex Optimization Theory and Learning-Rate Scheduling for Large Model Training

Paper • 2501.18965 • Published 11 days ago • 6

commented a paper 14 days ago

Parameters vs FLOPs: Scaling Laws for Optimal Sparsity for Mixture-of-Experts Language Models

Paper • 2501.12370 • Published 21 days ago • 10

New activity in open-r1/README 14 days ago

Recommend a dataset in the scientific domain made by us: EricLu/SCP-116K

#2 opened 14 days ago by

LLM Benchmarks and Data Leakage

#1 opened 15 days ago by

New activity in kyutai/helium-1-preview-2b 29 days ago

fix title

#2 opened 29 days ago by

New activity in reach-vb/2024-ai-timeline about 1 month ago

Update index.html

#5 opened about 1 month ago by

New activity in HuggingFaceTB/SmolLM2-1.7B-Instruct 2 months ago

Useless information

#20 opened 2 months ago by

New activity in HuggingFaceTB/SmolLM2-360M-Instruct 3 months ago

finetuning

#2 opened 3 months ago by

New activity in HuggingFaceTB/SmolLM2-1.7B-Instruct 3 months ago

Multi-language support

#4 opened 3 months ago by

Upload ONNX weights

#1 opened 3 months ago by

New activity in HuggingFaceTB/SmolLM2-360M-Instruct 3 months ago

Upload ONNX weights

#1 opened 3 months ago by

New activity in Zyphra/Zamba2-2.7B-instruct 4 months ago

fix ultrachat link in readme

#3 opened 4 months ago by

New activity in Zyphra/Zamba2-1.2B-instruct 4 months ago

fix ultrachat link in readme

#3 opened 4 months ago by

commented a paper 4 months ago

Old Optimizer, New Norm: An Anthology

Paper • 2409.20325 • Published Sep 30, 2024 • 3

New activity in HuggingFaceTB/SmolLM-135M 5 months ago

Model code for training from sractch

#6 opened 7 months ago by

New activity in HuggingFaceTB/smollm-corpus 5 months ago

Missing file in python_edu subset

#8 opened 5 months ago by

Fix missing file when downloading from s3

#7 opened 6 months ago by

New activity in HuggingFaceTB/SmolLM-135M 5 months ago

SmolLm and mergekit_moe: is lm_head missing ?

#14 opened 5 months ago by

Training time

#13 opened 6 months ago by

New activity in HuggingFaceTB/smollm-corpus 6 months ago

Fix missing file when loading python edu

#6 opened 6 months ago by