Leandro von Werra's picture

Leandro von Werra

lvwerra

AI & ML interests

NLP and RL

Recent Activity

Organizations

Hugging Face's profile picture Natural Language Processing with Transformers's profile picture BigScience Workshop's profile picture Spaces-explorers's profile picture Hugging Face Course's profile picture BigScience Catalogue Data's profile picture PubMed Central's profile picture BigScience Data's profile picture trl internal testing's profile picture evaluate's profile picture Data Days Zurich's profile picture HuggingFaceM4's profile picture Evaluate Metric's profile picture Evaluate Measurement's profile picture Evaluate Comparison's profile picture TRL's profile picture scikit-learn's profile picture CodeParrot's profile picture BigCode's profile picture CompVis's profile picture Hugging Face H4's profile picture Hugging Face OSS Metrics's profile picture BigBang's profile picture transfer-test-target's profile picture Sphere Fall 2022's profile picture CompVis Community's profile picture BigCode Data's profile picture Stack Overflow's profile picture Reading Group's profile picture Hugging Face Extreme-Scale's profile picture Need4Speed's profile picture Code Llama's profile picture Personal Coding Assistant's profile picture Hugging Face TB Research's profile picture Hugging Face Smol Cluster's profile picture Open LLM Leaderboard's profile picture gg-hf's profile picture Nanotron Research's profile picture Hugging Face SMOL's profile picture HuggingFaceFW's profile picture bigcode nvidia's profile picture hsramall's profile picture mlo-data-cleaning's profile picture HuggingFaceFW-Dev's profile picture StarCoder2 Data's profile picture Data Agents's profile picture CinePile collaboration's profile picture Hugging Face FineVideo's profile picture smol-explorers's profile picture swissai-hf-data's profile picture abcd4321's profile picture Hugging Face Science's profile picture eggs's profile picture LeMaterial's profile picture Open R1's profile picture

lvwerra's activity

published an article 8 days ago
view article
Article

DABStep: Data Agent Benchmark for Multi-step Reasoning

41
published an article 10 days ago
published an article 15 days ago
view article
Article

Open-R1: a fully open reproduction of DeepSeek-R1

705
published an article 2 months ago
view article
Article

LeMaterial: an open source initiative to accelerate materials discovery and research

39
published an article 4 months ago
view article
Article

CinePile 2.0 - making stronger datasets with adversarial refinement

13
published an article 5 months ago
view article
Article

FineVideo: behind the scenes

29
published an article 5 months ago
view article
Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

217
published an article 6 months ago
view article
Article

A failed experiment: Infini-Attention, and why we should keep trying?

57
published an article 7 months ago
view article
Article

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

227
published an article 8 months ago
view article
Article

BigCodeBench: Benchmarking Large Language Models on Solving Practical and Challenging Programming Tasks

43
published an article 10 months ago
view article
Article

StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation

76
published an article 10 months ago
view article
Article

Welcome Llama 3 - Meta's new open LLM

283
published an article 12 months ago
view article
Article

StarCoder2 and The Stack v2

8
published an article about 1 year ago
view article
Article

Constitutional AI with Open LLMs

13
published an article about 1 year ago
view article
Article

Preference Tuning LLMs with Direct Preference Optimization Methods

45
published an article about 1 year ago
view article
Article

Welcome Mixtral - a SOTA Mixture of Experts on Hugging Face

11
published an article over 1 year ago
view article
Article

The N Implementation Details of RLHF with PPO

36
published an article over 1 year ago
view article
Article

Finetune Stable Diffusion Models with DDPO via TRL

9
published an article over 1 year ago
view article
Article

Spread Your Wings: Falcon 180B is here

4
published an article over 1 year ago
view article
Article

Code Llama: Llama 2 learns to code

9