Clémentine Fourrier

clefourrier

AI & ML interests

None yet

Recent Activity

updated a dataset about 13 hours ago
gaia-benchmark/results_public
updated a dataset about 13 hours ago
gaia-benchmark/submissions_public

Organizations

Hugging Face, Long Range Graph Benchmark, Evaluation datasets, BigScience: LMs for Historical Texts, HuggingFaceBR4, Cohere For AI, Open Graph Benchmark, HuggingFaceGECLM, Huggingface Projects, Pretrained Graph Transformers, Graph Datasets, BigCode, Hugging Face H4, InternLM, Vectara, GAIA, Hugging Face Smol Cluster, plfe, Open LLM Leaderboard, Qwen, Secure Learning Lab, Open Life Science AI, LLM360, TTS Eval (OLD), Leaderboard Organization, Bias Leaderboard Development, hallucinations-leaderboard, Demo Leaderboard, Demo leaderboard with an integrated backend, gg-hf, Clinical & Biomedical ML Leaderboards, AIM-Harvard, Women on Hugging Face, LMLLO2, Lighthouz AI, Open Arabic LLM Leaderboard, mx-test, LeaderboardsOnTheHub, HuggingFaceFW, IBM Granite, HF-contamination-detection, TTS AGI, Leader Board Test Org, Social Post Explorers, hsramall, Open RL Leaderboard, The Fin AI, Open Hebrew LLM's Leaderboard, La Leaderboard, gg-tt, HuggingFaceEval, HP Inc., Novel Challenge, Open LLM Leaderboard Archive, LLHF, SLLHF, lbhf, Inception, nltpt, Lighteval testing org, CléMax, Hugging Face Science, test_org, Coordination Nationale pour l'IA, LeMaterial, open-llm-leaderboard-react, Prompt Leaderboard, wut?, Your Bench, leaderboard explorer, Open R1, SIMS

clefourrier's activity

published an article 1 day ago

The Open Arabic LLM Leaderboard 2

17
published an article 8 days ago

Open-source DeepResearch – Freeing our search agents

906
published an article about 1 month ago

CO₂ Emissions and Models Performance: Insights from the Open LLM Leaderboard

18
published an article 2 months ago

Rethinking LLM Evaluation with 3C3H: AraGen Benchmark and Leaderboard

31
published an article 3 months ago

Introduction to the Open Leaderboard for Japanese LLMs

32
published an article 3 months ago

Letting Large Models Debate: The First Multilingual LLM Debate Competition

30
published an article 3 months ago

Judge Arena: Benchmarking LLMs as Evaluators

54
published an article 4 months ago

Introducing the Open FinLLM Leaderboard

72
published an article 8 months ago

BigCodeBench: Benchmarking Large Language Models on Solving Practical and Challenging Programming Tasks

43
published an article 9 months ago

Falcon 2: An 11B parameter pretrained language model and VLM, trained on over 5000B tokens and 11 languages

25
published an article 9 months ago

CyberSecEval 2 - A Comprehensive Evaluation Framework for Cybersecurity Risks and Capabilities of Large Language Models

21
published an article 9 months ago

Introducing the Open Arabic LLM Leaderboard

79
published an article 9 months ago

Introducing the Open Leaderboard for Hebrew LLMs!

32
published an article 9 months ago

Bringing the Artificial Analysis LLM Performance Leaderboard to Hugging Face

13
published an article 10 months ago

Improving Prompt Consistency with Structured Generations

61
published an article 10 months ago

Introducing the Open Chain of Thought Leaderboard

30
published an article 10 months ago

The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare

131
published an article 10 months ago

Introducing the LiveCodeBench Leaderboard - Holistic and Contamination-Free Evaluation of Code LLMs

15
published an article 11 months ago

Introducing the Chatbot Guardrails Arena

4