Clémentine Fourrier

clefourrier

http://clefourrier.github.io

AI & ML interests

None yet

Recent Activity

updated a dataset about 13 hours ago

gaia-benchmark/results_public

updated a dataset about 13 hours ago

gaia-benchmark/submissions_public

published an article 1 day ago

The Open Arabic LLM Leaderboard 2

clefourrier's activity

published an article 1 day ago

Article

The Open Arabic LLM Leaderboard 2

1 day ago

• 17

published an article 8 days ago

Article

Open-source DeepResearch – Freeing our search agents

8 days ago

• 906

published an article about 1 month ago

Article

CO₂ Emissions and Models Performance: Insights from the Open LLM Leaderboard

Jan 9

• 18

published an article 2 months ago

Article

Rethinking LLM Evaluation with 3C3H: AraGen Benchmark and Leaderboard

Dec 4, 2024

• 31

published an article 3 months ago

Article

Introduction to the Open Leaderboard for Japanese LLMs

Nov 20, 2024

• 32

published an article 3 months ago

Article

Letting Large Models Debate: The First Multilingual LLM Debate Competition

Nov 20, 2024

• 30

published an article 3 months ago

Article

Judge Arena: Benchmarking LLMs as Evaluators

Nov 19, 2024

• 54

published an article 4 months ago

Article

Introducing the Open FinLLM Leaderboard

Oct 4, 2024

• 72

published an article 8 months ago

Article

BigCodeBench: Benchmarking Large Language Models on Solving Practical and Challenging Programming Tasks

Jun 18, 2024

• 43

published an article 9 months ago

Article

Falcon 2: An 11B parameter pretrained language model and VLM, trained on over 5000B tokens and 11 languages

May 24, 2024

• 25

published an article 9 months ago

Article

CyberSecEval 2 - A Comprehensive Evaluation Framework for Cybersecurity Risks and Capabilities of Large Language Models

May 24, 2024

• 21

published an article 9 months ago

Article

Introducing the Open Arabic LLM Leaderboard

May 14, 2024

• 79

published an article 9 months ago

Article

Let's talk about LLM evaluation

May 23, 2024

• 151

published an article 9 months ago

Article

Introducing the Open Leaderboard for Hebrew LLMs!

May 5, 2024

• 32

published an article 9 months ago

Article

Bringing the Artificial Analysis LLM Performance Leaderboard to Hugging Face

May 3, 2024

• 13

published an article 10 months ago

Article

Improving Prompt Consistency with Structured Generations

Apr 30, 2024

• 61

published an article 10 months ago

Article

Introducing the Open Chain of Thought Leaderboard

Apr 23, 2024

• 30

published an article 10 months ago

Article

The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare

Apr 19, 2024

• 131

published an article 10 months ago

Article

Introducing the LiveCodeBench Leaderboard - Holistic and Contamination-Free Evaluation of Code LLMs

Apr 16, 2024

• 15

published an article 11 months ago

Article

Introducing the Chatbot Guardrails Arena

Mar 21, 2024

• 4