Quentin Lhoest PRO

lhoestq

AI & ML interests

Maintainer of 🤗 Datasets: NLP, Multimodal data processing and sharing

Recent Activity

updated a Space about 2 hours ago
lhoestq/run-duckdb
updated a dataset about 2 hours ago
infinite-dataset-hub/LawFirmRevenueMatters
liked a dataset about 3 hours ago
simplescaling/s1K

Organizations

Hugging Face, WMT: Workshop on Statistical Machine Translation, BigScience Workshop, Neuropark, Hugging Face Internal Testing Organization, Training Transformers Together, BigScience Catalogue Data, OpenSLR, BigScience Data, Evaluation on the Hub, 2023 Jan Offsite hackathon, Datasets Maintainers, Whisper Distillation, Open LLM Leaderboard, huggingPartyParis, CommonCanvas, ZeroGPU Explorers, Datasets examples, Pixel Parsing, HuggingFaceFW-Dev, Infinite Dataset Hub, Hugging Face FineVideo, Dataset ReWriter, Dataset Tools, Rainforest Connection

lhoestq's activity

upvoted an article 7 days ago

Open-source DeepResearch – Freeing our search agents

• 906
upvoted an article 9 days ago
upvoted an article 13 days ago

Mastering Long Contexts in LLMs with KVPress

By nvidia and 1 other • 62
upvoted an article 14 days ago

Open-R1: a fully open reproduction of DeepSeek-R1

• 705
upvoted an article 18 days ago

Explore, Curate and Vector Search Any Hugging Face Dataset with Nomic Atlas

By MaxNomic and 4 others • 30
upvoted an article 19 days ago

Exploring Synthetic Data Generation with DataDreamer

By asoria • 6
upvoted an article about 1 month ago

Synthetic Data Generation with FastData and Hugging Face

By asoria • 14
upvoted 2 articles 2 months ago

Finding Moroccan Arabic (Darija) in Fineweb 2

By omarkamali and 3 others • 21

Bridging the Gap Between Physical Numerical Simulations and Machine Learning: Introducing The Well

By rubenohana • 17
upvoted an article 3 months ago