view article Article Explore, Curate and Vector Search Any Hugging Face Dataset with Nomic Atlas By MaxNomic and 4 others β’ 19 days ago β’ 30
view article Article Open Preference Dataset for Text-to-Image Generation by the π€ Community Dec 9, 2024 β’ 52
view article Article Letβs make a generation of amazing image generation models By burtenshaw and 4 others β’ Nov 26, 2024 β’ 34
view article Article Introducing Synthetic Data Workshop: Your Gateway to Easy Synthetic Dataset Creation By davanstrien β’ Jun 20, 2024 β’ 12
view article Article Synthetic dataset generation techniques: generating custom sentence similarity data By davanstrien β’ May 23, 2024 β’ 16
view article Article Synthetic dataset generation techniques: Self-Instruct By davanstrien β’ May 15, 2024 β’ 14
view article Article Can we create pedagogically valuable multi-turn synthetic datasets from Cosmopedia? By davanstrien β’ May 7, 2024 β’ 8
view article Article Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models Mar 20, 2024 β’ 75
view article Article Extracting Insights from Model Cards Using Open Large Language Models By davanstrien β’ Nov 27, 2023
view article Article Introducing IDEFICS: An Open Reproduction of State-of-the-art Visual Language Model Aug 22, 2023 β’ 29
view article Article Huggy Lingo: Using Machine Learning to Improve Language Metadata on the Hugging Face Hub Aug 2, 2023 β’ 1
view article Article The Hugging Face Hub for Galleries, Libraries, Archives and Museums Jun 12, 2023 β’ 1