Towards Best Practices for Open Datasets for LLM Training Paper • 2501.08365 • Published 28 days ago • 54
AllTheDocks road safety dataset: A cyclist's perspective and experience Paper • 2404.10528 • Published Apr 16, 2024
The AI Community Building the Future? A Quantitative Analysis of Development Activity on Hugging Face Hub Paper • 2405.13058 • Published May 20, 2024 • 1
Towards Openness Beyond Open Access: User Journeys through 3 Open AI Collaboratives Paper • 2301.08488 • Published Jan 20, 2023
Am I in The Stack? • Check if your GitHub repositories are in The Stack dataset