view article Article Fine-tune Deepseek-R1 with a Synthetic Reasoning Dataset By sdiazlor • 1 day ago • 21
Running on Zero 74 74 ColPali fine-tuning Query Generator 🔍 Generate retrieval queries from document images
Parallia/Fairly-Multilingual-ModernBERT-Embed-BE Sentence Similarity • Updated 28 days ago • 428 • 24
view article Article Introducing Synthetic Data Workshop: Your Gateway to Easy Synthetic Dataset Creation By davanstrien • Jun 20, 2024 • 12
view article Article Post-OCR-Correction: 1 billion words dataset of automated OCR correction by LLM By Pclanglais • Apr 26, 2024 • 16