SYNTHETIC-1 Collection A collection of tasks & verifiers for reasoning datasets • 5 items • Updated 5 days ago • 22
CondAmbigQA: A Benchmark and Dataset for Conditional Ambiguous Question Answering Paper • 2502.01523 • Published 8 days ago • 1
ScholaWrite: A Dataset of End-to-End Scholarly Writing Process Paper • 2502.02904 • Published 6 days ago • 1
🧠 Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 11 items • Updated about 7 hours ago • 48
Template-Driven LLM-Paraphrased Framework for Tabular Math Word Problem Generation Paper • 2412.15594 • Published Dec 20, 2024 • 1
MM-IQ: Benchmarking Human-Like Abstraction and Reasoning in Multimodal Models Paper • 2502.00698 • Published 9 days ago • 22
FENICE: Factuality Evaluation of summarization based on Natural language Inference and Claim Extraction Paper • 2403.02270 • Published Mar 4, 2024 • 3
ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning Paper • 2502.01100 • Published 8 days ago • 14
view article Article Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial By open-r1 • 11 days ago • 33
WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training Paper • 2501.18511 • Published 12 days ago • 17
WildChat-50m Collection All model responses associated with the WildChat-50m paper. • 55 items • Updated 13 days ago • 7
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 3 items • Updated 15 days ago • 336
Qwen2.5-1M Collection The long-context version of Qwen2.5, supporting 1M-token context lengths • 2 items • Updated 16 days ago • 99
view article Article Explore, Curate and Vector Search Any Hugging Face Dataset with Nomic Atlas By MaxNomic and 4 others • 19 days ago • 30